๐Ÿ“š
Docs
  • Welcome
  • Santhosh Thottingal
    • Coding
    • Software I use
    • Research Papers
    • Talks
    • Projects
    • In news
    • Ideas
    • Books
  • Malayalam Computing
    • Unicode
      • Syllable
      • Conjunct
      • Articles
    • Input methods
      • Inscript
      • Swanalekha
      • Handwriting Recognition
        • Procrustes Analysis
      • Proprietory Input Methods
      • What is a good input method?
      • Typewriter
    • Script Rendering
      • Orthography
      • Ya Ra Va Signs
      • U signs
    • Type Design
      • Color Fonts
      • Curves
      • Design Ideas
      • Manjari
        • Gallery
      • Chilanka
      • Gayathri
      • Customize Malayalam fonts in Linux
      • Articles
      • Tools
      • Type classification
        • Display typefaces
    • Spellcheck
      • History
      • Dictionary based approach
      • Nature of Malayalam spelling mistakes
      • Morphology analyser based approach
      • Tools and services
      • Links
    • Hyphenation
      • Web page
    • Typesetting
      • LaTeX
      • Scribus
      • PDF
      • XeTeX
      • Indesign
      • Markup languages
    • Speech Recognition
    • Speech Synthesis
      • Dhvani
    • Collation
    • Corpus
    • Morphology Analysis
      • Mlmorph
        • Snippets
      • Part of speech tagging
      • Morphology complexity
    • Named Entity Recognition
    • Numbers
      • Number spellout
      • Hindi
    • Machine Translation
      • Neural Machine Translation
    • Optical Character recognition
    • Transliteration
    • Digitization
    • NLP
      • Low resource languages
      • Natural Language Generation
    • Grammar analysis
      • Style checkers
    • Dictionary
      • Lexicon
    • Natural Language Understanding
    • Natural Language Generation
    • Swathanthra Malayalam Computing
    • Meta
      • Malayalam Sign Language
      • เดชเดฆเดจเดฟเตผเดฎเดฟเดคเดฟ
      • History
      • เดฒเดฟเดชเดฟเดชเดฐเดฟเดฃเดพเดฎเด‚ เดจเดฟเดฒเดšเตเดšเตเดชเต‹เดฏเต‹?
      • เดญเดพเดทเดพ เดชเด เดจเด‚
      • เดถเตเดฐเต‡เดทเตเด  เดญเดพเดท
      • Dictionary
    • Encyclopedia
    • Government
      • Script
      • เด•เต‡เดฐเดณ เดญเดพเดทเดพ เด‡เตปเดธเตเดฑเตเดฑเดฟเดฑเตเดฑเตเดฏเต‚เดŸเตเดŸเต
  • Academic Research
    • Knowledge Dissemination
    • Research papers
    • Reproducible Research
  • Arts
  • Books
  • Blockchain
  • Computer Science
    • Data, Information, Knowledge
    • Theory of computation
    • Compilers and Interpreters
    • Graphics
    • Data Visualization
    • Parsers
    • Data Structures & Algorithms
    • Finite State Transducer
  • Cyberspace
    • Digital Governance
    • เด•เต‡เดฐเดณเดคเตเดคเดฟเตฝ
    • Online Abuse
  • Databases
  • Education
    • Finite State Transducers
    • Digital Education
    • Digital Literacy
      • เดกเดฟเดœเดฟเดฑเตเดฑเตฝ เดธเดพเด•เตเดทเดฐเดคเดพ เดชเดฆเตเดงเดคเดฟ
      • Resources
    • Remote Learning
    • General Learning
  • Entertainment
  • Frontend technology
    • Colors
    • Design systems
    • CSS
    • PWA
    • SPA
    • Vue
  • Generative Graphics
    • Drawbot
    • Matrix Digital Rain
  • Hardware
  • Internet
    • Etiquettes
    • Privacy
    • IPFS
    • Resilience
    • Decentralization
    • Network debugging tools
  • Knowledge Representation
  • Languages & Scripts
    • Arabic
    • Vattezhuth
  • Life
    • Digital Minimalism
  • Linux
  • Machine learning
    • Neural Networks
    • Dialog systems, Information retrieval
    • Large Language Models
    • Embedding
    • ML in Production
    • Retrieval Augmented Generation
  • Mathematics
  • Music
  • Parenting
  • Politics
    • Hatred, Hinduthwa, Nationalism
  • Productivity
  • Problem Solving
  • Science
  • Software Libraries
  • Software Engneering
    • Architecture
    • Product Management
    • Docker
    • Programming
      • Javascript
    • People
    • Performance
    • Code Review
  • Web3
  • Web Typography
  • Writing
  • เดชเดพเดŸเตเดŸเตเด•เตพ
    • เด•เตเดŸเตเดŸเดฟเดชเตเดชเดพเดŸเตเดŸเตเด•เตพ
  • เดฎเดฒเดฏเดพเดณเด‚ เด…เดšเตเดšเดŸเดฟ
  • เด—เดตเต‡เดทเดฃเดชเตเดฐเดฌเดจเตเดงเด™เตเด™เตพ
Powered by GitBook
On this page
  1. Malayalam Computing

Digitization

PreviousTransliterationNextNLP

Last updated 4 years ago

Preserving old documents and books in Malayalam in digital formats is very important for the language. The most active effort in this front is lead by with little support from Government Institutes who are yet to recognize its importance.

Shiju Alex's efforts are well documented in . It also has links where the digitized documents are saved.

Digitization often starts with scanning the old documents as good quality images. The conversion of content in these documents to text is also part of digitization. It is mostly accomplished using a combination of Optical character recognition and manual transcription. did some efforts on this front for copyright expired documents.

Links

Shiju Alex
his website
archive.org
Malayalam Wikisource
EurekaKSSP Publications Online
Eureka Archive: 50 เดตเตผเดทเดคเตเดคเต† เดฏเตเดฑเต€เด•เตเด• เดธเต—เดœเดจเตเดฏเดฎเดพเดฏเดฟ เดตเดพเดฏเดฟเด•เตเด•เดพเด‚
http://keralasahityaakademi.org/online_library/keralasahityaakademi.org
Kerala Sahithya Academi Online library
Logo