Optical Character Recognition

Tesseract

Install tesseract and the Malayalam language pack
$ sudo apt install tesseract-ocr tesseract-ocr-mal
Make sure tesseract is available with Malayalam support
$ tesseract --list-langs
List of available languages (4):
Malayalam
eng
mal
osd
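If the OCR step runs from a script, the same check can be automated. The following is a minimal sketch; the function name has_lang is hypothetical and not part of tesseract. It assumes the language codes are printed one per line, as in the listing above.

```shell
# has_lang CODE: succeed if `tesseract --list-langs` lists CODE on its own line.
# (has_lang is an illustrative helper name, not a tesseract command.)
has_lang() {
    # Some tesseract versions print the list to stderr, so merge both streams.
    # -x matches whole lines only, so "mal" does not match "Malayalam".
    tesseract --list-langs 2>&1 | grep -qxF -- "$1"
}

# Example use:
#   has_lang mal || echo "Malayalam data missing: install tesseract-ocr-mal" >&2
```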
Run OCR on an image. This assumes testocr.png in your home directory is the image to recognize. The second argument is the output base name: tesseract appends .txt to it, so out produces out.txt
$ tesseract ~/testocr.png out -l mal
The out.txt file will contain the text recognized from the image
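To recognize several images at once, the single-image command can be wrapped in a loop. This is a rough sketch, not a definitive script: the function name ocr_dir and the choice of .png files are assumptions made for illustration. Because tesseract appends .txt to the output base name, each image NAME.png yields NAME.txt next to it.

```shell
# ocr_dir DIR [LANG]: run tesseract on every .png in DIR (LANG defaults to mal).
# (ocr_dir is an illustrative helper name, not a tesseract command.)
ocr_dir() {
    ocr_dir_path=$1
    ocr_dir_lang=${2:-mal}
    for img in "$ocr_dir_path"/*.png; do
        # If nothing matches, the pattern stays literal; skip it.
        [ -e "$img" ] || continue
        # Output base is the image path without .png; tesseract adds .txt.
        tesseract "$img" "${img%.png}" -l "$ocr_dir_lang"
    done
}

# Example use:
#   ocr_dir ~/scans mal
```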

Tesseract OCR in browser

Optical Character Recognition

Links

GitHub - harish2704/pottan-ocr: A stupid OCR for malayalam language