Optical Character Recognition
Install tesseract and the Malayalam language pack
$ sudo apt install tesseract-ocr tesseract-ocr-mal
Make sure tesseract is available with Malayalam support
$ tesseract --list-langs
List of available languages (4):
Malayalam
eng
mal
osd
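To check for the Malayalam model from a script rather than by eye, something like the following sketch could work. The function name has_tesseract_lang is a hypothetical helper, not part of tesseract; it only assumes the listing format shown above, with one language code per line.

```shell
# Hypothetical helper (not a tesseract feature).
# Reads the output of `tesseract --list-langs` on standard input and
# succeeds only if one line is exactly the language code given as $1.
has_tesseract_lang() {
    # -x: match the whole line, so "mal" does not match "Malayalam"
    # -F: treat the code as a fixed string, not a regular expression
    # -q: print nothing, report the result through the exit status
    grep -qxF -- "$1"
}
```

Usage, assuming tesseract is on the PATH. The 2>&1 is a precaution in case the installed release writes the list to standard error:
$ tesseract --list-langs 2>&1 | has_tesseract_lang mal && echo "Malayalam model found"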
Run OCR on an image. This assumes testocr.png is the image file to recognize. The second argument is the output base name, and tesseract appends .txt to it, so out produces out.txt
$ tesseract ~/testocr.png out -l mal
The out.txt file will contain the text recognized from the image
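To run the same step over several images, a small wrapper along these lines may help. This is a minimal sketch: ocr_output_base and ocr_malayalam are hypothetical names introduced here, and the sketch assumes each result should sit next to its image with the extension replaced by .txt.

```shell
# Hypothetical helpers, not part of tesseract itself.

# Print the output base for an image: same directory and file name,
# with the last extension removed (tesseract adds .txt on its own).
ocr_output_base() {
    local image="$1"
    local dir name
    dir="$(dirname -- "$image")"
    name="$(basename -- "$image")"
    # ${name%.*} strips the shortest trailing ".something";
    # a name without a dot is left unchanged
    printf '%s/%s\n' "$dir" "${name%.*}"
}

# Run Malayalam OCR on every image passed as an argument.
ocr_malayalam() {
    local image base
    for image in "$@"; do
        base="$(ocr_output_base "$image")"
        # Stop at the first failure so errors are not silently skipped
        tesseract "$image" "$base" -l mal || return 1
        echo "wrote ${base}.txt"
    done
}
```

Usage, assuming the functions above are saved in a file and sourced into the shell:
$ ocr_malayalam ~/testocr.png ~/scans/*.png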
