Optical Character recognition

Tesseract

Install tesseract and mlayalam language pack
1
$ apt install tesseract-ocr tesseract-ocr-mal
Copied!
Make sure tesseract is available with Malayalam support
1
$ tesseract --list-langs
2
List of available languages (4):
3
Malayalam
4
eng
5
mal
6
osd
Copied!
Run OCR on an image. Assuming you have testocr.png as the image file to OCR. The out.txt is your output file
1
$ tesseract ~/testocr.png out.txt -l mal
Copied!
The out.txt file will have the text recognized from image

Tesseract OCR in browser

Optical Character Recognition

Links

GitHub - harish2704/pottan-ocr: A stupid OCR for malayalam language
GitHub
Last modified 11mo ago