pytesser - Python OCR Library (Optical Character Recognition)

Today I was browsing Google for projects and came across a rather interesting one: pytesser. According to its developer, it recognizes text in an image. The library uses tesseract-ocr and PIL.

Example of use:

>>> from pytesser import *
>>> image = Image.open('fnord.tif') # Open image object using PIL
>>> print image_to_string(image) # Run tesseract.exe on image
fnord
>>> print image_file_to_string('fnord.tif')
fnord

A big drawback for me is that it can be used only on Windows, so I could not test it myself.
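Since the example above notes that pytesser simply runs tesseract.exe on the image, the same idea can be sketched on other platforms by calling the tesseract command-line tool directly. This is only a rough sketch and not pytesser's own code: it is written for Python 3, it assumes a `tesseract` executable is installed and on the PATH, and the helper names `build_tesseract_command` and `image_file_to_text` are made up for illustration.

```python
import os
import shutil
import subprocess
import tempfile


def build_tesseract_command(image_path, output_base, executable="tesseract"):
    """Build the argument list for `tesseract <image> <output_base>`.

    Hypothetical helper, not part of pytesser.
    """
    return [executable, image_path, output_base]


def image_file_to_text(image_path, executable="tesseract"):
    """Run the tesseract command-line tool on an image file and return its text.

    Hypothetical helper, not part of pytesser. Assumes tesseract is installed.
    """
    if shutil.which(executable) is None:
        raise RuntimeError("tesseract executable not found on PATH")

    with tempfile.TemporaryDirectory() as tmp_dir:
        # tesseract appends ".txt" to the output base name it is given.
        output_base = os.path.join(tmp_dir, "ocr_result")
        subprocess.run(
            build_tesseract_command(image_path, output_base, executable),
            check=True,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        with open(output_base + ".txt", encoding="utf-8") as result_file:
            return result_file.read().strip()
```

With tesseract installed, `print(image_file_to_text('fnord.tif'))` would be the rough counterpart of the `image_file_to_string` call shown earlier. Whether the result matches pytesser's output depends on the installed tesseract version and language data.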
The library ships with the following test image, which it should recognize (in theory, since the author put it there):

I would like to hear from Habr readers how it performs on other images.

Official page: code.google.com/p/pytesser

Source: https://habr.com/ru/post/80804/
