pytesser - Python OCR Library (Optical Character Recognition)
Today I roamed Google for projects and came across a rather interesting development - pytesser. According to the developer to recognize the text in the image. The library uses tesseract-ocr and PIL.
Example of use:
>>> from pytesser import * >>> image = Image.open('fnord.tif') # Open image object using PIL >>> print image_to_string(image) # Run tesseract.exe on image fnord >>> print image_file_to_string('fnord.tif') fnord
A big minus for me is the ability to use only on Windows. Therefore, I could not test this development. ')
Included with the test library are the following image, which it should (in theory, once the author placed it there) recognize:
I would like to hear from the habra people the test results on other images.