
Taming the Shrewish (Not Really) FineReader


After a short story about how ABBYY FineReader works (the “theoretical part”), it is time to put that knowledge into practice. And yes, there are no cats under the cut: everything is very serious.

How the user can take part in document processing


So as not to reinvent the wheel, I will start with a simple and clear diagram from the Help (see the figure on the right).

Now that we know the full list of operations, let's look at examples of what can go wrong and how to deal with it.

Only good images are recognized well


What if you have images, but not very good ones? Improve whatever you can right in FineReader, and if that does not help, try to obtain the image again with the problem eliminated. The topic is very broad, so if there is enough interest I will write a separate post on how to get along with the automatic and manual image processing tools built into FineReader. For now I will just note that an image is processed better if it:

Document / Project Setup Phase


You can, and should, specify right away the language of the text, the image preprocessing parameters, and some of the analysis and recognition parameters. Here is a screenshot of one of the tabs in the settings dialog.

These and other settings are described in detail in the Help.

Analysis stage


The program automatically detects areas of different types for recognition. At this stage we can either mark out the areas ourselves or correct, where necessary, the ones found by the Analysis module.

So as not to write too much about the tools for working with areas, I refer you to the relevant Help section. Here I will explain what each thing is for, what counts as good and bad (as applied to areas), and how to fix a bad result.

Purpose of the different area types


Several types of areas are available in the FineReader user interface. Each type has its own set of options in the hideable properties panel (at the bottom of the Image window) and in the context menu (opened by right-clicking):



Important Considerations




How closely spaced or intersecting areas interact


The following rules matter both for handling areas correctly in the program's interface and for understanding what will happen to them in the recognition and saving results.
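To make "closely spaced or intersecting" concrete, here is a minimal Python sketch that flags such pairs among rectangular areas. It is only an illustration, not how FineReader itself handles areas: the `Area` class, the pixel coordinates, and the `min_gap` threshold are all assumptions made up for this example.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Area:
    """Hypothetical rectangular area in image pixel coordinates."""
    name: str
    left: int
    top: int
    right: int
    bottom: int


def intersects(a: Area, b: Area) -> bool:
    """True if the two rectangles share some interior region."""
    return (a.left < b.right and b.left < a.right
            and a.top < b.bottom and b.top < a.bottom)


def gap(a: Area, b: Area) -> int:
    """Largest per-axis separation between the rectangles (0 if they touch or overlap)."""
    dx = max(a.left - b.right, b.left - a.right, 0)
    dy = max(a.top - b.bottom, b.top - a.bottom, 0)
    return max(dx, dy)


def suspicious_pairs(areas, min_gap=10):
    """List pairs of areas that intersect or lie closer than min_gap pixels.

    min_gap is an arbitrary illustrative threshold, not a FineReader setting.
    """
    result = []
    for a, b in combinations(areas, 2):
        if intersects(a, b):
            result.append((a.name, b.name, "intersect"))
        elif gap(a, b) < min_gap:
            result.append((a.name, b.name, "close"))
    return result


areas = [
    Area("text1", 0, 0, 100, 50),
    Area("picture1", 90, 40, 200, 120),
    Area("text2", 0, 55, 100, 100),
    Area("table1", 300, 300, 400, 400),
]
pairs = suspicious_pairs(areas)
for first, second, kind in pairs:
    print(f"{first} / {second}: {kind}")
```

A check like this only tells you which areas deserve a second look in the Image window; what actually happens to them is decided by the program's own rules.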

Source: https://habr.com/ru/post/240361/

