📜 ⬆️ ⬇️

Automatic captcha input - the theory and practice of conquering the Internet

In 2011, the 75th anniversary of the term “spam” was marked by the introduction of a captcha 200 million times DAILY!



All these entries are the result of the struggle of site administrators against spam bots.
')
Automating the process of captcha recognition for many people actively doing business on the Internet is a pressing issue. You can treat such businessmen and professionals as "bad and annoying spammers." However, it is not possible to stop the spam posting process, at least in the foreseeable future.

Link marketing here fully and uniquely combines the solution of the tasks of promotion, increasing the reputation of the site being promoted in the eyes of search engines. This happens for the simple reason that every link to a site (including from a spam post) increases its position in the output of Google, Yandex, etc. Therefore, such a method of “killing two birds with one stone” is advantageous initially. And a significant part of Internet businessmen should not struggle with spam posting, but try to use it for their own purposes.

So, the relevance of solving the “bypass captcha” problem is beyond doubt.




The task is automatically solved when carrying out companies in manual mode, by hiring hundreds of posters. But to talk about the effectiveness of this method, if not today, then tomorrow will not have to. Yes, the problem of captcha input for the customer here is really not relevant. But organizational, time and financial costs with this method of action do not stand up to serious criticism.

Therefore, specialized software products, automatic posters, have been actively developing for several years already. Some of them are well known in the market (the same XRumer), some - developed and used only within certain firms. In the case of using an automatic poster, the solution to the problem “how to get around a captcha” is possible in two ways:



Manual input



We note immediately that manual input is unacceptable with serious posting volumes.

Recognizing captcha today can be entrusted to special services (for example, antigate). Issue price - $ 1-2.5 per thousand recognition. The disadvantages of this method include:



A positive feature for this kind of services is independence from the type of captcha, since the recognition is carried out by a real human operator.

Software recognition



Today, a complex theoretical and practical problem is the development of artificial systems for recognizing graphic images. Optical character recognition, as applied to captchas, is not so simple a task as the recognition of scanned or handwritten text, because the captcha developers impose such effects on the characters so that software recognition becomes impossible.

However, despite this, the creation of captcha recognition programs is our specialty. Of course, this activity can be treated differently, including, and negatively, but we understand that thanks to our programs, comments are automatically posted to blogs, SMS messages are sent out, mail accounts are registered for subsequent spam. But what we do can be compared to selling knives - you can cut bread with a knife, or you can kill someone ... Is the manufacturer or seller of the knife guilty in this case? .. Everyone has his own opinion ...

There is no universal software for recognizing any type of captcha. Therefore, the software of automatic posters is consistently supplemented with modules for recognizing its necessary varieties. The development of such software is carried out by individual teams, for example, we www.captcha-lab.org . In our portfolio, a demo program for captcha input is not presented for one type. Of particular interest are the development team for the CMS Bitrix captcha (officially - 1C-Bitrix). This CMS is not just popular in Russia, but takes the first place among paid circulation "engines". Naturally, the "breaking" of the captcha Beatrix
He was interested and interested in many specialists. In 2006, there was even a successful attempt to perform such an “operation”. However, then the CMS Bitrix developers changed the type of captcha, and so far it has remained invulnerable. As the demo programs from www.captcha-lab.org demonstrate , this problem has now been solved with rather high rates - 64% and 60% for different versions of Beatrix. Do you think these figures are high enough? Indeed, other types of captcha software released by our team of programmers are recognized with a probability of up to 90%. There really is no limit to perfection. But these figures are high enough for work. Note that the use of a captcha service also ensures correct recognition only in 80-95% of cases.


Fig. 1 - Recognizing the old version of the CMS Bitrix captcha



Fig. 2 - Recognizing the new version of the CMS Bitrix captcha


How much will the development of a captcha program cost? $ 100-500, depending on its type, complexity. Note that this is a one-time waste. Thus, unlike captcha services, automatic recognition allows you to seriously win the issue price. In addition, a significant time gain is also ensured: software rarely takes more than a second to recognize.

We remind you that you can see all the “made” by us captcha on our website in the portfolio section .

Source: https://habr.com/ru/post/153413/


All Articles