
Habrastatistika

Having nothing better to do, I decided to collect some statistics on Habr.
The statistics may not fully reflect reality: of the 199k articles, only 75k (roughly 38%) were accessible. The rest were deleted, hidden in drafts, or had access to the publication closed.
It can be assumed that most of the missing articles were downvoted.

Here is the data itself:

Average number of posts per year:

Average number of comments per post by year:

All-time average: 46.

Average post rating by year:

Average all-time rating: 31.


How the data was collected
I used 3 self-written programs (C#), download here:
habrDownloader - downloads all the topics (the raw, uncleaned HTML pages came to 16 GB)
habrExtracter - parses the downloaded pages and converts them into a clean form
habrAnaliser - analyzes the extracted information and compiles the statistics
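
For illustration, the analysis step could look roughly like the sketch below. It is written in Python, not the C# of the original tools, and it is not the author's code: the `Post` record, the function names, and the sample values are all hypothetical. It assumes each parsed post has already been reduced to a year, a comment count, and a rating.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Post:
    # Hypothetical record: one parsed post reduced to the fields needed here.
    year: int
    comments: int
    rating: int


def averages_by_year(posts):
    """Return {year: (post_count, avg_comments, avg_rating)}, sorted by year."""
    buckets = defaultdict(list)
    for post in posts:
        buckets[post.year].append(post)

    result = {}
    for year in sorted(buckets):
        group = buckets[year]
        n = len(group)
        avg_comments = sum(p.comments for p in group) / n
        avg_rating = sum(p.rating for p in group) / n
        result[year] = (n, avg_comments, avg_rating)
    return result


def overall_averages(posts):
    """Return (avg_comments, avg_rating) over all posts, or zeros if empty."""
    n = len(posts)
    if n == 0:
        return (0.0, 0.0)
    return (
        sum(p.comments for p in posts) / n,
        sum(p.rating for p in posts) / n,
    )


# Made-up sample values, NOT taken from the real Habr data.
SAMPLE_POSTS = [
    Post(year=2012, comments=10, rating=20),
    Post(year=2012, comments=30, rating=40),
    Post(year=2013, comments=50, rating=6),
]

if __name__ == "__main__":
    for year, (count, avg_c, avg_r) in averages_by_year(SAMPLE_POSTS).items():
        print(f"{year}: posts={count} avg_comments={avg_c:.1f} avg_rating={avg_r:.1f}")
    print("all time:", overall_averages(SAMPLE_POSTS))
```

Note that the all-time averages here are computed over every post, not as a mean of the yearly averages, so years with more posts carry more weight.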

In the end, the last of these produced this:

Source: https://habr.com/ru/post/199400/

