It's no secret that more data does not mean better data. More often the opposite happens, and there are plenty of examples of data being published in a terrifying form.
For example, there is the Rosoboronzakaz data, which I wrote about earlier (habrahabr.ru/company/infoculture/blog/201260) and where you can see this for yourself.
The Ministry of Internal Affairs of Russia publishes its data in a similar way. Here is one such dataset: mvd.ru/opendata/od1
And there is plenty of other data like this: CSV files built from files unsuitable for analysis in Excel, data published without any description of its schema, invalid XML files, and much more.
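To make problems such as invalid XML and broken CSV files a little more concrete, here is a minimal sketch of how they could be detected automatically. It assumes Python and only its standard library; the function names `xml_is_well_formed` and `csv_ragged_rows` are hypothetical and are not part of any project mentioned in this post.

```python
import csv
import io
import xml.etree.ElementTree as ET


def xml_is_well_formed(text: str) -> bool:
    """Return True if the text parses as well-formed XML."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


def csv_ragged_rows(text: str, delimiter: str = ",") -> list[int]:
    """Return 1-based numbers of rows whose field count differs from the header row."""
    rows = list(csv.reader(io.StringIO(text), delimiter=delimiter))
    if not rows:
        return []
    expected = len(rows[0])  # the first row is assumed to be the header
    return [n for n, row in enumerate(rows[1:], start=2) if len(row) != expected]
```

Checks like these only catch structural damage. They say nothing about a missing schema description or about whether the values themselves make sense.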
To gather such cases in one place, colleagues from the Open Knowledge Foundation have started collecting examples.

Here is the project site: okfnlabs.org/bad-data
And here is its section on GitHub, where the examples are collected: github.com/okfn/bad-data/issues?page=1&state=open
Of these, 5 are being worked on and 2 are already registered.
Of course, what has been described so far does not look very complicated: incorrect formatting, or tables mixed with pictures in PDF files. But there will be more.
Let's help our colleagues. If you find any published data that is impossible to work with, I suggest posting it in the comments to this post and, for those who are ready to join in, also sharing it with the colleagues at OKF.