
Researcher Kavel Litaru of the University of Djordtown has compiled a
catalog of 250 million events that have taken place around the world since 1979. It is updated daily and is available to anyone who wants to study it.
Each record has 58 attributes, and the catalog as a whole is divided into 300 categories. It currently takes up 100 GB and is hosted by Google.
To analyze the data, users can download the entire catalog or just a category of interest, or query it with
Google BigQuery directly on the site.
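For readers who take the download route, a first pass over one category might look like the sketch below. It is only an illustration: the article does not give the file format or the names of the 58 attributes, so the tab-separated layout, the column names (event_id, date, category, country), and the sample rows here are all hypothetical stand-ins.

```python
import csv
import io
from collections import Counter

# Hypothetical sample data. The real catalog's delimiter and column names
# are not stated in the article; these are assumptions for illustration.
SAMPLE = (
    "event_id\tdate\tcategory\tcountry\n"
    "1\t1979-01-01\tprotest\tAA\n"
    "2\t1979-01-02\tnegotiation\tBB\n"
    "3\t1979-01-02\tprotest\tAA\n"
)


def count_by_column(lines, column="category"):
    """Count how many records share each value of the given column."""
    reader = csv.DictReader(lines, delimiter="\t")
    return Counter(row[column] for row in reader)


# In practice `lines` would be an open file handle on a downloaded export;
# an in-memory buffer keeps the sketch self-contained.
counts = count_by_column(io.StringIO(SAMPLE))
for name, n in counts.most_common():
    print(name, n)
```

Reading the file row by row rather than loading it whole keeps memory use flat, which matters for a catalog of this size.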
The database is updated automatically from many news sources around the world. Each item is processed with text mining and geocoding algorithms created by Litaru and then entered into the database. The author also notes that, thanks to recent advances in natural language processing, the share of non-English-language sources will soon increase.
Via GigaOM