Digest of articles on data analysis No. 2 (05/26/2014 - 06/08/2014)
Good afternoon, dear readers. Here is a digest of news and useful materials from the world of data analysis. The previous digest was very popular, so I decided to make them regular. New selections will appear once every two weeks.
In today's compilation, you will learn what statistics and data analysis have in common, how to identify a false correlation, and which algorithms rule the modern world. You will also find short cheat sheets on machine learning methods and NoSQL databases, and plenty of other interesting material.
Theory
List of NoSQL Databases (EN) A complete list of NoSQL databases, broken down by category, with a brief description of each.
False Correlations in Big Data Another article on how to tell false correlations from true ones. It considers 6 types of correlations.
Evolution in data analysis (EN) The article highlights the main milestones in developing as a data analyst. In my opinion, it will be useful to those who have recently started doing data analysis but already have a basic understanding of it.
3 trends in data warehouses for observation (EN) The article highlights the following trends: pulling data from various sources, giving analysts direct access to the data, and the speed of working with it.
Building a team of data analysts The article shows which specialists belong on the team and why. Essentially, it comes down to 3 people: a customer service specialist, a context analyst, and a visualizer.
Life data analytics in small countries The article describes a number of difficulties of working as an analyst in small countries, such as Belgium, Switzerland, etc., along with ways to solve them.
The Graphviz Cookbook (EN) Collection of recipes for using Graphviz to visualize data.
10 tips for analysts Experts in analytics share tips that will help you in your own analysis.
Examining statistics with IPython Notebook A selection of IPython notebooks demonstrating the basic techniques of data analysis, all presented in the form of a textbook. I highly recommend it for beginners learning data analysis with Python.
Prediction of the world champion 2014 (EN) The article describes the concept behind the prediction and links to a description of the methodology (which, by the way, is very beautifully visualized).
Chess Evolution Study The article examines how chess openings have changed from 1850 to the present day.
Why banks are still against “big data” The article answers a series of questions from various bankers on how to apply “big data” in their business.