22 Data Science Tips This article collects 22 data science tips from Vincent Granville, a renowned data analyst and creator of the Data Science Central portal.
Data model flexibility A short reflection on flexibility as an important property of a data model.
Open issues on Facebook data handling An article from Facebook's blog describing various unsolved problems the company faces in working with data.
Hello world machine learning Another excellent article from the author of the MachineLearningMastery blog. It is aimed at beginners and helps make sense of the huge number of algorithms in machine learning.
Clustering and distributed computing model An overview of various clustering methods and of the possibility of using a distributed computing model when working with data from clustering algorithms.
Analysis of R code coverage by unit tests An interesting article on analyzing unit-test code coverage in the R programming language using the testCoverage library.
Flafka: Apache Flume and Apache Kafka for event handling These reviews have already linked to Apache Kafka material several times; this one is an interesting article from Cloudera's blog about using Apache Kafka together with Apache Flume to handle events.
NoSQL in the world of Hadoop An interesting article from the Cloudera blog about the place NoSQL occupies in the world of Hadoop.