What to do during model training runs - some interesting ideas from the author of the blog MachineLearningMastery about what to do during pauses that may arise during the launch of the model training processes in machine learning tasks.
Example of hierarchical clustering - a visualized example of hierarchical clustering, created using the R programming language and the Shiny visualization library.
Apache Spark - SDK for all Big Data platforms - an interesting report on Apache Spark. In this speech, Pat McDonough talks about the development of Apache Spark and the possibility of using this product in the field of data processing and analysis.
DataFrame in Apache Spark for scaling Data Science tasks - video from a recent mitap in addition to the news that Apache Spark 1.3 will have a new opportunity to use DataFrame. Actually in this video, Reynold Xin will talk about this new functionality in Apache Spark.
Recent Apache Spark - Reynold Xin Performance Improvements provides an overview of recent significant Apache Spark performance improvements.
Announcement of DataFrame in Apache Spark - Apache Spark version 1.3 will be able to use DataFrame, this article will tell you about the details of implementing and using DataFrame in Apache Spark.