📜 ⬆️ ⬇️

Overview of the two courses “Machine Learning” from Coursera

I want to share my learning experience on the resource “Coursera”, namely, the development of the courses “Machine Learning Foundations: A Case Study Approach” and “Machine Learning: Regression” . These courses are part of the specialty “Machine Learning” (University of Washington).

Machine learning is not related to my current specialty. Interest in him was due to the desire to get acquainted with what is now paid a lot of attention. In my university days (2003-2010), this topic was not touched upon, so machine learning and big data are an unknown area for me. I would like to build in my head an idea about this topic and be able to solve simple tasks in order to delve into something concrete as necessary.

There were several reasons for choosing the Coursera portal and this particular course. First, reading articles on disparate topics about a little-known subject is not useful, since knowledge is not systematized. Therefore, there is a need for a built course. Secondly, there was a negative experience of listening to lectures, where the authors for a very long time tried to explain the obvious things, without actually getting to the point. What attracted me to the Machine Learning course is that the lecturers Carlos Guestrin and Emily Fox look extremely passionate about their subject matter (passioned & excited), speak quickly and to the point. And besides, it is noticeable that the authors deal with practical application, i.e. with industry.

According to the authors of the course, the reason for its creation was an attempt to convey the tasks of machine learning to a wide audience, i.e. for those whose training took place in different areas. The main differences include the fact that first focuses on specific tasks that can be found in existing applications, and how machine learning can help solve them. Then the methods used are analyzed, how they are arranged and how they can be useful. Thus, it can be seen in simple examples of how machine learning can be applied in practice. Moreover, nowadays, according to the authors, the consequences of using machine learning are noticeable. Previously, it was perceived differently. A certain set of data was fed to the input of a poorly understood algorithm, with the result that the conclusion “my graph is better than yours” was made, and the results were sent to a scientific journal.
')
Classes are divided into theoretical and practical part, and tests. In the theoretical part of the lecture (in English, English or Spanish subtitles). There are pdf presentations on which you can prepare for tests. Also on the forum are links to additional literature. In the 1st course of “Machine Learning Foundations: A Case Study Approach” there are lectures where you learn how to work in the IPython interactive shell. Here they talk about the basics of programming in Python (just everything you need to be able to perform tasks). In addition, there are lectures, where they talk about the principles of working with the library GraphLab Create. Tests are divided into theoretical and practical. Questions in theoretical tests require understanding, superficially listening to the material and successfully passing the test is unlikely to turn out. Sometimes lectures are not enough and you have to use additional materials. It is worth noting that here in one lesson you can yourself demonstrate with the help of assignments the main theoretical points.

The practical part is a test with tasks. The execution of tasks involves the ability to process a large set of data, as well as to perform operations on them. The authors recommend using the GraphLab Create library, which has an API in the Python language. With it, you can load data files from files into convenient structures (SFrame). These structures allow you to visualize data (special interactive graphs) and conveniently modify them (add columns, apply operations on rows, etc.). The library has machine learning algorithms with which to work. To perform tasks, you can use the template implemented in the IPython Notebook web shell. This is the file that stores the framework functions, as well as recommendations. For local work with GraphLab Create and IPython Notebook, the authors recommend using the Anaconda installer. You can also work on the Amazon EC2 web service, where all the necessary programs are already installed. I have chosen the second option, since you can immediately get down to work.

Now it’s worth talking about the course plan. The first course in Machine Learning Foundations: A Case Study Approach is introductory. Lectures of the first week are devoted to the description of the language Python, the library GraphLab Create. The authors also briefly talk about the content of other specialization courses. This is very useful, because the designated action plan does not forget which way you are moving and what you should be able to do at the end of the training. The remaining weeks contain an introduction to topics that will be covered in detail in future courses. What is given in these introductions requires a good understanding, you also need to be able to use algorithms in practical tasks. It should be noted that these tasks clearly demonstrate the read theory. Below is a course plan for Machine Learning Foundations: A Case Study Approach.


Course structure
Figure 1. Machine Learning specialization structure (taken from the materials of the Machine Learning: Regression course, © 2015 Emily Fox & Carlos Guestrin)

To complete the second course of “Machine Learning: Regression” specialization, you need to have an idea about derivatives, matrices, vectors and basic operations on them. The ability to create at least simple Python programs will be useful. A brief description of the second course of specialization “Machine Learning: Regression” is given below.


Machine learning. Regression
Figure 2. Topics studied in the Machine Learning: Regression course (Taken from the materials of the Machine Learning: Regression course, © 2015 Emily Fox & Carlos Guestrin)

Weekly load is adequate. However, the second course of “Machine Learning: Regression” is more intense. If you are more than two weeks late, you will be asked to switch to another session, but this is not necessary. I listened to lectures during the working week, on Friday or on weekends practical tasks were performed. I spent about three hours on them.

In conclusion, I would like to say that the described courses of the specialty “Machine Learning” made a good impression. The advantages I take practical tasks, they are carefully thought out and illustrate the theory. I liked the lectures, which are capacious and in which there is no "water". The courses are structured, there are diagrams that help you understand what “part” of machine learning you are currently studying, what you know and what you have to learn. The disadvantages are that sometimes there is not enough read theory. I would like more links to other resources, although the official forum lists recommended books, which can also be downloaded there. In general, the Machine Learning specialization courses will be very useful if you want to learn how to practically use machine learning methods.

Source: https://habr.com/ru/post/307048/


All Articles