📜 ⬆️ ⬇️

Mastering Computer Vision - 8 Basic Steps

Hello reader.

For you, the fact that everyone has tried aging masks through the Face App is no longer news. In turn, for computer vision there are tasks more interesting than this. Below I will present 8 steps that will help you master the principles of computer vision.

image
')
Before starting with the steps, let's understand what tasks you and I can solve with the help of computer vision. Examples of tasks may include the following:


The minimum knowledge required to master computer vision



So, now let's get down to the steps.

Step 1 - Basic Imaging Techniques


This step is about the technical basics.

Check out this great YouTube playlist, The Ancient Secrets of Computer Vision, by Joseph Redmon.





Read the third chapter of Richard Sheliski's book Computer Vision: Algorithms and Applications.

image

Build knowledge - try yourself in image conversion using OpenCV. The site has many step-by-step electronic manuals , guided by which you can figure out everything.

Step 2 - Motion Tracking and Optical Flow Analysis


An optical stream is a sequence of images of objects obtained by moving an observer or objects relative to a scene.

Take the course - a computer vision course at Udacity , especially lesson 6.
Take a look - the 8th video on the YouTube list and a lecture on optical flow and tracking.


Read - sections 10.5 and 8.4 of the Sheliski textbook.

image

As a training project, figure out how to use OpenCV to track an object in a video frame.

Step 3 - Basic Segmentation


In computer vision, segmentation is the process of dividing a digital image into several segments (super pixels). The purpose of segmentation is to simplify and / or change the presentation of an image so that it is easier and easier to analyze.

So, the Hough transform allows you to find circles and lines.

Check out these videos:





Check out - an excellent project whose similar tasks are extremely important for the computer vision of self-driving electric vehicles.



Step 4 - Fitting


Different data require a specific fitting approach and their own algorithms.

Watch the video:


Read - sections 4.3.2 and 5.1.1 of the Sheliski textbook.

As a task for independent work, analyze the problem of determining the coordinates of the place where the lines converge on the perspective horizon.

Step 5 - Combining images obtained from different points of inspection


Check out the Youtube playlist


Read the cover letter.

For the project, you can take your own data. For example, take a picture of furniture from different sides and make a 3D object in OpenCV from an album of flat images.

Step 6 - 3D Scenes


Being able to create 3D objects from flat images, you can try to create three-dimensional reality.

Take - stereo vision and tracking course

Watch the video:




As a project, try to reconstruct the scene or track an object in three-dimensional space .

Step 7 - Object Recognition and Image Classification


It’s convenient to use TensorFlow as a framework for deep learning. This is one of the most popular frameworks, so you can easily find enough examples. To get started with images in TensorFlow, go through this tutorial .

Further, using links, consider the following topics:


As a project, create a neural network in TensorFlow that determines the brand of a car or a breed of dog from an image .

Step 8 - Modern Deep Learning


Read - Stanford Course Lectures

Watch the video:




On this our steps in the study of computer vision came to an end. I hope you learned something new for yourself. As is customary on Habré, I liked the post - put a plus. Do not forget to share with colleagues. Also, if you have something that you can share yourself - write in the comments. More information about machine learning and Data Science on Habré and in the telegram channel Neuron (@neurondata).

All knowledge!

Source: https://habr.com/ru/post/461365/


All Articles