Demystifying Machine Learning

#machinelearning #ai #technology

Even though a quick google search can give you plenty of “answers” for this question, it appears that they have actually not been satisfactory enough. The fact that you are reading this post should be enough to back my claim but if you are still not convinced, take a look at the google trends report for the term “Machine Learning”.

So what is Machine Learning? Let’s find out!

Before you go any further, I should warn you that this post is intended for people with little or no idea about the topic and thus has a super-simplified view of what Machine learning is.

To put it in Layman’s terms— Machine Learning is the ability to let machine form an equation to correctly solve any problem with little or no human intervention.

With Machine Learning, instead of programming the machine to do something, we let the machine learn to find a solution to the problem by itself.

“It sounds cool, but how is that it even possible.” — Your Brain

Hold your horses, we will get to that question in a minute. Before that, let’s understand how is it different than the “usual” way of solving a problem.

To understand this analogy better, let’s take a simple real-world example of — “Classifying shapes”.

Problem:

Given an image of a particular shape, let’s find out if the image is for a Square or Triangle.

Non Machine Learning Approach

The pseudo-code (fancy term programmers use to define “steps”) for that would look like:

Take the input Image
Find the number of corners in the image. This operation would basically include: (Don’t worry if you don’t understand all the terms below, neither do I)
- Convert the image from RGB (i.e. Colored Image) to Grayscale (i.e. B/w)
- Smooth-en out the image
- Use thresholding to convert the image to Bi-Level Binary Image
- Use an edge-detector algorithm to get the list of edges using pixel gradients
- Use contours to get the list of corners in the image
Write an if-else block to return the shape of the object based on the number of corners returned

Here’s a small visualization to help you better visualize what the above steps might look like:

Huh! You must be thinking — That seems a lot of work for such a trivial task. You are not entirely wrong but this has been the general approach to solve such problems until a few years back. Even though this approach “worked”, it had a lot of issues. For starters, it only worked well with perfectly captured images i.e. pictures taken under proper light and clear background which means that it will have a high percentage of failure in real-world scenarios.

Machine Learning Approach to the rescue

Let’s see what the same would look like with Machine Learning:

Gather some images for different shapes — squares and triangles.
Feed it to a ML Algorithm (more technical term would be a Convolutional Neural Network)
Run the image to be tested on the above trained model
Sit back and relax!

Now you must be thinking “Ok. That seems to be awfully easy. But how does that even work?” Let me answer that question with a question of my own — How would you teach a child a difference between a square and a triangle?

You won’t start by telling him about the number of corners or that the angles between them should be 90; Rather you would show him some images and help him relate that one of them is called a square and other is called a triangle.

Exactly! This is the way Machine Learning works too — based on feedback from the label provided with the data.

The below image might help understand it better:

Let’s take off the training wheels and see how far our newly trained child can go:

“All that’s cool but I don’t need a machine to categorize shapes.” — Almost everyone

To give a little more perspective on its potential - Machine Learning is the magic behind how Google lets you find your favorite cat photos (Who doesn’t love cats!) on the internet. If you look closely, almost everything you interact with on a day-to-day basis has machine learning weaved into it — Amazon product recommendations, Netflix movie recommendations, predicting stock prices, Alexa Voice, Sales forecasting, Automatic subtitle generation, Language translation and a lot more. The details pertaining to those is outside the scope of this post.
The major advantages with the Machine Learning approach is that:

You don’t need any domain knowledge to get started
The same classifier which can recognize shapes can also recognize numerical digits, cats vs dogs etc. since the Machine trains itself based on the input data
Can be continuously improved based on its own performance

The few disadvantages with Machine Learning are:

Needs a lot of training data for better predictions
Needs a lot of processing power for training the model and deriving inferences

However, the advent of the information age, providing us with humongous amount of data, and the drastic reduction in CPU and GPU costs over the years have helped solve both the above problems. Thus, it makes sense now more than ever, to have a generalized solution for such cases rather than a problem-specific solution.

That’s where Machine Learning comes into play. Sounds really cool, doesn’t it? If it’s so fascinating, why isn’t everyone trying to get their hands dirty with Machine Learning?

I will be addressing this question in my next post. Stay Tuned!

Until then, That’s all folks!