DEV Community

Azad Kshitij
Azad Kshitij

Posted on

1

Machine Learning: Basic Terms

In these series of articles we will talk about ML algorithms and implement them in python and the main focus of these articles is to understand math and logic behind these algorithms. We will start by understanding basic terms of ML.

1. Classification Algorithms

As the name suggest these are classification algorithms that categorizes the data into different category. It is a Supervised Learning technique.

2. Supervised Learning

Supervised learning is associated with learning with supervision or training. In supervised learning, the algorithms are trained with data which is labeled or classified.

3. Unsupervised learning

Unsupervised learning is associated with learning without supervision or training. In unsupervised learning, the algorithms are trained with data which is neither labeled nor classified. In unsupervised learning, the agent needs to learn from patterns without corresponding output values.

4. Cross-Entropy Loss

Is a metric used to measure how well a classification model in machine learning performs. The loss (or error) is measured as a number between 0 and 1, with 0 being a perfect model.

5. Condition positive (P)

The number of real positive cases in the data

6. Condition negative (N)

The number of real negative cases in the data

7. True positive (TP)

A test result that correctly indicates the presence of a condition or characteristic

8. True negative (TN)

A test result that correctly indicates the absence of a condition or characteristic

9. False positive (FP)

A test result which wrongly indicates that a particular condition or attribute is present

10. False negative (FN)

A test result which wrongly indicates that a particular condition or attribute is absent

11. Confusion Matrix

It provides us the result of our model in matrix form (as shown below). We use this to measure accuracy of the mode.

Actual positive Actual Negative
Predicted Positive True Positive False Negative
Predicted Negative False Negative True Negative

12. Area Under the Curve (AUC)

Is the area encapsulated between the curve and x-axis.

Area Under the Curve

13. Receiver Operating Characteristic (ROC)

The receiver operating characteristic (ROC) curve, which is defined as a plot of test sensitivity as the y coordinate versus its 1-specificity or false positive rate (FPR) as the x coordinate

ROC=hitshits+misses ROC = \frac{hits}{hits + misses}

Receiver Operating Characteristic

This image from Wikipedia shows exact definition of ROC.

If you want me to include any other terms in this article comment them and I will do my best to include them.

References

Sentry blog image

How I fixed 20 seconds of lag for every user in just 20 minutes.

Our AI agent was running 10-20 seconds slower than it should, impacting both our own developers and our early adopters. See how I used Sentry Profiling to fix it in record time.

Read more

Top comments (0)

Billboard image

The Next Generation Developer Platform

Coherence is the first Platform-as-a-Service you can control. Unlike "black-box" platforms that are opinionated about the infra you can deploy, Coherence is powered by CNC, the open-source IaC framework, which offers limitless customization.

Learn more

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay