DEV Community

Jenil Sheth
Jenil Sheth

Posted on • Edited on

ML ALGOS List

1️⃣ Supervised Learning Algorithms

(Labelled data hota hai)

πŸ”Ή Regression Algorithms

  • Linear Regression
  • Multiple Linear Regression
  • Polynomial Regression
  • Ridge Regression
  • Lasso Regression
  • Elastic Net
  • Bayesian Regression
  • Quantile Regression
  • Support Vector Regression (SVR)
  • Decision Tree Regression
  • Random Forest Regression
  • Gradient Boosting Regression
  • XGBoost Regression
  • LightGBM Regression
  • CatBoost Regression

πŸ”Ή Classification Algorithms

  • Logistic Regression
  • Naive Bayes
    • Gaussian NB
    • Multinomial NB
    • Bernoulli NB
  • K-Nearest Neighbors (KNN)
  • Support Vector Machine (SVM)
  • Decision Tree
  • Random Forest
  • Extra Trees (Extremely Randomized Trees)
  • Gradient Boosting
  • AdaBoost
  • XGBoost
  • LightGBM
  • CatBoost
  • Linear Discriminant Analysis (LDA)
  • Quadratic Discriminant Analysis (QDA)
  • Perceptron
  • Passive Aggressive Classifier

2️⃣ Unsupervised Learning Algorithms

(Label nahi hota)

πŸ”Ή Clustering Algorithms

  • K-Means
  • K-Medoids
  • Hierarchical Clustering
    • Agglomerative
    • Divisive
  • DBSCAN
  • HDBSCAN
  • Mean Shift
  • Gaussian Mixture Model (GMM)
  • Spectral Clustering
  • OPTICS
  • BIRCH
  • Affinity Propagation

πŸ”Ή Dimensionality Reduction

  • Principal Component Analysis (PCA)
  • Kernel PCA
  • Linear Discriminant Analysis (LDA)
  • t-SNE
  • UMAP
  • Independent Component Analysis (ICA)
  • Autoencoders
  • Factor Analysis
  • Truncated SVD

πŸ”Ή Association Rule Learning

  • Apriori Algorithm
  • FP-Growth
  • Eclat

3️⃣ Semi-Supervised Learning

  • Label Propagation
  • Label Spreading
  • Self Training
  • Co-Training
  • Semi-Supervised SVM

4️⃣ Reinforcement Learning Algorithms

πŸ”Ή Value-Based

  • Q-Learning
  • SARSA
  • Deep Q Network (DQN)
  • Double DQN
  • Dueling DQN

πŸ”Ή Policy-Based

  • REINFORCE
  • Policy Gradient

πŸ”Ή Actor-Critic

  • A2C
  • A3C
  • DDPG
  • TD3
  • PPO
  • SAC

5️⃣ Ensemble Learning Techniques

  • Bagging
  • Boosting
  • Stacking
  • Voting Classifier
  • Random Forest
  • Gradient Boosting

6️⃣ Neural Network Based Algorithms

(Deep Learning ML ka hi part hai)

πŸ”Ή Basic

  • Artificial Neural Network (ANN)
  • Multilayer Perceptron (MLP)

πŸ”Ή Advanced

  • Convolutional Neural Network (CNN)
  • Recurrent Neural Network (RNN)
  • LSTM
  • GRU
  • Transformer
  • Graph Neural Network (GNN)

7️⃣ Anomaly / Outlier Detection

  • Isolation Forest
  • One-Class SVM
  • Local Outlier Factor (LOF)
  • Elliptic Envelope
  • Autoencoder based anomaly detection

8️⃣ Probabilistic & Statistical Models

  • Hidden Markov Model (HMM)
  • Conditional Random Field (CRF)
  • Bayesian Networks
  • Markov Chain

9️⃣ Online Learning Algorithms

  • Stochastic Gradient Descent (SGD)
  • Online Perceptron
  • Hoeffding Tree

πŸ“Œ Note:

  • Industry me sabse zyada use hone wale: πŸ‘‰ Linear/Logistic Regression, Random Forest, XGBoost, SVM, Neural Networks
  • Exams / Interviews ke liye: πŸ‘‰ KNN, Naive Bayes, Decision Tree, Clustering, PCA

Agar aap chaho to main:

  • πŸ”Ή sirf Deep Learning algorithms
  • πŸ”Ή exam-oriented short list
  • πŸ”Ή Python ke sath algorithms
  • πŸ”Ή kaunsa algorithm kab use kare

bhi bana sakta hoon πŸ™‚

Prompt:

kaunsa algorithm kab use kare

ye bhi batatao

Response:

ΰ€ ΰ₯€ΰ€• ΰ€Ήΰ₯ˆ πŸ™‚

ΰ€¨ΰ₯€ΰ€šΰ₯‡ β€œkaunsa Machine Learning algorithm kab use kare” ko simple decision guide + examples ke saath samjha raha hoon.


1️⃣ Data ke type ke hisaab se algorithm

πŸ”Ή Numerical data (numbers)

  • Linear Regression β†’ relation linear ho
  • Polynomial Regression β†’ curve type relation
  • Random Forest / XGBoost β†’ complex patterns, best accuracy

πŸ‘‰ Example: House price prediction


πŸ”Ή Categorical data (class / label)

  • Logistic Regression β†’ simple & fast
  • Naive Bayes β†’ text data (spam detection)
  • Decision Tree β†’ explainable model
  • Random Forest / XGBoost β†’ high accuracy

πŸ‘‰ Example: Spam vs Not Spam


2️⃣ Dataset size ke hisaab se

πŸ”Ή Chhota dataset (<10k rows)

  • KNN
  • Naive Bayes
  • SVM
  • Decision Tree

⚠️ Deep Learning avoid karo (overfitting hota hai)


πŸ”Ή Medium dataset (10k – 1M)

  • Random Forest
  • Gradient Boosting
  • XGBoost
  • LightGBM

πŸ”Ή Bahut bada dataset (>1M)

  • SGD
  • Linear / Logistic Regression
  • Neural Networks
  • Online Learning algorithms

3️⃣ Problem type ke hisaab se

πŸ”Ή Regression problem (number predict karna)

Situation Best Algorithm
Linear relation Linear Regression
Outliers present Ridge / Lasso
Non-linear Random Forest
High accuracy chahiye XGBoost

πŸ”Ή Classification problem (class predict)

Condition Algorithm
Binary classification Logistic Regression
Text classification Naive Bayes
High dimension SVM
Interpretability Decision Tree
Best accuracy XGBoost / RF

4️⃣ Unlabelled data (Clustering)

πŸ”Ή Clustering

Data Algorithm
Well-separated clusters K-Means
Noise present DBSCAN
Unknown clusters Hierarchical
Probabilistic GMM

πŸ‘‰ Example: Customer segmentation


5️⃣ Dimensionality zyada ho

  • PCA β†’ features kam karne ke liye
  • t-SNE / UMAP β†’ visualization
  • Autoencoders β†’ deep features

πŸ‘‰ Example: Image / gene data


6️⃣ Time series data

  • ARIMA
  • SARIMA
  • LSTM
  • GRU
  • Prophet

πŸ‘‰ Example: Stock price, weather


7️⃣ Text / NLP problems

  • Naive Bayes
  • Logistic Regression + TF-IDF
  • Word2Vec
  • LSTM
  • Transformers (BERT, GPT)

πŸ‘‰ Example: Sentiment analysis


8️⃣ Image data

  • CNN
  • ResNet
  • EfficientNet
  • Vision Transformers

πŸ‘‰ Example: Face recognition


9️⃣ Real-time / streaming data

  • Online SGD
  • Hoeffding Tree
  • Reinforcement Learning

πŸ‘‰ Example: Recommendation system


πŸ”Ÿ Reinforcement Learning kab use kare?

  • Environment + reward ho
  • Agent decision leta ho

πŸ‘‰ Example: Game AI, Robotics


🧠 Golden Rule (Interview Friendly)

1️⃣ Simple model se start karo

2️⃣ Overfitting check karo

3️⃣ Ensemble try karo

4️⃣ Deep Learning tab hi jab data zyada ho

Top comments (0)