<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Eugene Angwenyi</title>
    <description>The latest articles on DEV Community by Eugene Angwenyi (@angwelo).</description>
    <link>https://dev.to/angwelo</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3257482%2Fc679142d-378b-4276-8347-d591ffd2af31.png</url>
      <title>DEV Community: Eugene Angwenyi</title>
      <link>https://dev.to/angwelo</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/angwelo"/>
    <language>en</language>
    <item>
      <title>SUPERVISED LEARNING: CLASSIFICATION</title>
      <dc:creator>Eugene Angwenyi</dc:creator>
      <pubDate>Mon, 25 Aug 2025 18:04:24 +0000</pubDate>
      <link>https://dev.to/angwelo/supervised-learning-classification-2n0e</link>
      <guid>https://dev.to/angwelo/supervised-learning-classification-2n0e</guid>
      <description>&lt;p&gt;Classification is a supervised machine learning technique used to categorize or assign data points into predefined classes based on their features or attributes. The goal of classification is to predict categories or classes.&lt;br&gt;
The process of classification is as follows:&lt;br&gt;
Data Preparation: The first step includes cleaning the data, handling missing values, and transforming it into a format the model can understand. This can also involve feature engineering, which is the process of creating new features from existing ones to improve model performance.&lt;/p&gt;

&lt;p&gt;Training the Model: The labeled data is split into a training set and a testing set. The model learns patterns from the training data that can be generalized to unseen data.&lt;/p&gt;

&lt;p&gt;Making Predictions: Once the model is trained, it can be used to predict the class of new, unseen data.&lt;/p&gt;

&lt;p&gt;Evaluation: The model's performance is then evaluated using the testing set. Common metrics include accuracy, precision, recall, and the F1 score. Advanced evaluations may also use confusion matrices, ROC curves, and AUC scores to get a deeper understanding of performance.&lt;/p&gt;

&lt;p&gt;Classification problems in machine learning fall into different categories depending on how many classes there are and how labels are assigned. They are categorized as follows:&lt;br&gt;
Binary Classification: These are problems where there are only two possible outcomes. These outcomes include: Yes/No, Positive/Negative, 0/1, True/False.&lt;/p&gt;

&lt;p&gt;Multi-Class Classification: This is a problem where there are two or more classes and each sample belongs to exactly one class. The output labels include a single label chosen from many. The outcomes include: (0,1,2,…k).&lt;/p&gt;

&lt;p&gt;Multi-Label Classification: This is a problem where each sample can be assigned more than one label at the same time. The output labels are a vector of binary values where each element indicates whether the label applies or not.&lt;/p&gt;

&lt;p&gt;In order for classification to work, various algorithmic models are used. These models include:&lt;br&gt;
Logistic Regression&lt;/p&gt;

&lt;p&gt;K-Nearest Neighbours (KNN)&lt;/p&gt;

&lt;p&gt;Decision Trees&lt;/p&gt;

&lt;p&gt;Random Forests&lt;/p&gt;

&lt;p&gt;Support Vector Machines (SVM)&lt;/p&gt;

&lt;p&gt;Naïve Bayes&lt;/p&gt;

&lt;p&gt;Gradient Boosting methods such as XGBoost and LightGBM&lt;/p&gt;

&lt;p&gt;Neural Networks for deep learning-based classification tasks&lt;/p&gt;

&lt;p&gt;The most common use cases for classification are as follows:&lt;br&gt;
Healthcare: It can be used in medical imaging (classifying scans as normal or abnormal), disease diagnosis (predicting conditions like diabetes or cancer), and risk stratification (categorizing patients into low, medium, or high risk).&lt;/p&gt;

&lt;p&gt;Finance and Business: Applications include fraud detection (classifying transactions as genuine or fraudulent), credit scoring (low, medium, high risk), and customer churn prediction (whether a customer will leave or stay).&lt;/p&gt;

&lt;p&gt;Marketing and E-Commerce: Classification is applied in customer segmentation, product recommendations, and sentiment analysis of customer reviews.&lt;/p&gt;

&lt;p&gt;Cyber Security: Use cases include spam email detection, intrusion detection, and phishing website detection.&lt;/p&gt;

&lt;p&gt;Natural Language Processing (NLP): It is widely used for text classification (news categorization), language identification, and intent recognition in chatbots.&lt;/p&gt;

&lt;p&gt;Computer Vision: Tasks include face recognition, object detection, and medical imaging-based diagnosis.&lt;/p&gt;

&lt;p&gt;Social Media and Communication: Filtering inappropriate content, classifying posts or comments by topic, and detecting fake news.&lt;/p&gt;

&lt;p&gt;Autonomous Systems: Used in self-driving cars for classifying road signs, pedestrians, and obstacles.&lt;/p&gt;

&lt;p&gt;Conclusion&lt;br&gt;
Classification, as a supervised learning technique, plays a central role in solving real-world problems across multiple industries. By leveraging structured data and algorithmic models, it enables systems to make informed predictions and decisions. From diagnosing diseases to detecting fraudulent transactions, and from filtering spam emails to recognizing faces, classification remains one of the most impactful applications of machine learning. As data continues to grow and models become more advanced, the scope and accuracy of classification will only expand, making it a vital tool in the present and future of artificial intelligence.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>EXCEL USE IN REAL WORLD DATA ANALYSIS</title>
      <dc:creator>Eugene Angwenyi</dc:creator>
      <pubDate>Wed, 11 Jun 2025 07:55:59 +0000</pubDate>
      <link>https://dev.to/angwelo/excel-use-in-real-world-data-analysis-6lg</link>
      <guid>https://dev.to/angwelo/excel-use-in-real-world-data-analysis-6lg</guid>
      <description>&lt;p&gt;Microsoft Excel is one of the most powerful and widely used tools for data analysis across industries. Whether you're analyzing business performance, preparing financial reports, or managing project data, Excel provides an interface and robust functionality that makes it an essential skill for data professionals.&lt;br&gt;
One of the most common uses of Excel in data analysis is data cleaning and organization. Before any meaningful insights can be derived, raw data often needs to be cleaned—this includes removing duplicates, correcting errors, and filling in missing values. Excel's built-in functions such as Find &amp;amp; Replace, Text to Columns, and Data Validation help make this process more efficient.&lt;br&gt;
Another major application is data visualization. Excel allows users to create a wide range of charts—bar graphs, line charts, scatter plots, and pie charts—that help visualize trends, patterns, and outliers in the data. With tools like conditional formatting and PivotCharts, analysts can present complex data in a simple and understandable format.&lt;br&gt;
During my first week, I’ve gained hands-on experience with several key Excel tools:&lt;br&gt;
SUM(): This function adds a range of numbers. It's essential for tasks like calculating total revenue or expenses.&lt;/p&gt;

&lt;p&gt;IF(): A logical function that returns different values based on a condition. For instance, I used it to classify sales as “High” or “Low” based on a threshold.&lt;/p&gt;

&lt;p&gt;Pivot Tables: A powerful feature for summarizing large datasets. I used pivot tables to group data by category and calculate average values quickly.&lt;br&gt;
Learning Excel has shifted how I see and work with data. What once seemed like rows of meaningless numbers now appears as an organized system full of insights. I’m beginning to understand how data can be transformed into decisions. The ability to manipulate and visualize data directly in Excel has empowered me to explore patterns and trends with confidence. It’s exciting to realize that even at this early stage, I can already create value through data.&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
