<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Keffas Mutethia Nyamu</title>
    <description>The latest articles on DEV Community by Keffas Mutethia Nyamu (@keffas_mutethia_9628a0a72).</description>
    <link>https://dev.to/keffas_mutethia_9628a0a72</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3223136%2Ff72a8a01-996e-49d9-8efe-4c76ccc52beb.png</url>
      <title>DEV Community: Keffas Mutethia Nyamu</title>
      <link>https://dev.to/keffas_mutethia_9628a0a72</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/keffas_mutethia_9628a0a72"/>
    <language>en</language>
    <item>
      <title>Supervised Learning and the Role of Classification in Machine Learning</title>
      <dc:creator>Keffas Mutethia Nyamu</dc:creator>
      <pubDate>Mon, 25 Aug 2025 18:09:28 +0000</pubDate>
      <link>https://dev.to/keffas_mutethia_9628a0a72/supervised-learning-and-the-role-of-classification-in-machine-learning-49ah</link>
      <guid>https://dev.to/keffas_mutethia_9628a0a72/supervised-learning-and-the-role-of-classification-in-machine-learning-49ah</guid>
      <description>&lt;p&gt;Supervised learning is one of the most powerful approaches in machine learning. At its core, it is about learning from examples. The model is trained on data where both the inputs and the correct outputs (labels) are provided. By studying these patterns, the model learns how to map inputs to outputs. Once trained, it can make predictions on unseen data—a process that powers many of the intelligent systems we use every day.  &lt;/p&gt;




&lt;h2&gt;
  
  
  How Classification Works
&lt;/h2&gt;

&lt;p&gt;Within supervised learning, &lt;strong&gt;classification&lt;/strong&gt; focuses on predicting discrete categories. Unlike regression, which forecasts continuous values, classification assigns inputs to predefined groups.  &lt;/p&gt;

&lt;p&gt;A familiar example is spam detection: every email is classified as either &lt;em&gt;“spam”&lt;/em&gt; or &lt;em&gt;“not spam.”&lt;/em&gt; The algorithm learns from past emails—those labeled as spam or safe—and applies those lessons to new ones. Essentially, classification is about defining boundaries in data space, separating one class from another.  &lt;/p&gt;




&lt;h2&gt;
  
  
  Models Commonly Used in Classification
&lt;/h2&gt;

&lt;p&gt;Over time, many models have been developed for classification, each with unique strengths:  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Logistic Regression&lt;/strong&gt; – A straightforward yet effective method, especially for problems with linear relationships.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Decision Trees&lt;/strong&gt; – Easy to interpret and explain, as they split data into simple decision rules.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Random Forests &amp;amp; Gradient Boosting (XGBoost, LightGBM, CatBoost)&lt;/strong&gt; – Ensemble methods that combine multiple models to achieve strong performance.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Naïve Bayes&lt;/strong&gt; – Particularly useful in text classification tasks such as spam filtering.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;k-Nearest Neighbors (k-NN)&lt;/strong&gt; – A simple technique that classifies based on the closest data points.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Support Vector Machines (SVMs)&lt;/strong&gt; – Effective when classes are clearly separated in high-dimensional space.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Neural Networks&lt;/strong&gt; – Capable of handling complex and unstructured data, such as images and audio.
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The choice of model often depends on the problem, the dataset, and the balance between interpretability and accuracy.  &lt;/p&gt;




&lt;h2&gt;
  
  
  Personal Insights
&lt;/h2&gt;

&lt;p&gt;What stands out most about classification is its wide range of applications. From detecting fraud to predicting customer churn, it enables businesses and organizations to turn raw data into actionable insights.  &lt;/p&gt;

&lt;p&gt;However, I’ve also learned that success in classification rarely comes from the algorithm alone. &lt;strong&gt;Data preparation&lt;/strong&gt;—handling missing values, feature selection, and balancing class distributions—often has a greater impact than the choice of model itself. Good data leads to good results.  &lt;/p&gt;

&lt;p&gt;Another key takeaway is that accuracy isn’t everything. In real-world applications, &lt;strong&gt;interpretability, fairness, and reliability&lt;/strong&gt; matter just as much as performance metrics. For instance, in healthcare, a highly accurate but opaque model may not be as valuable as a slightly less accurate model that clinicians can understand and trust.  &lt;/p&gt;




&lt;h2&gt;
  
  
  Challenges in Classification
&lt;/h2&gt;

&lt;p&gt;Working with classification problems presents a set of challenges that often shape the outcome:  &lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Imbalanced Classes&lt;/strong&gt; – Many datasets are skewed, with one class dominating. Models can easily become biased toward the majority class unless techniques like resampling or adjusted class weights are applied.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Overfitting&lt;/strong&gt; – Complex models can perform well on training data but fail to generalize. Careful validation and regularization are key.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data Quality&lt;/strong&gt; – Noisy, incomplete, or irrelevant features can reduce model performance significantly. Preprocessing and feature engineering are essential steps.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Evolving Data&lt;/strong&gt; – Patterns change over time. Fraudsters adapt, customers shift behavior, and models need to be retrained to stay effective.
&lt;/li&gt;
&lt;/ol&gt;




&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Classification is one of the most practical and impactful tasks in supervised learning. It transforms raw data into meaningful categories, enabling informed decisions across industries. While the choice of algorithm is important, the real success often lies in understanding the data, addressing challenges like imbalance and overfitting, and ensuring the solutions are trustworthy and explainable.  &lt;/p&gt;

&lt;p&gt;At its best, classification is not just about predicting labels—it is about &lt;strong&gt;solving real problems and unlocking value from data&lt;/strong&gt;.  &lt;/p&gt;

</description>
    </item>
    <item>
      <title>Premier League 2024–2025: Teams Most Likely to Win — Based on Win Probability</title>
      <dc:creator>Keffas Mutethia Nyamu</dc:creator>
      <pubDate>Thu, 31 Jul 2025 16:23:58 +0000</pubDate>
      <link>https://dev.to/keffas_mutethia_9628a0a72/premier-league-2024-2025-teams-most-likely-to-win-based-on-win-probability-492h</link>
      <guid>https://dev.to/keffas_mutethia_9628a0a72/premier-league-2024-2025-teams-most-likely-to-win-based-on-win-probability-492h</guid>
      <description>&lt;p&gt;As the curtain falls on the 2024–2025 Premier League season, an analysis of team performance based on win probability provides insight into the clubs most likely to dominate in future campaigns. This article uses empirical win rates, calculated from the number of matches won out of 38, to determine each team’s likelihood of winning.&lt;/p&gt;

&lt;h2&gt;
  
  
  How Win Probability Was Calculated
&lt;/h2&gt;

&lt;p&gt;Win probability was derived using the formula:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Win Probability = Wins ÷ Matches Played&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Each Premier League team played &lt;strong&gt;38 matches&lt;/strong&gt; this season. The number of victories per team was divided by 38 to compute a win rate (expressed as a decimal), representing their likelihood of winning a match.&lt;/p&gt;




&lt;h2&gt;
  
  
  Teams with the Highest Win Probabilities
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. &lt;strong&gt;Liverpool FC – 0.658&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Liverpool emerged as the season’s most dominant side, winning &lt;strong&gt;25 matches&lt;/strong&gt;. Their high win probability of &lt;strong&gt;65.8%&lt;/strong&gt; highlights consistent performance, likely driven by tactical depth, squad fitness, and solid management.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. &lt;strong&gt;Manchester City FC – 0.553&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Manchester City followed with a &lt;strong&gt;55.3% win rate&lt;/strong&gt; (&lt;em&gt;21 wins&lt;/em&gt;). While slightly below their usual standard, City remained a formidable side and continue to pose a serious title threat.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. &lt;strong&gt;Arsenal FC, Chelsea FC &amp;amp; Newcastle United FC – 0.526&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;All three clubs secured &lt;strong&gt;20 wins&lt;/strong&gt;, translating to a &lt;strong&gt;52.6% win probability&lt;/strong&gt;.  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;em&gt;Arsenal’s&lt;/em&gt; youthful energy
&lt;/li&gt;
&lt;li&gt;
&lt;em&gt;Chelsea’s&lt;/em&gt; tactical rebuilding
&lt;/li&gt;
&lt;li&gt;
&lt;em&gt;Newcastle’s&lt;/em&gt; resurgence under Eddie Howe
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Each contributed to strong campaigns.&lt;/p&gt;

&lt;h3&gt;
  
  
  4. &lt;strong&gt;Nottingham Forest FC &amp;amp; Aston Villa FC – 0.500&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;With &lt;strong&gt;19 wins&lt;/strong&gt; each, these two clubs hit a &lt;strong&gt;50% win rate&lt;/strong&gt;, suggesting they are now among the league’s upper mid-table performers.&lt;br&gt;&lt;br&gt;
&lt;em&gt;Aston Villa&lt;/em&gt;, in particular, continued their rise with disciplined, attacking football.&lt;/p&gt;




&lt;h2&gt;
  
  
  Mid-Table Contenders
&lt;/h2&gt;

&lt;p&gt;Clubs such as &lt;strong&gt;Brighton&lt;/strong&gt;, &lt;strong&gt;Brentford&lt;/strong&gt;, &lt;strong&gt;Fulham&lt;/strong&gt;, and &lt;strong&gt;Bournemouth&lt;/strong&gt; had win probabilities ranging from &lt;strong&gt;0.395 to 0.421&lt;/strong&gt;, showing competitiveness but room for growth.&lt;/p&gt;




&lt;h2&gt;
  
  
  Underperformers
&lt;/h2&gt;

&lt;p&gt;At the bottom of the win probability table:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Manchester United, West Ham, Everton, Tottenham&lt;/strong&gt; — all posted a &lt;strong&gt;0.289 win rate&lt;/strong&gt; (&lt;em&gt;11 wins&lt;/em&gt;), a surprising dip for clubs with top-six ambitions.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Leicester City (0.158)&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Ipswich Town (0.105)&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Southampton (0.053)&lt;/strong&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These clubs struggled heavily, signaling potential relegation threats or serious structural issues.&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;While win probability does not account for &lt;strong&gt;draws&lt;/strong&gt; or &lt;strong&gt;goal difference&lt;/strong&gt;, it offers a clear view of consistent winners across the season.&lt;/p&gt;

&lt;p&gt;Based on current form:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Liverpool&lt;/strong&gt;, &lt;strong&gt;Manchester City&lt;/strong&gt;, and &lt;strong&gt;Arsenal&lt;/strong&gt; remain the most likely to win in future fixtures — barring transfers or managerial changes.&lt;/li&gt;
&lt;li&gt;Clubs like &lt;strong&gt;Nottingham Forest&lt;/strong&gt; and &lt;strong&gt;Aston Villa&lt;/strong&gt; are also worth watching, as their &lt;strong&gt;50% win rate&lt;/strong&gt; suggests growing strength.&lt;/li&gt;
&lt;li&gt;On the flip side, traditional powerhouses such as &lt;strong&gt;Manchester United&lt;/strong&gt; and &lt;strong&gt;Tottenham&lt;/strong&gt; may need &lt;strong&gt;major overhauls&lt;/strong&gt; to return to form.&lt;/li&gt;
&lt;/ul&gt;

</description>
    </item>
    <item>
      <title>Measures of Central Tendency and Their Importance in Data Science</title>
      <dc:creator>Keffas Mutethia Nyamu</dc:creator>
      <pubDate>Tue, 22 Jul 2025 10:46:53 +0000</pubDate>
      <link>https://dev.to/keffas_mutethia_9628a0a72/-measures-of-central-tendency-and-their-importance-in-data-science-2cn1</link>
      <guid>https://dev.to/keffas_mutethia_9628a0a72/-measures-of-central-tendency-and-their-importance-in-data-science-2cn1</guid>
      <description>&lt;p&gt;In any data analysis or statistical endeavour, understanding the behaviour of a dataset is fundamental. One of the primary ways to summarise and interpret data is through &lt;strong&gt;measures of central tendency&lt;/strong&gt;, which identify the central or typical value around which data points cluster. The three main measures of central tendency are &lt;strong&gt;mean&lt;/strong&gt;, &lt;strong&gt;median&lt;/strong&gt;, and &lt;strong&gt;mode&lt;/strong&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  1. The Mean
&lt;/h2&gt;

&lt;p&gt;The &lt;strong&gt;mean&lt;/strong&gt;, commonly known as the average, is calculated by summing all values in a dataset and dividing by the number of values. For example, if data represents the ages of participants in a survey, the mean gives the general age around which most participants’ ages are spread.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Advantages&lt;/strong&gt;: Uses all data points, making it highly representative when there are no extreme outliers.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Limitations&lt;/strong&gt;: Sensitive to outliers, which can distort the mean away from the true typical value in skewed distributions.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  2. The Median
&lt;/h2&gt;

&lt;p&gt;The &lt;strong&gt;median&lt;/strong&gt; is the middle value in an ordered dataset. If the dataset has an odd number of observations, it is the exact middle; if even, it is the average of the two central numbers.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Advantages&lt;/strong&gt;: Robust to outliers and skewed data. For instance, in income data where a few very high incomes inflate the mean, the median provides a better measure of typical income.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Limitations&lt;/strong&gt;: Does not use all data points directly, only their order.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  3. The Mode
&lt;/h2&gt;

&lt;p&gt;The &lt;strong&gt;mode&lt;/strong&gt; is the value that occurs most frequently in a dataset. In categorical data, it is particularly useful as the mean or median cannot be computed.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Advantages&lt;/strong&gt;: The only measure that can be used for nominal data and indicates the most common category or value.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Limitations&lt;/strong&gt;: A dataset can be bimodal or multimodal (having more than one mode), and in some cases, no mode exists if all values are unique.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Why Are Measures of Central Tendency Important in Data Science?
&lt;/h2&gt;

&lt;h3&gt;
  
  
  a. Summarising Data
&lt;/h3&gt;

&lt;p&gt;Large datasets can be overwhelming. Measures of central tendency simplify these datasets by providing a single value that summarises the general tendency, making it easier to interpret results and communicate findings to stakeholders.&lt;/p&gt;

&lt;h3&gt;
  
  
  b. Facilitating Comparison
&lt;/h3&gt;

&lt;p&gt;Central tendency measures enable comparison between different groups or datasets. For example, comparing the mean sales of two products quickly informs decision-makers about relative performance.&lt;/p&gt;

&lt;h3&gt;
  
  
  c. Supporting Model Building
&lt;/h3&gt;

&lt;p&gt;Many machine learning algorithms, such as K-Means clustering or Gaussian Naïve Bayes, rely on assumptions involving central tendencies. For instance:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;K-Means&lt;/strong&gt; seeks to minimise distances from cluster means.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Gaussian distributions&lt;/strong&gt; use means and variances to define probability densities.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  d. Identifying Skewness and Outliers
&lt;/h3&gt;

&lt;p&gt;Analysing differences between the mean and median provides insights into data distribution and potential outliers, guiding preprocessing decisions like transformation or outlier treatment before model training.&lt;/p&gt;

&lt;h3&gt;
  
  
  e. Informing Business Decisions
&lt;/h3&gt;

&lt;p&gt;Data science projects often aim at actionable business insights. For example:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Identifying the average time customers spend on a website informs user experience optimisation.&lt;/li&gt;
&lt;li&gt;Knowing the modal product size bought assists inventory decisions.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Measures of central tendency are fundamental tools in data science for summarising data, supporting modelling, and driving informed decisions. While they provide critical insights into data behaviour, it is essential to use them alongside &lt;strong&gt;measures of dispersion&lt;/strong&gt; and &lt;strong&gt;data visualisation&lt;/strong&gt; to gain a comprehensive understanding of datasets before implementing analytical or predictive solutions.&lt;/p&gt;

</description>
    </item>
    <item>
      <title># Understanding Relationships in Power BI</title>
      <dc:creator>Keffas Mutethia Nyamu</dc:creator>
      <pubDate>Mon, 14 Jul 2025 12:58:34 +0000</pubDate>
      <link>https://dev.to/keffas_mutethia_9628a0a72/-understanding-relationships-in-power-bi-6l2</link>
      <guid>https://dev.to/keffas_mutethia_9628a0a72/-understanding-relationships-in-power-bi-6l2</guid>
      <description>&lt;p&gt;Relationships are a core concept in Power BI, allowing users to connect multiple tables to build meaningful and accurate reports. Without relationships, data analysis would require manually merging tables, leading to inefficiency, duplication, and potential errors.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Are Relationships?
&lt;/h2&gt;

&lt;p&gt;In Power BI, a relationship defines how two tables are connected through common fields. For example, a &lt;strong&gt;Sales table&lt;/strong&gt; containing ProductID can connect to a &lt;strong&gt;Products table&lt;/strong&gt; with product details. This connection enables you to analyse sales by product name, category, or other attributes without combining tables into a single flat file.&lt;/p&gt;

&lt;h2&gt;
  
  
  Types of Relationships
&lt;/h2&gt;

&lt;p&gt;One common type is &lt;strong&gt;One-to-Many (1:*)&lt;/strong&gt;, where one record in a table relates to multiple records in another. For instance, each product in the Products table can appear many times in the Sales table.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Many-to-One (*:1)&lt;/strong&gt; is essentially the reverse view, where multiple records in one table relate back to one record in another.&lt;/p&gt;

&lt;p&gt;Another type is &lt;strong&gt;Many-to-Many (&lt;em&gt;:&lt;/em&gt;)&lt;/strong&gt;, used when both tables can have multiple matching rows. This relationship type helps in complex models but should be used cautiously, as it can create ambiguity in calculations.&lt;/p&gt;

&lt;p&gt;Relationships also have &lt;strong&gt;filter directions&lt;/strong&gt;. A &lt;em&gt;single directional filter&lt;/em&gt; means data filters from one table to another, which is ideal for most scenarios. &lt;em&gt;Bidirectional filters&lt;/em&gt; allow filters to flow both ways between tables. They are useful in specific analysis, such as role-playing dimensions, but can impact performance and produce unexpected results if used incorrectly.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Are Relationships Important?
&lt;/h2&gt;

&lt;p&gt;Relationships allow seamless combination of data from multiple tables. They support advanced DAX calculations that involve related tables and ensure data integrity in visuals and aggregations.&lt;/p&gt;

&lt;p&gt;Using relationships also reduces duplication by enabling you to build a &lt;strong&gt;star schema&lt;/strong&gt;, where dimension tables (e.g. Products, Customers) connect to fact tables (e.g. Sales, Transactions). This improves performance and simplifies data models.&lt;/p&gt;

&lt;h2&gt;
  
  
  Creating Relationships
&lt;/h2&gt;

&lt;p&gt;Power BI can &lt;strong&gt;automatically detect relationships&lt;/strong&gt; based on similar column names and data types. However, it is good practice to review these auto-created relationships for accuracy.&lt;/p&gt;

&lt;p&gt;To create relationships manually:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Go to &lt;strong&gt;Model view&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Drag a field from one table to its matching field in another table.&lt;/li&gt;
&lt;li&gt;Set the &lt;strong&gt;cardinality&lt;/strong&gt; (1:&lt;em&gt;, *:1, or *:&lt;/em&gt;).&lt;/li&gt;
&lt;li&gt;Choose the appropriate &lt;strong&gt;cross-filter direction&lt;/strong&gt; based on your analysis needs.&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Best Practices
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Always check cardinality and filter direction when creating relationships.&lt;/li&gt;
&lt;li&gt;Avoid using Many-to-Many relationships unless necessary.&lt;/li&gt;
&lt;li&gt;Prefer single directional filters for clarity and performance.&lt;/li&gt;
&lt;li&gt;Design your data model as a star schema wherever possible.&lt;/li&gt;
&lt;li&gt;Ensure fields used for relationships are clean, consistent, and free of duplicates on the ‘one’ side.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Common Pitfalls
&lt;/h2&gt;

&lt;p&gt;Incorrect cardinality can lead to inaccurate totals in visuals. Unintended bidirectional filters may create ambiguity, while missing relationships can cause blank visuals or errors in DAX measures.&lt;/p&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Mastering relationships in Power BI is fundamental to building robust and accurate data models. By understanding types of relationships, how to create them, and applying best practices, you will enhance your data analysis capabilities and create impactful reports that drive strategic decisions.&lt;/p&gt;

</description>
      <category>datascience</category>
      <category>analytics</category>
      <category>developers</category>
    </item>
    <item>
      <title>How Excel is Used in Real-World Data Analysis</title>
      <dc:creator>Keffas Mutethia Nyamu</dc:creator>
      <pubDate>Wed, 11 Jun 2025 09:20:11 +0000</pubDate>
      <link>https://dev.to/keffas_mutethia_9628a0a72/how-excel-is-used-in-real-world-data-analysis-55he</link>
      <guid>https://dev.to/keffas_mutethia_9628a0a72/how-excel-is-used-in-real-world-data-analysis-55he</guid>
      <description>&lt;p&gt;One of the most widely used tools in data analysis is Microsoft Excel. It does not matter whether you are a small business owner or work on a global enterprise, Excel offers you impressive features that enable users to organize, interpret, and act on data. The in-depth look at Excel this week demonstrated its real-world applicability regarding informed, data-driven decisions. Data Analysis Real-world applications of Excel are much more than a mere spreadsheet program. &lt;br&gt;
&lt;strong&gt;Three essential applications of Excel in the real-world data analysis are as follows:&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt; Business Decision-Making 
Organizations use Excel to monitor key performance metrics, make sales projections, and document business processes. The use of dashboards and charts created in Excel ensures that the leaders are able to understand the trends and make strategic decisions within a short period. &lt;/li&gt;
&lt;li&gt; Financial Reporting 
The world of finance cannot run without Excel in budgeting, financial modeling, and creating reports. It enables analysts to cope with a huge amount of data, do calculations, and be accurate in tracking profits and losses. &lt;/li&gt;
&lt;li&gt; Marketing Performance Analysis 
Excel helps marketing teams to track the campaign rates, customer activity, and return on investment (ROI). Equipped with organized data and interactive graphics, marketers have the chance to evaluate and optimize their tactics.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;The Features and Formulas of Excel I have Learned so far&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The topic of this week, concentrated on Excel, presented some important tools and formulas that complement data analysis in the real world: &lt;/p&gt;

&lt;p&gt;• SUM () Function: It is a basic function that helps to add up row or column totals- it is best at adding up sales numbers, time costs, or poll results. &lt;br&gt;
• IF () Statement: Allows logical decision-making on a dataset. As an example, it may be applied to mark customers as inactive or active depending on the purchase history. &lt;br&gt;
• Pivot Tables: An efficient summary of a huge amount of data. They enable users to see totals, averages, or counts broken down by any variables of interest such as region, month, or product line.&lt;br&gt;
• VLOOKUP () and HLOOKUP (): The two look-up functions make it easy to retrieve data in big tables. VLOOKUP and HLOOKUP are important functions that search vertically and horizontally respectively, necessary in finding the matching data between spreadsheets.&lt;br&gt;
 • INDEX () and MATCH (): This is a powerful combination that can extend the look-up. INDEX retrieves the value of a cell at a given position and MATCH determines that the position given some criteria is more flexible and powerful than VLOOKUP in most cases. &lt;br&gt;
In conclusion, learning about Excel through and through has changed my perception of data. Learning Excel at a more advanced level has been a revelation as what appeared to be mere numbers. It stops being the instrument of number storage only- it becomes the medium of problem-solving, thinking, and telling stories with data. This capability to find the meaning in a set of data and to transform it into intelligent and practical implications has changed my way of thinking concerning the purpose of data in decision-making. Excel has also helped me realize that some of the most basic tools in the right hands can become a powerful force. I have shifted my attitude toward data as a mere piece of information but as a strategic asset that could be used to shape the result and encourage innovation.&lt;/p&gt;

</description>
      <category>datascience</category>
      <category>dataanalytics</category>
      <category>beginners</category>
    </item>
  </channel>
</rss>
