<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Ryan Murunga</title>
    <description>The latest articles on DEV Community by Ryan Murunga (@ryan_murunga_10c024263652).</description>
    <link>https://dev.to/ryan_murunga_10c024263652</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3093631%2Fb6424b30-93d9-4edd-bbea-fabf7c85f502.jpg</url>
      <title>DEV Community: Ryan Murunga</title>
      <link>https://dev.to/ryan_murunga_10c024263652</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/ryan_murunga_10c024263652"/>
    <language>en</language>
    <item>
      <title>Data Modelling in Power Bi</title>
      <dc:creator>Ryan Murunga</dc:creator>
      <pubDate>Mon, 29 Jun 2026 10:52:50 +0000</pubDate>
      <link>https://dev.to/ryan_murunga_10c024263652/data-modelling-in-power-bi-2lk2</link>
      <guid>https://dev.to/ryan_murunga_10c024263652/data-modelling-in-power-bi-2lk2</guid>
      <description>&lt;p&gt;&lt;strong&gt;INTRODUCTION&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Power Bi is a comprehensive business analytic tool developed by microsoft.&lt;/p&gt;

&lt;p&gt;Power Bi is used for:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Connect data .&lt;/li&gt;
&lt;li&gt;Clean and transform data.&lt;/li&gt;
&lt;li&gt;Create relationships between tables &lt;/li&gt;
&lt;li&gt;Analyze data &lt;/li&gt;
&lt;li&gt;Build Reports &lt;/li&gt;
&lt;li&gt;Create dashboards and share insights &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Some of the importance of Power Bi is;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Cleaning messy data &lt;/li&gt;
&lt;li&gt;Analyze large amounts of data &lt;/li&gt;
&lt;li&gt;Create Interactive dashboards&lt;/li&gt;
&lt;li&gt;Identify trends &amp;amp; patterns&lt;/li&gt;
&lt;li&gt;Make data driven decisions&lt;/li&gt;
&lt;li&gt;Present insights visually&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Core components of power bi include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Power BI Desktop - Is a free Windows application used to connect to data sources, clean and transform information, model relationships, and build reports.&lt;/li&gt;
&lt;li&gt;Power BI Service - Is a cloud-based platform accessible via web browsers. &lt;/li&gt;
&lt;li&gt;Power BI Mobile - Is a Native iOS and Android apps that allow you to monitor business metrics and interact with reports on the go.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;DAX FUNCTIONS&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;DAX&lt;/strong&gt; - Data Analysis Expression (Is a formula language used in power Bi to create calculations and analyze logics.)&lt;/p&gt;

&lt;p&gt;DAX is used to create ;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Calculated columns &lt;/li&gt;
&lt;li&gt;Calculated tables&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;DAX functions allow us to create calculations that are not direclty available in the raw dataset.&lt;/p&gt;

&lt;p&gt;Here are some of few common DAX function;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Sum
Is a measure used to calculate a total value.
&lt;/li&gt;
&lt;/ol&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="n"&gt;Total&lt;/span&gt; &lt;span class="n"&gt;Sales&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;SUM&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;Sales&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="n"&gt;SalesAmount&lt;/span&gt;&lt;span class="p"&gt;])&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ol&gt;
&lt;li&gt;Average
Is a measure used to calculate the average of values.
&lt;/li&gt;
&lt;/ol&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="n"&gt;Average&lt;/span&gt; &lt;span class="n"&gt;Sales&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;AVERAGE&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;Sales&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="n"&gt;Amount&lt;/span&gt;&lt;span class="p"&gt;])&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ol&gt;
&lt;li&gt;Count 
Is used to count the number of rows in a column that contain numbers.
&lt;/li&gt;
&lt;/ol&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="n"&gt;Total&lt;/span&gt; &lt;span class="n"&gt;Orders&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;COUNT&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;Sales&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="n"&gt;OrderID&lt;/span&gt;&lt;span class="p"&gt;])&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ol&gt;
&lt;li&gt;IF
Is used to create conditions. It checks if a condition is true or false and returns a specific result based on that.
&lt;/li&gt;
&lt;/ol&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Sales Category = IF(Sales[Amount] &amp;gt; 10000, “High”, “Low”)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ol&gt;
&lt;li&gt;RANKX
It assigns ranks to values based on a specific measure.
&lt;/li&gt;
&lt;/ol&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Sales Rank = RANKX(ALL(Products), SUM(Sales[Amount]))
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ol&gt;
&lt;li&gt;ISBLANK
It checks if a value is blank.
&lt;/li&gt;
&lt;/ol&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Check Sales = IF(ISBLANK(Sales[Amount]), “No Data”, Sales[Amount])
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ol&gt;
&lt;li&gt;NOW
It returns the current date and time.
&lt;/li&gt;
&lt;/ol&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Current DateTime = NOW()
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;After a short overview of PowerBi lets now understand what is Data Modelling.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data Modelling&lt;/strong&gt; -  is the process of structuring and defining relationships between your tables to enable fast, accurate, and scalable data analysis.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Relationships&lt;/strong&gt; - link data across different tables so your visuals can aggregate, filter, and slice data correctly.&lt;/p&gt;

&lt;p&gt;There are 3 main relationships &lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;One to Many - A database where a single record in a one table is linked to a table with multiple records&lt;br&gt;
e.g&lt;br&gt;
One patient can have many hospital visits&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;One to One - A database where a single records corresponds to exactly one unique in another record.&lt;br&gt;
e.g&lt;br&gt;
One patient can have one patient profile&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Many to Many - A database where multiple records in one entity are associated with multiple records in another entity.&lt;br&gt;
e.g&lt;br&gt;
One patient can have many diagnosis&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;SCHEMAS&lt;/strong&gt;&lt;br&gt;
This refers to how data is organized in a database.&lt;/p&gt;

&lt;p&gt;The main types of schema are ;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Flat Table&lt;/li&gt;
&lt;li&gt;Star Schema&lt;/li&gt;
&lt;li&gt;Snowflake Schema &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;1. Flat table&lt;/strong&gt;&lt;br&gt;
Is a single large table that contain all the information in one place &lt;/p&gt;

&lt;p&gt;Characteristics&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Has many columns&lt;/li&gt;
&lt;li&gt;Combines both ; Descriptive columns, Numeric Columns, ID columns,Repeated values &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Example of a flat table &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F9ffrazszjykbhroebhkq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F9ffrazszjykbhroebhkq.png" alt=" " width="799" height="344"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Advantages&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Simple for small datasets&lt;/li&gt;
&lt;li&gt;Easy to load in PowerBI&lt;/li&gt;
&lt;li&gt;Easy to read &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Disdvantages&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;May slow down reports&lt;/li&gt;
&lt;li&gt;Can create data quality problems 
3.Contain reported data &lt;/li&gt;
&lt;li&gt;Can become very large datasets&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;2. Star Schema&lt;/strong&gt;&lt;br&gt;
Is a data model where a central fact table connects directly to multiple dimension table.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Fact-table&lt;/em&gt; Sits in the middle &lt;br&gt;
&lt;em&gt;Dimension&lt;/em&gt; Tables surround it&lt;/p&gt;

&lt;p&gt;Example of a star schema&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fg34xk0oo9gymfvhfmbmi.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fg34xk0oo9gymfvhfmbmi.png" alt=" " width="798" height="518"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Fact table&lt;br&gt;
Stores events , transaction or activities that happened &lt;br&gt;
The fact table contains:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Keys that connnect to dimension tables&lt;/li&gt;
&lt;li&gt;Numeric Values that can be calculated&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Dimension Table&lt;br&gt;
Stores descriptive information&lt;br&gt;
Helps us filter , group and describe the fact table&lt;/p&gt;

&lt;p&gt;Advantages of Star schema &lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Reduces repeated data &lt;/li&gt;
&lt;li&gt;Improves report performance &lt;/li&gt;
&lt;li&gt;Easier to understand&lt;/li&gt;
&lt;li&gt;Makes DAX  measures easier&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Star schema is recommended because &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Dimension tables contain descreption&lt;/li&gt;
&lt;li&gt;Filter flow from dimensions to facts&lt;/li&gt;
&lt;li&gt;Fact tables contains numbers and keys&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;3. SNOWFLAKE SCHEMA&lt;/strong&gt;&lt;br&gt;
This is when dimension tables are normalized and split into multiple, related sub-dimension tables.&lt;br&gt;
Dimension tables are split into smaller related tables&lt;/p&gt;

&lt;p&gt;Example of a snowflake schema&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F4qk1xoshrgfost9jxaao.webp" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F4qk1xoshrgfost9jxaao.webp" alt=" " width="800" height="400"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Snowflake schema is useful when;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Creating a more normalized model&lt;/li&gt;
&lt;li&gt;Reduce repeated data&lt;/li&gt;
&lt;li&gt;Have very large dimension tables &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Differences between Star Schema and Snowflake schema&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;AREA&lt;/th&gt;
&lt;th&gt;STAR SCHEMA&lt;/th&gt;
&lt;th&gt;SNOWFLAKE SCHEMA&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Performance in Power BI&lt;/td&gt;
&lt;td&gt;Usually Faster&lt;/td&gt;
&lt;td&gt;Can be slower if too many relationships&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Simplicity&lt;/td&gt;
&lt;td&gt;Easier for beginners&lt;/td&gt;
&lt;td&gt;More Complex&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Structure&lt;/td&gt;
&lt;td&gt;Fact table connected directly to dimensions&lt;/td&gt;
&lt;td&gt;Dimensions are split into smaller dimensions&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Storage&lt;/td&gt;
&lt;td&gt;More repeated dimension data&lt;/td&gt;
&lt;td&gt;Less data&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Best for&lt;/td&gt;
&lt;td&gt;Reporting and dashboards&lt;/td&gt;
&lt;td&gt;More normalized database style model&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

</description>
      <category>beginners</category>
      <category>database</category>
      <category>datascience</category>
      <category>analytics</category>
    </item>
    <item>
      <title>How I Built a 2026 World Cup Predictor Dashboard</title>
      <dc:creator>Ryan Murunga</dc:creator>
      <pubDate>Fri, 19 Jun 2026 05:59:46 +0000</pubDate>
      <link>https://dev.to/ryan_murunga_10c024263652/how-i-built-a-2026-world-cup-predictor-dashboard-joe</link>
      <guid>https://dev.to/ryan_murunga_10c024263652/how-i-built-a-2026-world-cup-predictor-dashboard-joe</guid>
      <description>&lt;p&gt;Let’s be honest: there is nothing quite like the feeling of moving a machine learning model off a messy local Jupyter Notebook and seeing it live, breathing, and clickable on the internet. &lt;/p&gt;

&lt;p&gt;A few weeks ago, I decided to tackle a fun challenge: predicting the knockout stages of the upcoming 2026 FIFA World Cup. Instead of just looking at standard text readouts on my terminal, I wanted to build a complete, interactive visual experience that anyone could play with. &lt;/p&gt;

&lt;p&gt;Here is the story of how I built the data pipeline, why I chose my model, and the cheeky trick I used to keep the app online 24/7 for free.&lt;/p&gt;




&lt;h2&gt;
  
  
  🏗️ Breaking Down the Pipeline
&lt;/h2&gt;

&lt;p&gt;I wanted to keep this workspace modular and clean. Instead of one massive script that does everything, I split the workflow across four dedicated stages:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;The Ingestion (&lt;code&gt;01_data_alignment.ipynb&lt;/code&gt;):&lt;/strong&gt; Cleaning up raw data and aligning historical match histories.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The Feature Store (&lt;code&gt;02_historical_feature_store.ipynb&lt;/code&gt;):&lt;/strong&gt; This is where the magic happens. I engineered metrics to track team form, "attacking velocity," and defensive stability over time.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The Baseline (&lt;code&gt;03_local_validation_and_baseline.ipynb&lt;/code&gt;):&lt;/strong&gt; Setting up cross-validation loops to ensure the model wasn't just guessing blindly.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The Interface (&lt;code&gt;app.py&lt;/code&gt;):&lt;/strong&gt; The final production script handling the simulation logic and rendering the front-end layout.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  🤖 The Engine: Why LightGBM?
&lt;/h2&gt;

&lt;p&gt;For the predictive heavy lifting, I went with LightGBM. &lt;/p&gt;

&lt;p&gt;International football data is highly tabular but filled with tricky, non-linear relationships (e.g., how a team's current attacking form clashes with an opponent's specific defensive structure). LightGBM handles these interactions beautifully. Plus, it trains incredibly fast and allows me to output precise match probability distributions rather than just a boring binary "Win/Loss" result.&lt;/p&gt;




&lt;h2&gt;
  
  
  🎨 Fighting with Streamlit and CSS
&lt;/h2&gt;

&lt;p&gt;I wanted this app to be intuitive for regular football fans, not just techies. &lt;/p&gt;

&lt;p&gt;I built a sidebar playground where you can match any two qualified nations head-to-head and tweak "volatility sliders" to simulate massive underdog upsets. &lt;/p&gt;

&lt;p&gt;But the hardest part? The bracket. Streamlit is amazing, but drawing a fully responsive, clean 16-team tournament tree using standard components is tough. To fix it, I injected raw custom CSS blocks directly into the layout. Now, it draws crisp vector grid lines mapping progression straight from the Round of 16 down to a gold-bordered podium crowning the grand champion.&lt;/p&gt;




&lt;h2&gt;
  
  
  😴 The "Never Sleep" Secret
&lt;/h2&gt;

&lt;p&gt;I deployed the app for free using Streamlit Community Cloud. It’s an incredible service, but there is a catch: if your app doesn't get traffic for 12 hours, the server puts it to sleep. The next visitor gets a slow "This app is hibernating" loading screen. &lt;/p&gt;

&lt;p&gt;To bypass this for my portfolio, I set up a simple GitHub Actions workflow (&lt;code&gt;.github/workflows/keep_alive.yml&lt;/code&gt;). Every 10 hours, it automatically pushes an invisible, empty commit to the repository. This tricks the cloud server into seeing active development, resetting the hibernation timer to zero. Result? The app stays awake 24/7!&lt;/p&gt;




&lt;h2&gt;
  
  
  🎮 Take it for a Spin!
&lt;/h2&gt;

&lt;p&gt;The entire project is public, and I’d love for you to play around with it. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;🌍 &lt;strong&gt;Live App:&lt;/strong&gt; &lt;a href="https://airflow-gsheets-app-pipelin-cwrzh6sdkghbtth4ym7zwv.streamlit.app/" rel="noopener noreferrer"&gt;Test Your 2026 World Cup Predictions&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;📂 &lt;strong&gt;GitHub Source:&lt;/strong&gt; &lt;a href="https://github.com/MouroshK/World_cup_2026" rel="noopener noreferrer"&gt;MouroshK/World_cup_2026&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  🏆 The Ultimate Prediction
&lt;/h2&gt;

&lt;p&gt;To give you a sneak peek of the dashboard in action, here is the full knockout bracket generated by the pipeline's baseline run:&lt;/p&gt;

&lt;p&gt;2026 World Cup Predictive Bracket Simulation&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fo6h7tlsynqy9tsnav07l.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fo6h7tlsynqy9tsnav07l.png" alt=" " width="800" height="352"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;As you can see, the LightGBM engine forecasts an absolute heavyweight clash for the final, ultimately predicting &lt;strong&gt;France&lt;/strong&gt; to edge out &lt;strong&gt;Argentina&lt;/strong&gt; to lift the trophy, while nations like &lt;strong&gt;England&lt;/strong&gt; and &lt;strong&gt;Brazil&lt;/strong&gt; fall just short in the semifinals.&lt;/p&gt;

</description>
      <category>python</category>
      <category>machinelearning</category>
      <category>datascience</category>
      <category>ai</category>
    </item>
    <item>
      <title>How Excel is used in Real World Data Analysis</title>
      <dc:creator>Ryan Murunga</dc:creator>
      <pubDate>Fri, 05 Jun 2026 08:43:32 +0000</pubDate>
      <link>https://dev.to/ryan_murunga_10c024263652/how-excel-is-used-in-real-world-data-analysis-3n4d</link>
      <guid>https://dev.to/ryan_murunga_10c024263652/how-excel-is-used-in-real-world-data-analysis-3n4d</guid>
      <description>&lt;h3&gt;
  
  
  EXCEL
&lt;/h3&gt;

&lt;p&gt;This is a spreadsheet software developed by Microsoft used to insert , read and manipulate data.&lt;/p&gt;

&lt;p&gt;This is an example of how an excel page looks like. It is called a workbook&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwwbxva8ehfw7hmgehi0g.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwwbxva8ehfw7hmgehi0g.png" alt=" " width="800" height="427"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Basics of excel
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fucg29ftgr4eosadgntub.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fucg29ftgr4eosadgntub.png" alt=" " width="316" height="160"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Formula bar&lt;/strong&gt; - Area above the grid where the contents or formula of the selected cell appear.&lt;br&gt;
&lt;strong&gt;Column header&lt;/strong&gt; - Is the label at the very top of a column in a spreadsheet, database, or web table.&lt;br&gt;
&lt;strong&gt;Row header&lt;/strong&gt; - Is the first column in a data table that identifies the specific information, category, or ID for the corresponding data to its right.&lt;br&gt;
&lt;strong&gt;Selected cell/Active Cell&lt;/strong&gt; -  Is the currently selected cell in a spreadsheet where you can directly enter data or formulas.&lt;br&gt;
&lt;strong&gt;Sheet tab&lt;/strong&gt; - Tabs at the bottom (Sheet1, Sheet2) that switch between spreadsheets within a workbook.&lt;br&gt;
&lt;strong&gt;Cell&lt;/strong&gt; - A single box where a row and a column intersect eg D9.&lt;br&gt;
&lt;strong&gt;Row&lt;/strong&gt; - Labelled 1,2,3... down the left side horizontally .&lt;br&gt;
&lt;strong&gt;Columns&lt;/strong&gt; - Labelled A,B,C... across the top vertically.&lt;/p&gt;
&lt;h3&gt;
  
  
  Ways in which Excel is used in real world data analytics
&lt;/h3&gt;
&lt;h4&gt;
  
  
  1. Financial reporting
&lt;/h4&gt;

&lt;p&gt;Most business uses excel to create financial reports, forecast revenue and analyze profitability. Excel can also help track expenses , manage cash flow and optimize financial decision making.&lt;/p&gt;

&lt;p&gt;This is an example of a profit and loss statement&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fydaymbckch92ztn3n5md.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fydaymbckch92ztn3n5md.png" alt=" " width="800" height="906"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h4&gt;
  
  
  2. Healthcare &amp;amp; Research
&lt;/h4&gt;

&lt;p&gt;Patients records, Lab results , clinical results data are stored in excel . Lab results can be used to predict future outbreaks of diseases&lt;/p&gt;

&lt;p&gt;This is an example of a medical research KPI dashboard in excell&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiqirjv7oveofjr0uwfvv.webp" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiqirjv7oveofjr0uwfvv.webp" alt=" " width="799" height="451"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h4&gt;
  
  
  3. Marketing and Sales
&lt;/h4&gt;

&lt;p&gt;Excel is used in sales tracking, inventory control,revenue targets, conversion rates and supply chain management. Businesses can use formulas to calculate stock levels, reorder points, and sales growth trends. It helps companies reduce stockouts, prevent overstocking, and improve operational efficiency.&lt;/p&gt;

&lt;p&gt;This is an example of an excel sales forecasting &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzcdath67dwkmjqwmwy1a.webp" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzcdath67dwkmjqwmwy1a.webp" alt=" " width="799" height="594"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h4&gt;
  
  
  4. Business forecasting &amp;amp; Decision making
&lt;/h4&gt;

&lt;p&gt;This is predicting what will happen in the future to be able to make decision based on the prediction.By using historical data and forecasting models, businesses can predict market trends,plan for future growth, assess risk and make data-driven decisions.&lt;/p&gt;

&lt;p&gt;This is an example of a forecast sheet&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4nfloodfc6xqwv8urxsb.webp" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4nfloodfc6xqwv8urxsb.webp" alt=" " width="800" height="404"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h4&gt;
  
  
  5. Automation &amp;amp; Productivity
&lt;/h4&gt;

&lt;p&gt;Excel automation features like templates and predefined functions help businesses save time and reduce manual errors. It allows companies to automate repetitive tasks such as data entry, invoicing, and payroll processing, increasing overall productivity.&lt;/p&gt;

&lt;p&gt;This  is an example of an automated Periods Table.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fj4pa9bjn5hoo46haw5kn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fj4pa9bjn5hoo46haw5kn.png" alt=" " width="800" height="430"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;
  
  
  Formulas in Excel
&lt;/h3&gt;

&lt;p&gt;In excel they are very many function that makes data analysis simple . Here a few of the main functions one should know.&lt;/p&gt;
&lt;h4&gt;
  
  
  1. Sum
&lt;/h4&gt;

&lt;p&gt;Adds all the values in the said range &lt;br&gt;
E.g in the below it add all values from B2,B3,...,B8&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=SUM(B2:B8) 
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  2. Average
&lt;/h4&gt;

&lt;p&gt;Finds the average of values in the said range&lt;br&gt;
E.g In the below cell it finds the average of all values between cell  D2,D3,...,D22&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=AVERAGE(D2:D22)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  3. Max
&lt;/h4&gt;

&lt;p&gt;Finds the max number in said range of values&lt;br&gt;
E.g In the below cell it finds the maximum number in all cells from F3,F4,...,F10&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=MAX(F3:F10)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  4. Count
&lt;/h4&gt;

&lt;p&gt;Count cells with numbers&lt;br&gt;
E.g It returns the number of cells with  numbers&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=COUNT(H2:H10)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  5. Filter
&lt;/h4&gt;

&lt;p&gt;Extracts data from a table based on criteria you specify.&lt;br&gt;
E.g Checks the data and instantly spill out a list of only the rows where sales (Column C) are greater than 5000.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=FILTER(A2:C100, C2:C100&amp;gt;5000, "No Sales")
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  6. Sort
&lt;/h4&gt;

&lt;p&gt;Sorts the contents of a range or array automatically.&lt;br&gt;
E.g Sorts the data range based on the 2nd column, in descending order (highest to lowest).&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=SORT(A2:B100, 2, -1)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  7. IF
&lt;/h4&gt;

&lt;p&gt;It checks if a condition is true or false and gives you a different result depending on the outcome.&lt;br&gt;
E.g If a student's grade in cell B2 is 70 or higher, Excel prints "Pass". If it's 69 or lower, it prints "Fail".&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=IF(B2&amp;gt;=70, "Pass", "Fail")
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  8. Median
&lt;/h4&gt;

&lt;p&gt;Returns the exact middle number in a given set of numbers.&lt;br&gt;
E.g Finds the middle value of all numbers in that range.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=MEDIAN(C2:C50)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  9. Mode
&lt;/h4&gt;

&lt;p&gt;Returns the most frequently occurring or repetitive value in a dataset.&lt;br&gt;
E.g It shows you which number is most frequently.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=MODE(D2:D100)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  10. Proper
&lt;/h4&gt;

&lt;p&gt;It capitalizes the first letter of every word in a text string and converts all other letters to lowercase.&lt;br&gt;
E.g If cell A2 contains the messy text: "jOHN mCcLAIN" the formula will cleanly output: "John Mcclain"&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=PROPER(A2)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  11.COUNTIF
&lt;/h4&gt;

&lt;p&gt;Returns the value in a given a range given one condition&lt;br&gt;
E.g How many total deals did the North region close?&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=COUNTIF(B2:B100, "North")
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  12. COUNTIFS
&lt;/h4&gt;

&lt;p&gt;Returns the value in a given range with more than one condition&lt;br&gt;
E.g Returns the number of deals in the North region are still 'Pending'&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=COUNTIFS(B2:B100, "North", D2:D100, "Pending")
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  13.SUMIF
&lt;/h4&gt;

&lt;p&gt;Adds the values in a range that meet a single, specified condition&lt;br&gt;
E.g Adds the total revenue brought in by the sales rep 'Sarah'?&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=SUMIF(A2:A100, "Sarah", C2:C100)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  14.SUMIFS
&lt;/h4&gt;

&lt;p&gt;Adds all numbers in a range that meet multiple conditions&lt;br&gt;
E.g Adds total revenue brought in by 'Sarah' for deals larger than $10,000?&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=SUMIFS(C2:C100, A2:A100, "Sarah", C2:C100, "&amp;gt;10000")
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  15. AVERAGEIF
&lt;/h4&gt;

&lt;p&gt;Calculates the average of numbers in a range that meet a specific condition&lt;br&gt;
E.g Calculate the average deal size in the South region?&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=AVERAGEIF(B2:B100, "South", C2:C100)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  16. AVERAGEIFS
&lt;/h4&gt;

&lt;p&gt;Calculates the averages of numbers in a specific range based on more than one conditions.&lt;br&gt;
E.g Calculates the average size of 'Closed' deals in the South region?&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=AVERAGEIFS(C2:C100, D2:D100, "Closed", B2:B100, "South")
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  17. ROUND
&lt;/h4&gt;

&lt;p&gt;Used to round a number to a specified number of digits&lt;br&gt;
E.g A sales commission calculates out to $1,250.4567. You need to round it to standard currency (2 decimal places) for payroll.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=ROUND(1250.4567, 2) returns 1250.46
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  18. ROUNDUP
&lt;/h4&gt;

&lt;p&gt;Used to round a number up&lt;br&gt;
E.g You need to ship items to a client. A box holds 10 items, and they ordered 41 items ($41 / 10 = 4.1$). You can't ship 4.1 boxes; you must round up to ensure you have enough physical boxes.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=ROUNDUP(4.1, 0) returns 5.
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  19. INT
&lt;/h4&gt;

&lt;p&gt;Rounds a number down to the nearest whole interger by removing its decimals&lt;br&gt;
E.g An employee clocked 38.75 hours this week. You want to extract just the full, completed hours they worked.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=INT(38.75) returns 38
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  20.ABS
&lt;/h4&gt;

&lt;p&gt;Returns the positive value of a number by removing its negative sign &lt;br&gt;
E.g If a refund is logged as -$450.00, you want to know the absolute dollar volume of the transaction.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=ABS(-450) returns 450
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  21. TODAY()
&lt;/h4&gt;

&lt;p&gt;Is a formula that updates the day automatically&lt;br&gt;
E.g Finding out exactly what day it is today to calculate invoice aging.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=TODAY() returns 06/05/2026 (based on today's date)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  22. NOW
&lt;/h4&gt;

&lt;p&gt;Inserts the current date and time&lt;br&gt;
E.g Stamping a report so stakeholders know exactly what minute the data was pulled from the server.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=NOW() returns 06/05/2026 15:02
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  23. YEAR
&lt;/h4&gt;

&lt;p&gt;To extract year in a date&lt;br&gt;
E.g You want to group sales by their calendar year.If cell A2 holds 06/05/2026.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=YEAR(A2) returns 2026
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  24.WEEKDAY
&lt;/h4&gt;

&lt;p&gt;To extract the day of the week in excel&lt;br&gt;
E.g Checking if a client submitted a payment on a weekend so you can account for bank processing delays.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=WEEKDAY(A2) returns 6 (which represents Friday)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  25. WEEKNUM
&lt;/h4&gt;

&lt;p&gt;Finds the week number of a specific date&lt;br&gt;
E.g &lt;/p&gt;

&lt;p&gt;Excel is a very important tool in Data analysis. We can say it the mother of all since it can do almost all the analysis. Excel needs to be one of the most important tools one has to learn to be able to understand data analysis.&lt;/p&gt;

</description>
      <category>learning</category>
      <category>datascience</category>
      <category>data</category>
      <category>beginners</category>
    </item>
    <item>
      <title>My Data Journey</title>
      <dc:creator>Ryan Murunga</dc:creator>
      <pubDate>Wed, 03 Jun 2026 16:41:41 +0000</pubDate>
      <link>https://dev.to/ryan_murunga_10c024263652/my-data-journey-2634</link>
      <guid>https://dev.to/ryan_murunga_10c024263652/my-data-journey-2634</guid>
      <description>&lt;h5&gt;
  
  
  Background
&lt;/h5&gt;

&lt;p&gt;As a young kid i realized i had a passion with Mathematics. I would really struggle understanding the other subjects but Math to me was so easy . When i finished highschool the big question was what i wanted to do . I already new that i wanted to do something associated with Math. I did my research and the course that was coming up more frequent was Acturial Science . I applied for the course later on while still doing more research Data science came up and i was curious about it . Looked it up and so that the career opportunities were what i would be intrested . I was later called for Acturial Science but did a course transfer to Data Science&lt;/p&gt;

&lt;h5&gt;
  
  
  Life in Data Science
&lt;/h5&gt;

&lt;p&gt;Back to September 2023 and i began my degree in Data Science . At first i was happy with what i was learning and enjoyed . The only challenge i faced is i expected alot of practical work and i only recieved the theory part of most my units. I wanted to learn the theory but i didn't know where to start. I tried out different online contents but i was never satisfied with whatever i was learning . I still was eager to get my hands on real data.As this was the reason i joined Lux academy to get hands on skill on Data Science&lt;/p&gt;

&lt;p&gt;All my best as i begin this journey&lt;/p&gt;

</description>
      <category>learning</category>
      <category>data</category>
      <category>datascience</category>
      <category>100daysofcode</category>
    </item>
  </channel>
</rss>
