<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Sharon Chepkemoi</title>
    <description>The latest articles on DEV Community by Sharon Chepkemoi (@sharonchepkemoi).</description>
    <link>https://dev.to/sharonchepkemoi</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3851724%2F1ea4263a-ab8e-40b7-ab62-75bd4f3643bb.png</url>
      <title>DEV Community: Sharon Chepkemoi</title>
      <link>https://dev.to/sharonchepkemoi</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/sharonchepkemoi"/>
    <language>en</language>
    <item>
      <title>Understanding Data Modeling in Power BI: Joins, Relationships and Schemas Explained.</title>
      <dc:creator>Sharon Chepkemoi</dc:creator>
      <pubDate>Mon, 30 Mar 2026 14:51:31 +0000</pubDate>
      <link>https://dev.to/sharonchepkemoi/understanding-data-modeling-in-power-bi-joins-relationships-and-schemas-explained-5826</link>
      <guid>https://dev.to/sharonchepkemoi/understanding-data-modeling-in-power-bi-joins-relationships-and-schemas-explained-5826</guid>
      <description>&lt;p&gt;&lt;strong&gt;1.What is data modeling?&lt;/strong&gt;&lt;br&gt;
Data modeling is the process of structuring and organizing Data to make is easily accessible for reporting in power BI. It involves connecting tables, defining relationships, cleaning and shaping data.&lt;br&gt;
Organizes data into a structured format.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2.SQL Joins in Power BI (power query)&lt;/strong&gt;&lt;br&gt;
Powe BI enables SQL like joins  via merge queries to create a physical table. These are essential for data cleaning. Combine tables based on specific requirements.&lt;br&gt;
&lt;strong&gt;INNER JOIN:&lt;/strong&gt; Keeps only rows that have matching values in both tables.&lt;br&gt;
       Example: joining employee table with department table. Only employees assigned to a valid department are returned.&lt;br&gt;
&lt;strong&gt;LEFT JOIN:&lt;/strong&gt; Keeps all rows from the left table and matches from the right. No match it returns null.&lt;br&gt;
    Example; Joining sales table with product table. All sales products are kept and product details added and id a sale was made for a product not in the product table, the product name will be null.&lt;br&gt;
&lt;strong&gt;RIGHT JOIN:&lt;/strong&gt; Keeps all rows from the right table and matches from the left.&lt;br&gt;
    Example; Join customer table with sales. All sales are kept and if a sale exists with no corresponding customer, then it is replaced with null.&lt;br&gt;
&lt;strong&gt;FULL OUTER JOIN:&lt;/strong&gt; Keeps all rows from both tables matching where possible, filling gaps with nulls.&lt;br&gt;
&lt;strong&gt;LEFT ANTI JOIN:&lt;/strong&gt; Keeps rows from the left table that don’t exist in the right.&lt;br&gt;
&lt;strong&gt;RIGHT ANTI JOIN:&lt;/strong&gt; Keeps rows from the right that don’t exist in the left.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3.Power BI Relationships&lt;/strong&gt;&lt;br&gt;
Connects tables allowing them to communicate without physically merging them.&lt;br&gt;
Cardinality: Describes the nature of relationships.&lt;br&gt;
&lt;strong&gt;One to Many (1:M):&lt;/strong&gt; One row relates to many rows.&lt;br&gt;
&lt;strong&gt;Many to Many (M:M):&lt;/strong&gt; Both sides have duplicates.&lt;br&gt;
&lt;strong&gt;One to One (1:1):&lt;/strong&gt; Matches unique rows. Used when splitting large tables.&lt;br&gt;
&lt;strong&gt;Active vs Inactive relationships:&lt;/strong&gt; Only one can be active (solid line). Use DAX (USERELATIONSHIP) for the inactive (dotted line).&lt;br&gt;
&lt;strong&gt;Cross Filter Directions:&lt;/strong&gt; controls filters.&lt;br&gt;
       Single: Dimension filters the fact table.&lt;br&gt;
       Both: Both tables filter each other.&lt;br&gt;
4.&lt;strong&gt;Difference Between Joins and Relationships&lt;/strong&gt;&lt;br&gt;
Joins merge tables during data loading permanently while Relationships keep the tables separate but connected for dynamic filtering based on visuals.&lt;br&gt;
Joins are used for data cleaning while Relationships are used for reporting and slicing.&lt;br&gt;
Joins are created in power query while relationships are created in model view in power BI.&lt;br&gt;
&lt;strong&gt;5.Facts vs Dimension tables&lt;/strong&gt;&lt;br&gt;
Fact tables: Contain quantitative, numeric data e.g. sales. Have many rows&lt;br&gt;
Dimension table: Contains descriptive, categorical context e.g. “who”, “why”, “what”, product name. Have many columns.&lt;br&gt;
&lt;strong&gt;6.Data Schemas&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Star schema:&lt;/strong&gt; Central fact table connected to dimension tables.&lt;br&gt;
  Example; A sales fact table linked to product table, customer table.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk1cioom7l96uj2holsud.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk1cioom7l96uj2holsud.png" alt=" " width="800" height="400"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Snowflake schema:&lt;/strong&gt; Dimension tables are split into multiple tables, normalizing the data.&lt;br&gt;
Used when dealing with highly hierarchical data (complex product catalogs).&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6yiiqcw002nl4dof27ge.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6yiiqcw002nl4dof27ge.png" alt=" " width="800" height="483"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Flat table (DLAT):&lt;/strong&gt; All data in one table. Simple but slow and inefficient. No joins needed.&lt;br&gt;
&lt;strong&gt;7.Role Playing Dimensions&lt;/strong&gt;&lt;br&gt;
A single dimension serving multiple roles in the fact table.&lt;br&gt;
Example; date dimension used for both harvest and planting dates.&lt;br&gt;
&lt;strong&gt;8.Common Data Modeling Issues&lt;/strong&gt;&lt;br&gt;
Circular relationships between tables&lt;br&gt;
Mismatched keys in merges that leads to missing data&lt;br&gt;
Ignoring inactive relationships leads to incorrect totals&lt;br&gt;
Doing too much transformation in DAX instead of power query&lt;br&gt;
Poorly defined entities.&lt;br&gt;
&lt;strong&gt;9.Step by Step Power BI Workflow&lt;/strong&gt;&lt;br&gt;
Get data: Open power BI desktop and use “get data” to connect to various sources. &lt;br&gt;
Transform data to open and clean data in power query editor&lt;br&gt;
Model data (data view): Switch to model view to establish relationships between tables.&lt;br&gt;
Define: Create necessary DAX measures. Use DAX to build calculated columns and measures (e.g. SUM, AVERAGE, CALCULATE). &lt;br&gt;
Visualize data (Report view): Build interactive reports by dragging fields onto the canvas, selecting chart types and formatting visuals.&lt;/p&gt;

</description>
      <category>data</category>
      <category>analytics</category>
      <category>beginners</category>
    </item>
    <item>
      <title>How Excel is used in Real World Data Analysis</title>
      <dc:creator>Sharon Chepkemoi</dc:creator>
      <pubDate>Mon, 30 Mar 2026 14:50:59 +0000</pubDate>
      <link>https://dev.to/sharonchepkemoi/how-excel-is-used-in-real-world-data-analysis-25hk</link>
      <guid>https://dev.to/sharonchepkemoi/how-excel-is-used-in-real-world-data-analysis-25hk</guid>
      <description>&lt;p&gt;&lt;strong&gt;What is Excel?&lt;/strong&gt;&lt;br&gt;
Excel is a spreadsheet application that allows organizing, manipulating, calculating and visualizing data. Uses grid od rows and columns. Each grid is a cell that holds data.&lt;br&gt;
&lt;strong&gt;How is Excel Used in the Real world&lt;/strong&gt;&lt;br&gt;
Excel is used across industries including finance, healthcare, marketing and retail. Common real-world applications;&lt;br&gt;
   Sales analysis: Tracking product performance, revenue and discounts.&lt;br&gt;
   Financial reporting: Budgeting, forecasting and expense tracking&lt;br&gt;
   Project management and planning: Create timelines, track tasks, and allocate resources.&lt;br&gt;
   Customer relationship management: Small businesses use excel to store, organize and analyze customer information and sales inquiries.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data Cleaning&lt;/strong&gt; &lt;br&gt;
This involves removing inconsistencies, duplicates and errors to prepare data for analysis.Real world data features missing values, duplicates and inconsistent formatting. Here are the steps that I worked through:&lt;br&gt;
  Back up data: Created a copy of the original dataset&lt;br&gt;
  Removing duplicates: Using data tab and removed duplicates&lt;br&gt;
  Fixing data types: converting price columns from texts to numbers&lt;br&gt;
  Handling missing values: on the ribbon is the find and replace(Ctrl + H) where you replace blank cells with the appropriate data&lt;br&gt;
  Extracting numbers from texts (e.g. “4.5 out of 5” to 4.5)&lt;br&gt;
  Use TRIM to remove spaces&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Formulas and Features learnt for Data Transformation&lt;/strong&gt;&lt;br&gt;
Excel formulas are essential for transforming raw data into usable formats.&lt;br&gt;
Aggregate functions (SUM, AVERAGE, MIN, MAX, COUNT)&lt;br&gt;
Categorizing data with IF function&lt;br&gt;
 =IF(b2&amp;gt;4, “excellent”, IF(b2&amp;gt;2, “average”, “POOR”))&lt;br&gt;
LOOK UP functions. (VLOOK UP, XLOOK UP): They search for a specific value in a table and return corresponding data.&lt;br&gt;
&lt;strong&gt;Pivot Tables for Data Analysis&lt;/strong&gt;&lt;br&gt;
Pivot tables are used to summarize and analyze data.&lt;br&gt;
With pivot tables, you can compare categories such as discount category vs number of reviews, count products and calculate averages.&lt;br&gt;
For Example; you can create a pivot table to find average rating by discount category&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frspajwjbdyx6k4iixb3r.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frspajwjbdyx6k4iixb3r.png" alt=" " width="800" height="223"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Excel allows users to create interactive dashboards using charts and slicers.&lt;br&gt;
Common charts used: Bar charts, Column charts, Pie charts, Donut charts, scatter plots.&lt;br&gt;
Example;&lt;br&gt;
Analyze the relationship between discount percentage and number of reviews.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flh9rwfqcp5rt07ae0pih.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flh9rwfqcp5rt07ae0pih.png" alt=" " width="800" height="465"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Adding a trendline helps identify whether it’s a positive, negative or no relationship between the variables.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Building Interactive Dashboards&lt;/strong&gt;&lt;br&gt;
A dashboard is a visualization tool for summarizing key performance indicators (KPIs) and data trends.&lt;br&gt;
Key elements of an excel dashboard&lt;br&gt;
KPIs: Total products, Average rating, Average discount percentage, Total reviews&lt;br&gt;
Pivot tables&lt;br&gt;
Pivot charts&lt;br&gt;
Slicers; allows users to filter data dynamically, making the dashboard interactive&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjy9aufx3qudc84e5rpah.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjy9aufx3qudc84e5rpah.png" alt=" " width="800" height="453"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conclusion&lt;/strong&gt;&lt;br&gt;
Learning excel has changed my approach to data, from overwhelming intimidating numbers to a clear, structured one. I now understand that data must be cleaned, organized and standardized before analyzing. Excel has trained my mind to think logically, like why did this happen. Tools like conditional formatting helps to spot outliers, while pivot tables help summarize large datasets. Understanding function has increased my confidence in data accuracy.&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>datascience</category>
    </item>
  </channel>
</rss>
