<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: WaithakaJoseph</title>
    <description>The latest articles on DEV Community by WaithakaJoseph (@waithakajoseph).</description>
    <link>https://dev.to/waithakajoseph</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3820514%2Fe6b39770-4261-4b60-995d-50679ca6f89d.png</url>
      <title>DEV Community: WaithakaJoseph</title>
      <link>https://dev.to/waithakajoseph</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/waithakajoseph"/>
    <language>en</language>
    <item>
      <title>Article on Modelling, Joins, Relationships and Different Schemas In Power BI</title>
      <dc:creator>WaithakaJoseph</dc:creator>
      <pubDate>Tue, 30 Jun 2026 21:47:03 +0000</pubDate>
      <link>https://dev.to/waithakajoseph/article-on-modelling-joins-relationships-and-different-schemas-in-power-bi-1oc7</link>
      <guid>https://dev.to/waithakajoseph/article-on-modelling-joins-relationships-and-different-schemas-in-power-bi-1oc7</guid>
      <description>&lt;h1&gt;
  
  
  Data Modeling, Relationships, and Schemas in Data Analytics
&lt;/h1&gt;

&lt;p&gt;In the fields of data analytics, data warehousing, and database management, modeling and schema design are the fundamental pillars used to organize and query information efficiently.&lt;/p&gt;

&lt;h2&gt;
  
  
  This article provides a comprehensive guide to these core concepts.
&lt;/h2&gt;

&lt;h2&gt;
  
  
  1. Data Modeling
&lt;/h2&gt;

&lt;p&gt;Data modeling is the architectural process of designing how data is stored, interconnected, and accessed within a system.&lt;/p&gt;

&lt;h3&gt;
  
  
  Core Questions Addressed:
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Storage:&lt;/strong&gt; What specific data points need to be captured?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Structure:&lt;/strong&gt; How should individual tables be organized?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Connectivity:&lt;/strong&gt; How do these tables interact with one another?&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Levels of Data Models:
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Conceptual Model:&lt;/strong&gt; A high-level business perspective focusing on entities and their relationships, devoid of technical specifications.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Logical Model:&lt;/strong&gt; Defines specific attributes, keys, and relationships. It is independent of the Database Management System (DBMS).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Physical Model:&lt;/strong&gt; The actual implementation within a database, including technical details like indexes, partitions, and storage requirements.&lt;/li&gt;
&lt;/ol&gt;




&lt;h2&gt;
  
  
  2. Relationships
&lt;/h2&gt;

&lt;p&gt;Relationships define the logic of how data in one table corresponds to data in another.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;One-to-One (1:1):&lt;/strong&gt; A single record in Table A relates to exactly one record in Table B.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;One-to-Many (1:M):&lt;/strong&gt; The most common relationship; for example, one &lt;strong&gt;Customer&lt;/strong&gt; can place many &lt;strong&gt;Orders&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Many-to-Many (M:M):&lt;/strong&gt; Multiple records in one table relate to multiple records in another. This requires a &lt;strong&gt;Junction Table&lt;/strong&gt; (Bridge Table) to function. &lt;em&gt;Example: One Student can enroll in many Courses, and one Course contains many Students.&lt;/em&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  3. SQL Joins
&lt;/h2&gt;

&lt;p&gt;Joins are used to combine rows from two or more tables based on a related column.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Join Type&lt;/th&gt;
&lt;th&gt;Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Inner Join&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Returns only the records that have matching values in both tables.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Left Join&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Returns all records from the left table and the matched records from the right.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Right Join&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Returns all records from the right table and the matched records from the left.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Full Outer Join&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Returns all records when there is a match in either the left or right table.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Cross Join&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Returns the Cartesian product (every possible combination) of the two tables.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Self Join&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;A regular join in which a table is joined with itself.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  4. Database Schemas
&lt;/h2&gt;

&lt;p&gt;A schema is the logical configuration that defines how tables are organized and managed.&lt;/p&gt;

&lt;h3&gt;
  
  
  A. Star Schema
&lt;/h3&gt;

&lt;p&gt;The standard for data warehousing and Power BI. It features a central &lt;strong&gt;Fact Table&lt;/strong&gt; (quantitative data) surrounded by &lt;strong&gt;Dimension Tables&lt;/strong&gt; (descriptive data).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Visual Representation:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;          Dim_Customer
                |
Dim_Product -- Fact_Sales -- Dim_Date
                |
          Dim_Store
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Pros:&lt;/strong&gt; Simple to understand, high query performance, optimized for reporting.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cons:&lt;/strong&gt; Includes some data redundancy (denormalized).&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  B. Snowflake Schema
&lt;/h3&gt;

&lt;p&gt;An extension of the star schema where dimension tables are &lt;strong&gt;normalized&lt;/strong&gt; (split into additional related tables).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Visual Representation:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;          Country_Table
               |
          Dim_Customer
               |
Dim_Product -- Fact_Sales -- Dim_Date
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Pros:&lt;/strong&gt; Reduced data redundancy and improved data integrity.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cons:&lt;/strong&gt; Requires more complex queries and more joins, which can impact performance.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  C. Galaxy Schema (Fact Constellation)
&lt;/h3&gt;

&lt;p&gt;A complex design where multiple fact tables share the same dimension tables.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Structure:&lt;/strong&gt; Multiple Fact tables (e.g., &lt;strong&gt;Sales&lt;/strong&gt; and &lt;strong&gt;Returns&lt;/strong&gt;) linked to shared dimensions (e.g., &lt;strong&gt;Date&lt;/strong&gt; and &lt;strong&gt;Product&lt;/strong&gt;).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Pros:&lt;/strong&gt; Supports complex business processes and enterprise-level analytics.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cons:&lt;/strong&gt; Highly difficult to design and maintain.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  5. Integrity Keys
&lt;/h2&gt;

&lt;p&gt;Keys are the unique identifiers that maintain the relationship between tables.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Primary Key (PK):&lt;/strong&gt; A unique identifier for a specific row within its own table. It cannot contain null values.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Foreign Key (FK):&lt;/strong&gt; A column in one table that points to the Primary Key of another table, creating a link between the two.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  6. Normalization vs. Denormalization
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;  &lt;strong&gt;Normalization:&lt;/strong&gt; The process of organizing data to minimize redundancy (used in &lt;strong&gt;OLTP&lt;/strong&gt; - transactional systems like banking apps).&lt;/li&gt;
&lt;li&gt;  &lt;strong&gt;Denormalization:&lt;/strong&gt; The process of combining tables to speed up data retrieval (used in &lt;strong&gt;OLAP&lt;/strong&gt; - analytical systems like Power BI or Tableau dashboards).&lt;/li&gt;
&lt;/ul&gt;




&lt;h3&gt;
  
  
  In Conclusion
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt; &lt;strong&gt;Identify Entities:&lt;/strong&gt; What are the "things" you are tracking? (People, Places, Transactions).&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Define Keys:&lt;/strong&gt; Every table needs a &lt;strong&gt;Primary Key&lt;/strong&gt; (unique ID). Connect tables using &lt;strong&gt;Foreign Keys&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Choose your Schema:&lt;/strong&gt; Use &lt;strong&gt;Star Schema&lt;/strong&gt; for most reporting needs. Use &lt;strong&gt;Normalization&lt;/strong&gt; if you are building an app that handles many live transactions.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Optimize Joins:&lt;/strong&gt; Always join on indexed columns (like IDs) to ensure your queries don't slow down as your data grows.&lt;/li&gt;
&lt;/ol&gt;

</description>
      <category>analytics</category>
      <category>architecture</category>
      <category>data</category>
      <category>database</category>
    </item>
    <item>
      <title>How Excel is Used in Real-World Data Analysis: Excel Is More Than Just a Spreadsheet</title>
      <dc:creator>WaithakaJoseph</dc:creator>
      <pubDate>Wed, 25 Mar 2026 18:28:24 +0000</pubDate>
      <link>https://dev.to/waithakajoseph/how-excel-is-used-in-real-world-data-analysis-excel-is-more-than-just-a-spreadsheet-42j0</link>
      <guid>https://dev.to/waithakajoseph/how-excel-is-used-in-real-world-data-analysis-excel-is-more-than-just-a-spreadsheet-42j0</guid>
      <description>&lt;p&gt;In the world of data, tools like Python, Power BI and Tableau often steal the spotlight. However, there is one veteran tool that remains the undisputed backbone of business operations globally: &lt;strong&gt;Microsoft Excel&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;While many view Excel as just a tool for making lists, it is actually a data analysis engine. In this article, I will explore how Excel is applied in the real world and share some key features that turn raw numbers into actionable intel.&lt;/p&gt;

&lt;h2&gt;
  
  
  What is Microsoft Excel?
&lt;/h2&gt;

&lt;p&gt;At its core, Microsoft Excel is a spreadsheet application developed by Microsoft that uses a grid of cells to organize, manipulate and calculate data. Its power lies in its versatility. It can handle everything from a simple grocery list to complex financial models used by corporations worldwide.&lt;/p&gt;

&lt;h2&gt;
  
  
  Excel in the Real World: Practical Scenarios
&lt;/h2&gt;

&lt;p&gt;In the professional world, data is rarely clean or easy to understand. Excel acts as the "Swiss Army Knife" for professionals across various industries:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Finance and Accounting:&lt;/strong&gt; Finance &amp;amp; Accounting teams run on Excel. Analysts use it to forecast company growth, build budget models and calculate profit margins.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Marketing and Sales:&lt;/strong&gt; Marketing teams use Excel to track campaign performance, calculate Return on Investment (ROI) and segment customers based on purchasing behavior.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Human Resources:&lt;/strong&gt; HR departments use Excel to manage employee payroll, track attendance and analyze diversity and inclusion metrics within an organization.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Logistics and Supply Chain:&lt;/strong&gt; Companies use it to track inventory levels, manage shipping schedules and optimize warehouse space.&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Excel Power Features: Turning Data into Information
&lt;/h2&gt;

&lt;p&gt;During my journey learning Excel, I discovered that the true magic happens when you stop entering data and start analyzing it. Here are a few features that are game-changers in real-world analysis:&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Pivot Tables
&lt;/h3&gt;

&lt;p&gt;Pivot Tables are perhaps the most powerful feature in Excel. They allow you to take thousands of rows of data and summarize them in seconds. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  &lt;strong&gt;Application:&lt;/strong&gt; If you have a year’s worth of sales data, a Pivot Table can instantly show you which product sold the most in the month of July without writing a single formula.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  2. XLOOKUP (and VLOOKUP)
&lt;/h3&gt;

&lt;p&gt;Data is often scattered across different sheets. The &lt;code&gt;XLOOKUP&lt;/code&gt; formula allows you to merge this data seamlessly.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  &lt;strong&gt;Application:&lt;/strong&gt; Imagine you have a list of Product IDs in one sheet and a Price List in another. Using &lt;code&gt;XLOOKUP&lt;/code&gt;, you can automatically "look up" the price for each ID, saving hours of manual searching.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  3. IF Statements
&lt;/h3&gt;

&lt;p&gt;Logic is central to data analysis. The &lt;code&gt;IF&lt;/code&gt; function allows you to categorize data based on specific conditions.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  &lt;strong&gt;Formula Example:&lt;/strong&gt; &lt;code&gt;=IF(B2&amp;gt;1000, "High Value", "Low Value")&lt;/code&gt;
&lt;/li&gt;
&lt;li&gt;  &lt;strong&gt;Application:&lt;/strong&gt; A manager can use this to instantly flag "High Value" orders that need special attention.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  4. Conditional Formatting
&lt;/h3&gt;

&lt;p&gt;Data analysis is as much about visualization as it is about math. Conditional formatting allows you to highlight cells based on their values.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  &lt;strong&gt;Application:&lt;/strong&gt; You can set a rule to turn all cells red if a project is over budget, allowing a stakeholder to spot a problem the moment they open the file.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Personal Reflection: A New Way of Seeing
&lt;/h2&gt;

&lt;p&gt;Learning Excel has fundamentally changed the way I understand data. Before, I saw data as a static, intimidating mountain of numbers. I used to think that "analysis" was something only mathematicians and accountants did.&lt;/p&gt;

&lt;p&gt;Now, I interpret data as a &lt;strong&gt;story waiting to be told&lt;/strong&gt;. Excel has taught me that with the right formula and a clear structure, you can strip away the noise and find the "Why" behind the numbers. It has shifted my mindset from being a passive observer of information to an active problem-solver. Whether I’m looking at personal finances or professional KPIs, I now look for patterns, trends and outliers that I would have been blind to before.&lt;/p&gt;

</description>
      <category>analytics</category>
      <category>datascience</category>
      <category>microsoft</category>
      <category>productivity</category>
    </item>
  </channel>
</rss>
