<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Agatha Nduku</title>
    <description>The latest articles on DEV Community by Agatha Nduku (@agatha_nduku_57a6446190bc).</description>
    <link>https://dev.to/agatha_nduku_57a6446190bc</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3952236%2Fc9fec19f-98bf-405d-bcec-15e6011464ce.jpg</url>
      <title>DEV Community: Agatha Nduku</title>
      <link>https://dev.to/agatha_nduku_57a6446190bc</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/agatha_nduku_57a6446190bc"/>
    <language>en</language>
    <item>
      <title>Joins, Modeling, Relationships and Schemas</title>
      <dc:creator>Agatha Nduku</dc:creator>
      <pubDate>Mon, 29 Jun 2026 11:33:51 +0000</pubDate>
      <link>https://dev.to/agatha_nduku_57a6446190bc/joins-modeling-relationships-and-schemas-4kfm</link>
      <guid>https://dev.to/agatha_nduku_57a6446190bc/joins-modeling-relationships-and-schemas-4kfm</guid>
      <description>&lt;h4&gt;
  
  
  Power BI has shown me how multifaceted and multi-purposed it can be. From the Data Cleaning process to data visualization.
&lt;/h4&gt;

&lt;p&gt;Lets begin with some Term definitions;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;em&gt;&lt;strong&gt;Joins&lt;/strong&gt;&lt;/em&gt;-These physically combine columns from different tables into one. They make it possible to assemble data from multiple sources.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;em&gt;&lt;strong&gt;Modeling&lt;/strong&gt;&lt;/em&gt;-This is the process of defining how business data and tables are structured and connected. It structures chaotic data into organized "business objects". It uses relationships and DAX (Data Analysis Expressions), to ensure fast performance and accurate reporting.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;em&gt;&lt;strong&gt;Relationships&lt;/strong&gt;&lt;/em&gt;- These are  logical connections between tables that allow data from multiple sources to be combined and analyzed together. hey propagate filters automatically across tables, ensuring that visuals accurately reflect data based on shared keys or IDs.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;em&gt;&lt;strong&gt;Schemas&lt;/strong&gt;&lt;/em&gt;-These define the structural organization of your data model, dictating how tables are connected and interact. The correct Schema influences the efficiency and performance of data queries and reports.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  Now that we have defined our terms, lets see how they work or what each one does.
&lt;/h3&gt;

&lt;p&gt;I will focus on &lt;strong&gt;Schemas&lt;/strong&gt; because, modeling, joins and relationships are simply integral parts of them.&lt;/p&gt;

&lt;h4&gt;
  
  
  There are Two Types of Schemas;
&lt;/h4&gt;

&lt;p&gt;&lt;strong&gt;Star Schema&lt;/strong&gt;&lt;br&gt;
This is the gold standard for Power BI modeling. It creates a simple, "star-like" pattern in your Model view. This is the most recommended Schema to use, because it is the simplest style and has the widest use.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fa6inmpa5nekxprqhnefk.JPG" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fa6inmpa5nekxprqhnefk.JPG" alt="sample star schema" width="473" height="300"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The star Schema contains two tables;&lt;br&gt;
• Fact Tables: Positioned at the center. They contain quantitative, measurable data (e.g., sales revenue, transaction quantities, dates) and unique ID.&lt;br&gt;
• Dimension Tables: Surrounding the fact table. They contain descriptive data used to filter and group your facts (e.g., product categories, customer names, regions).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Benefits of star schemas&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;• Fact/dimensional models like star schemas are simple to understand and implement, and make it easy for end users to find the data they need. They can be applied to data marts and other data resources.&lt;br&gt;
• Great for simple queries because of their reduced dependency on joins when accessing the data, as compared to normalized models like snowflake schemas.&lt;br&gt;
• Adapt well to fit OLAP models.&lt;br&gt;
• Improved query performance as compared to normalized data, because star schemas attempt to avoid computationally expensive joins.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Snowflake Schema&lt;/strong&gt;&lt;br&gt;
A snowflake schema is a logical database arrangement in a data warehouse where dimension tables are normalized and broken down into related sub-tables. &lt;br&gt;
Resembling a snowflake, it places a central fact table at the core with branched, hierarchical dimension tables radiating outward&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F180bi0fo9byqzckpdds0.JPG" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F180bi0fo9byqzckpdds0.JPG" alt="sample snowflake schema" width="386" height="209"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Characteristics of Snowflake Schema&lt;/strong&gt;&lt;br&gt;
• Requires less storage because dimension data is structured.&lt;br&gt;
• Supports multiple hierarchies within a dimension.&lt;br&gt;
• Performance decreases because of extra joins.&lt;br&gt;
• Dimensions can be sourced from different systems and integrated cleanly.&lt;br&gt;
• Good when dimension tables contain attributes at different grains.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Benefits of a Snowflake schema&lt;/strong&gt;&lt;br&gt;
• Improves data integrity through normalization.&lt;br&gt;
• Reduces redundancy and storage usage.&lt;br&gt;
• Supports detailed hierarchical drill-down.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;em&gt;How to determine which schema to use?&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;• Star schema and snowflake schema differ primarily in their structure, where snowflake schema is an expanded form of star schema that further normalizes data into additional sub-dimension tables.&lt;/p&gt;

&lt;p&gt;• Star schema utilizes a denormalized structure with fewer joins to optimize query performance and speed, while snowflake schema uses normalization to achieve greater storage efficiency and data integrity.&lt;/p&gt;

&lt;p&gt;• You can determine which schema is right for you by comparing their uses, benefits, drawbacks, and performance in key areas like scalability and ease of use.&lt;/p&gt;

&lt;p&gt;This is just but a small overview of the subject title, to know more, grab yourself some data and sit in-front of your desk, get curious, get interactive. All the best!!-&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>learning</category>
      <category>database</category>
    </item>
    <item>
      <title>"Excel in Real-World Data Analysis"</title>
      <dc:creator>Agatha Nduku</dc:creator>
      <pubDate>Sat, 06 Jun 2026 10:25:32 +0000</pubDate>
      <link>https://dev.to/agatha_nduku_57a6446190bc/excel-in-real-world-data-analysis-2728</link>
      <guid>https://dev.to/agatha_nduku_57a6446190bc/excel-in-real-world-data-analysis-2728</guid>
      <description>&lt;h2&gt;
  
  
  What is Excel
&lt;/h2&gt;

&lt;p&gt;According to Wikipedia: &lt;br&gt;
&lt;em&gt;Microsoft Excel, or simply Excel, is a spreadsheet editor developed by Microsoft for Windows, macOS, Android, iOS and iPadOS. It features calculation or computation capabilities, graphing tools, pivot tables, and a macro programming language called Visual Basic for Applications (VBA).&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;In simpler terms, Excel forms part of the Microsoft 365 and Microsoft Office suites of software and has been developed since 1985. It organizes data in columns/rows and allows you to do mathematical functions.&lt;/p&gt;

&lt;h2&gt;
  
  
  How is Excel used in real-world data analysis
&lt;/h2&gt;

&lt;p&gt;From the definitions above, it is safe to say that the core roles of Excel are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data Recording/Storage &lt;/li&gt;
&lt;li&gt;Data Analysis/Manipulation&lt;/li&gt;
&lt;li&gt;Data Visualization/Presentation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The image below is a visual representation of an Excel Spreadsheet.&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe827dg13vs9mywnn6wgc.JPG" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe827dg13vs9mywnn6wgc.JPG" alt="Excel Spreadsheet" width="799" height="378"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Lets explore these roles.
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;Data Recording/Storage: Excel has a main spreadsheet format which holds data in worksheets, charts, and macros. This allows supply of data to others for calculation and display.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Data Analysis/Manipulation: Excel comes with functions that allow a person to format or calculate data.  It can edit and format text in cells, calculate formulas, search within the spreadsheet, sort rows and columns, freeze panes, filter the columns, add comments, and create charts among many others.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Data Visualization/Presentation: Excel comes with features that allow the data to be saved in the form of a table, graph, pie chart etc. which makes analyzing/visualizing the information much easier. Thus making it more readable and understandable.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  Excel features or formulas
&lt;/h3&gt;

&lt;p&gt;I will start with the basics and list these in no particular order of importance/functional hierarchy.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Autofill - It automatically populates cells with data or helps create a sequence.&lt;/li&gt;
&lt;li&gt;Text Wrap - It makes overflowing content fit within a cell by expanding the cell vertically.&lt;/li&gt;
&lt;li&gt;Find and Replace -It is used to quickly locate, edit and update data across the worksheet.&lt;/li&gt;
&lt;li&gt;Data Validation - It is used to restrict cell inputs, prevent typos, and ensure users enter clean, standardized data. &lt;/li&gt;
&lt;li&gt;Custom sort - It is used to sort data in a specific sequence (like months, sizes or priority levels) rather than alphabetically.&lt;/li&gt;
&lt;li&gt;Conditional formatting - It is used to automatically change the color, font or border of a cell based on its value.&lt;/li&gt;
&lt;li&gt;Functions - These are pre-built formulas designed to automate calculations and data analysis.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Now that we have seen some of the magic that excel can work on data, lets move on to something else. Within excel, of course!&lt;/p&gt;

&lt;h2&gt;
  
  
  My personal reflection:
&lt;/h2&gt;

&lt;h3&gt;
  
  
  How has learning Excel changed the way I see data?
&lt;/h3&gt;

&lt;p&gt;I now see data not just as numbers on a spreadsheet but as a world of information, manipulatable to answer questions and tell narratives.&lt;/p&gt;

&lt;p&gt;From a single excel spreadsheet, I have learnt that it is possible to extrapolate and/or interpolate a myriad of information and accurately project outcomes. &lt;/p&gt;

&lt;p&gt;See how I am already using big words? That is the confidence I have acquired from my lessons in Data Science and Analytics so far.&lt;/p&gt;

&lt;h4&gt;
  
  
  Some of the direct benefits of using excel I have learnt so far include:
&lt;/h4&gt;

&lt;p&gt;*&lt;em&gt;Numeric precision *&lt;/em&gt;- its inbuilt features when correctly used allow consistent, error-free operations across massive data bases.&lt;/p&gt;

&lt;p&gt;*&lt;em&gt;Complex Formula Integrity *&lt;/em&gt;- Formulas correctly interpret massive data, preventing errors across multiple worksheets.&lt;/p&gt;

&lt;p&gt;*&lt;em&gt;Easy Data recovery *&lt;/em&gt;- If the information is written on a piece of paper, finding it may take longer, however, this is not the case with excel spreadsheets. Finding and recovering data is easy.&lt;/p&gt;

&lt;p&gt;There you have it, an incentive to get acquainted with the excel world. Cheers!&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>datascience</category>
      <category>programming</category>
    </item>
  </channel>
</rss>
