<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: martin georges</title>
    <description>The latest articles on DEV Community by martin georges (@martin_georges_matata).</description>
    <link>https://dev.to/martin_georges_matata</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3818859%2F0d2318d4-6a39-4fa7-922a-c305517ddeed.jpg</url>
      <title>DEV Community: martin georges</title>
      <link>https://dev.to/martin_georges_matata</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/martin_georges_matata"/>
    <language>en</language>
    <item>
      <title>Understanding Data Modeling in Power BI: Joins, Relationships, and Schemas Explained</title>
      <dc:creator>martin georges</dc:creator>
      <pubDate>Sun, 29 Mar 2026 19:49:33 +0000</pubDate>
      <link>https://dev.to/martin_georges_matata/understanding-data-modeling-in-power-bi-joins-relationships-and-schemas-explained-2p20</link>
      <guid>https://dev.to/martin_georges_matata/understanding-data-modeling-in-power-bi-joins-relationships-and-schemas-explained-2p20</guid>
      <description>&lt;h2&gt;
  
  
  Introduction
&lt;/h2&gt;

&lt;p&gt;Last week in our Power BI class we focused on one of the most important concepts in data analysis: &lt;strong&gt;data modeling&lt;/strong&gt;. Data Modeling is the process of analyzing and defining different data type that is collected or produced. I learnt that before creating dashboards or visualizations, it’s critical to properly structure and connect your data. Our teacher illustrated the importance of using &lt;strong&gt;power query&lt;/strong&gt; to remove duplicates, handle blanks , filter and sort data and merge and reshape datasets.&lt;/p&gt;

&lt;h2&gt;
  
  
  Joins vs Relationships
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Joins&lt;/strong&gt; are used in tools like SQL or Power Query to physically combine tables into one. In Power BI, joins usually happen in Power Query before the data is loaded into the model.&lt;br&gt;
&lt;strong&gt;Relationships&lt;/strong&gt; in Power BI link tables without merging them, allowing for more flexible and efficient analysis.&lt;/p&gt;

&lt;p&gt;we were taught that Power BI prefers relationships over joins because they help maintain clean, scalable data models.&lt;/p&gt;

&lt;h2&gt;
  
  
  SQL join types
&lt;/h2&gt;

&lt;p&gt;An &lt;strong&gt;inner join&lt;/strong&gt; returns only the rows that have matching values in two tables. &lt;/p&gt;

&lt;p&gt;A &lt;strong&gt;left join&lt;/strong&gt; keeps all the rows from the left table and only brings in matching rows from the right table. &lt;/p&gt;

&lt;p&gt;A &lt;strong&gt;right join&lt;/strong&gt; does the opposite—it keeps all rows from the right table and matches what it can from the left. &lt;/p&gt;

&lt;p&gt;A &lt;strong&gt;full outer join&lt;/strong&gt; returns all rows from two tables, whether there’s a match or not. &lt;/p&gt;

&lt;p&gt;A &lt;strong&gt;left anti join&lt;/strong&gt; returns rows from the left table that don’t have a match in the right. &lt;/p&gt;

&lt;p&gt;Similarly, a &lt;strong&gt;right anti join&lt;/strong&gt; shows rows from the right table that don’t have a match in the left. &lt;/p&gt;

&lt;h2&gt;
  
  
  Types of Relationships
&lt;/h2&gt;

&lt;p&gt;We explored different types of relationships:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;One-to-Many&lt;/strong&gt; is the most common (e.g., one product category to many products)&lt;br&gt;
&lt;strong&gt;Many-to-One&lt;/strong&gt; is the reverse of one-to-many&lt;br&gt;
&lt;strong&gt;Many-to-Many&lt;/strong&gt; is more complex and is used when both tables contain duplicates&lt;/p&gt;

&lt;p&gt;I also learned about &lt;strong&gt;cross-filter direction&lt;/strong&gt;, which determines how data flows between tables.&lt;/p&gt;




&lt;h2&gt;
  
  
  Data Schemas
&lt;/h2&gt;

&lt;p&gt;Another important concept was &lt;strong&gt;data schemas&lt;/strong&gt;, especially:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Star Schema&lt;/strong&gt;&lt;br&gt;
  One central &lt;strong&gt;fact table&lt;/strong&gt; &lt;br&gt;
  Connected to multiple &lt;strong&gt;dimension tables&lt;/strong&gt; &lt;br&gt;
  Best practice for Power BI models&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Snowflake Schema&lt;/strong&gt;&lt;br&gt;
  More normalized (dimension tables split further)&lt;br&gt;
  Can be more complex but sometimes necessary&lt;/p&gt;

&lt;p&gt;The star schema stood out as the most efficient and recommended approach.&lt;/p&gt;

&lt;h2&gt;
  
  
  Practical Application
&lt;/h2&gt;

&lt;p&gt;During the session, I practiced:&lt;br&gt;
 Creating relationships between tables in Power BI&lt;br&gt;
 Defining primary and foreign keys&lt;br&gt;
 Avoiding common issues like circular relationships&lt;br&gt;
 Structuring datasets into a star schema format&lt;/p&gt;

&lt;h2&gt;
  
  
  Personal Reflection
&lt;/h2&gt;

&lt;p&gt;Before this class, i knew power BI as a tool for only presenting data. Now i can use it for cleaning data and creating visuals.I understand how to connect data and that &lt;strong&gt;how data is connected is just as important as the data itself&lt;/strong&gt;.&lt;/p&gt;

</description>
      <category>analytics</category>
      <category>beginners</category>
      <category>data</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>How Excel is used in Real-World Data Analysis</title>
      <dc:creator>martin georges</dc:creator>
      <pubDate>Sat, 28 Mar 2026 16:58:33 +0000</pubDate>
      <link>https://dev.to/martin_georges_matata/how-excel-is-used-in-real-world-data-analysis-paj</link>
      <guid>https://dev.to/martin_georges_matata/how-excel-is-used-in-real-world-data-analysis-paj</guid>
      <description>&lt;h2&gt;
  
  
  Introduction
&lt;/h2&gt;

&lt;p&gt;The definition of Microsoft excel according to a simple google search is  " A powerful electronic spreadsheet application developed by Microsoft, used to organize, analyze, and store data in rows and columns." I have used it to organize, analyze, and visualize all sorts of data in my science career. excel shines because everyone knows how to use it , it requires no coding skills, and is quicker in data analysis compared to other data analysis software ( may be because it is 'lighter' for the processor) coupled with the fact that it requires no dependencies, environments and deployments.&lt;/p&gt;

&lt;h2&gt;
  
  
  How is it used in real-world scenarios
&lt;/h2&gt;

&lt;p&gt;Excel is applied in a myriad of sectors to support in decision making. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;In health care public health professionals are able to collect and store data from health facilities and public agencies using excel. Analysis of the data can estimate the number of people showing up in hospitals with particular infections or cancers and they can use that data to guide them on prioritization during allocation of resources.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;In banks and financial institutions operations, excel is used for budgeting, financial modeling, forecasting, and risk analysis. The business development team uses excel in account opening tracking, performance dashboards, and market analysis.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Features I Have Learned and Their Applications
&lt;/h2&gt;

&lt;p&gt;Data cleaning: This is the first process for any data set. Here we try to understand our data set by checking the type of data each column/cell has. we then align the data type to the operation we want to do with the data. for example if the format of a cell with a number is not appropriate calculations involving that cell will bring erroneous value. By right clicking and changing the format of the cell or column. this is corrected.&lt;/p&gt;

&lt;p&gt;NB/: A column with more than 60% blanks should be completely excluded from the dataset.&lt;/p&gt;

&lt;p&gt;Formulas &amp;amp; Functions (e.g., SUM IF, VLOOKUP/XLOOKUP): Used to automate calculations, categorize data, and retrieve information efficiently.&lt;/p&gt;

&lt;p&gt;Consider the image below to illustrate the features I am going to explain: &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw6cbk4atct27y1tb53w3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw6cbk4atct27y1tb53w3.png" alt="excel sheet" width="800" height="533"&gt;&lt;/a&gt; &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;I would use &lt;strong&gt;sumif&lt;/strong&gt;to get the total value of sales before a particular date. =SUMIF(A2:A10, "Apple", B2:B10) sums values in&lt;br&gt;
if corresponding cells in B2 to B10 is equal "Apple".&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Countif&lt;/strong&gt; can be used to count cells that meet a single specific criterion e.g =COUNTIF(A2:A10, "Apple") counts how many cells in the range A2:A10 contain "Apple". this should return a value of 3 in the cell you create the formula&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Vlookup&lt;/strong&gt;and &lt;strong&gt;Xlookup&lt;/strong&gt;to find a specific value in a column or whole data set. this is particularly helpful when handling blanks in a dataset.=VLOOKUP("Apple", A2:B9, 2, FALSE) would bring up 100&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;CONCAT&lt;/strong&gt; is used to combine text from multiple cells. =CONCAT(A1, " ", B1) will combine a name in cell A1 and cell A2 leaving a space between the words. i.e Apple 100&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;MAX/MIN&lt;/strong&gt;: Returns the highest/lowest value. =MAX(B2:B9)  200&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Pivot Tables&lt;/strong&gt;: can be used to summarize datasets without writing formulas. It is crucial when identifying trends. see example below:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk3k9v4p2pfffc4ern76p.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk3k9v4p2pfffc4ern76p.jpeg" alt="Pivot table" width="800" height="121"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Personal experience
&lt;/h2&gt;

&lt;p&gt;Before my training, I viewed datasets as rigid structures with very limited possibilities for manipulation. Now, I understand that data is flexible. Even blank cells can be meaningfully handled and included in analysis. I’ve also learned how to create automated formulas that reduce workload and deepen my understanding as I analyze data.&lt;/p&gt;

&lt;p&gt;Most importantly, Excel has been a stepping stone into data science, helping me build a strong foundation in working with structured data and translating it into meaningful insights.&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>learning</category>
      <category>datascience</category>
    </item>
  </channel>
</rss>
