<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Annah Okero</title>
    <description>The latest articles on DEV Community by Annah Okero (@annah_the_analyst).</description>
    <link>https://dev.to/annah_the_analyst</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3952981%2Fc3142e87-e0c8-4b34-9ded-00ee9ca16913.png</url>
      <title>DEV Community: Annah Okero</title>
      <link>https://dev.to/annah_the_analyst</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/annah_the_analyst"/>
    <language>en</language>
    <item>
      <title>Power BI: From Data Cleaning to Interactive Dashboards</title>
      <dc:creator>Annah Okero</dc:creator>
      <pubDate>Sun, 28 Jun 2026 19:19:31 +0000</pubDate>
      <link>https://dev.to/annah_the_analyst/power-bi-from-data-cleaning-to-interactive-dashboards-d8p</link>
      <guid>https://dev.to/annah_the_analyst/power-bi-from-data-cleaning-to-interactive-dashboards-d8p</guid>
      <description>&lt;p&gt;Imagine you are an analyst handling messy data and you need to build a dashboard for the team to understand the relationship between the numbers. The journey begins with &lt;strong&gt;data cleaning in Power Query&lt;/strong&gt;, where inaccurate, duplicate, or missing values are corrected. &lt;/p&gt;

&lt;p&gt;Managing missing values is a critical step in data cleaning, as it improves data quality and ensures accurate analysis. Here is the trick:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Text columns&lt;/strong&gt;: Replace missing values with "&lt;strong&gt;N/A&lt;/strong&gt;" or "&lt;strong&gt;Unknown&lt;/strong&gt;" to indicate that the information is unavailable while maintaining consistency in the dataset.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Numeric columns&lt;/strong&gt;: Missing values can either be left as blank &lt;strong&gt;(null)&lt;/strong&gt; or replaced using appropriate statistical measures such as the &lt;strong&gt;mean&lt;/strong&gt;, &lt;strong&gt;median&lt;/strong&gt;, or &lt;strong&gt;mode&lt;/strong&gt;, depending on the nature of the data and the analysis being performed. Numeric fields are generally the only columns where leaving blanks is acceptable without affecting data integrity.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Rows with excessive missing data&lt;/strong&gt;: If roughly 90% or more of a row's values are missing, it is often best to remove the row, as it contributes little or no meaningful information and may reduce the quality and reliability of the analysis.&lt;/li&gt;
&lt;/ul&gt;
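
&lt;p&gt;The three rules above can be sketched outside Power Query as well. The Python snippet below is only an illustration of the logic, not the Power Query steps themselves; the sample rows, the column names, and the choice of the median as the fill value are all assumptions made for the example.&lt;/p&gt;

```python
from statistics import median

# Hypothetical sample rows; None marks a missing value.
ROWS = [
    {"customer": "Amina", "region": "Nairobi", "units": 12, "price": 4.5},
    {"customer": None, "region": "Mombasa", "units": None, "price": 3.0},
    {"customer": None, "region": None, "units": None, "price": None},
    {"customer": "Brian", "region": None, "units": 8, "price": 5.0},
]
TEXT_COLUMNS = ["customer", "region"]
NUMERIC_COLUMNS = ["units", "price"]
MAX_MISSING_SHARE = 0.9  # rows at or above this share of blanks are dropped


def missing_share(row):
    """Return the fraction of fields in a row that are missing."""
    return sum(value is None for value in row.values()) / len(row)


def clean(rows):
    """Drop near-empty rows, then fill text and numeric gaps."""
    # Work on copies so the original rows are left untouched.
    kept = [dict(row) for row in rows if MAX_MISSING_SHARE > missing_share(row)]

    # Text columns: label unavailable information explicitly.
    for row in kept:
        for column in TEXT_COLUMNS:
            if row[column] is None:
                row[column] = "Unknown"

    # Numeric columns: fill with the median of the observed values.
    # (Leaving None in place is the other option described above.)
    for column in NUMERIC_COLUMNS:
        observed = [row[column] for row in kept if row[column] is not None]
        fill = median(observed)
        for row in kept:
            if row[column] is None:
                row[column] = fill
    return kept


cleaned = clean(ROWS)
```

&lt;p&gt;With this sample, the third row is removed because every field is missing, the blank customer and region become "Unknown", and the blank units value is filled with the median of the remaining rows.&lt;/p&gt;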

&lt;h2&gt;
  
  
  Data Modeling
&lt;/h2&gt;

&lt;p&gt;Once the data has been cleaned, the next step is &lt;strong&gt;data modeling&lt;/strong&gt;, where the data is organized into a logical structure that supports efficient analysis and reporting. Data modeling involves defining relationships between tables, creating calculated columns and measures, and organizing the data to improve report performance. A well-designed data model reduces redundancy, enhances query speed, and makes it easier to build accurate and interactive dashboards.&lt;/p&gt;

&lt;p&gt;During data modeling, it is important to understand the two main types of tables used in Power BI:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Fact Tables&lt;/strong&gt;: These contain measurable business data or transactions, such as sales, orders, revenue, or inventory movements. Fact tables typically include numeric values that can be aggregated and analyzed.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Dimension Tables&lt;/strong&gt;: These provide descriptive information that gives context to the facts, such as customers, products, employees, locations, and dates. Dimension tables help categorize and filter the data for meaningful analysis.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Relationships in Power BI
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Relationships&lt;/strong&gt; define how tables are connected within the data model using a common field. They are created using &lt;strong&gt;Primary Keys&lt;/strong&gt; (PK) and &lt;strong&gt;Foreign Keys&lt;/strong&gt; (FK), allowing Power BI to filter and analyze data across multiple tables without duplicating information. Power BI may detect relationships automatically, but reviewing and creating them manually helps ensure an accurate data model.&lt;br&gt;
Common relationship cardinalities include One-to-Many (1:N), Many-to-One (N:1), One-to-One (1:1), and Many-to-Many (N:N), with One-to-Many being the most commonly used in Star Schema models.&lt;/p&gt;
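
&lt;p&gt;To make the primary key and foreign key idea concrete, here is a minimal Python sketch of a one-to-many relationship between a dimension table and a fact table. The tables, the &lt;code&gt;product_id&lt;/code&gt; key, and the &lt;code&gt;total_by_category&lt;/code&gt; helper are hypothetical; Power BI handles this lookup internally once the relationship is defined.&lt;/p&gt;

```python
from collections import defaultdict

# Hypothetical dimension table: one row per product, keyed by product_id (PK).
products = {
    1: {"name": "Notebook", "category": "Stationery"},
    2: {"name": "Pen", "category": "Stationery"},
    3: {"name": "Backpack", "category": "Bags"},
}

# Hypothetical fact table: many sales rows can point at one product (FK).
sales = [
    {"product_id": 1, "amount": 20.0},
    {"product_id": 2, "amount": 5.0},
    {"product_id": 1, "amount": 30.0},
    {"product_id": 3, "amount": 45.0},
]


def total_by_category(sales, products):
    """Aggregate fact rows by a dimension attribute, following FK to PK."""
    totals = defaultdict(float)
    for sale in sales:
        # The foreign key on the fact row selects exactly one dimension row.
        category = products[sale["product_id"]]["category"]
        totals[category] += sale["amount"]
    return dict(totals)


totals = total_by_category(sales, products)
```

&lt;p&gt;The two tables stay separate: the category is stored once per product and is only looked up when the sales are aggregated, which is the sense in which a relationship avoids duplicating information.&lt;/p&gt;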

&lt;h2&gt;
  
  
  Joins in Power BI
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Joins&lt;/strong&gt; are used in Power Query to combine data from two or more tables based on a shared column before the data is loaded into the model. Unlike relationships, which connect tables without merging them, joins physically merge the data into a single result. The main join types are Inner Join, Left Outer Join, Right Outer Join, Full Outer Join, Left Anti Join, and Right Anti Join. A Cross Join is also possible, although Power Query builds it with a custom column rather than through the Merge dialog. Selecting the appropriate join type ensures the correct records are returned and improves data quality for analysis.&lt;/p&gt;
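
&lt;p&gt;The difference between join types is easiest to see on a tiny example. The sketch below reimplements three of them (inner, left outer, and left anti) in plain Python; the &lt;code&gt;join&lt;/code&gt; function, its &lt;code&gt;kind&lt;/code&gt; labels, and the sample tables are assumptions for illustration and are not Power Query syntax.&lt;/p&gt;

```python
# Hypothetical tables sharing the column "customer_id".
orders = [
    {"order_id": 101, "customer_id": 1},
    {"order_id": 102, "customer_id": 2},
    {"order_id": 103, "customer_id": 4},
]
customers = [
    {"customer_id": 1, "name": "Amina"},
    {"customer_id": 2, "name": "Brian"},
    {"customer_id": 3, "name": "Chao"},
]
KINDS = {"inner", "left_outer", "left_anti"}


def join(left, right, key, kind):
    """Merge two lists of dicts on `key` using one of the kinds in KINDS."""
    if kind not in KINDS:
        raise ValueError(f"unsupported join kind: {kind}")

    # Index the right table by key so each left row can find its matches.
    lookup = {}
    for row in right:
        lookup.setdefault(row[key], []).append(row)
    right_columns = [column for column in right[0] if column != key]

    result = []
    for row in left:
        matches = lookup.get(row[key], [])
        if kind == "left_anti":
            # Keep only left rows that have no partner on the right.
            if not matches:
                result.append(dict(row))
        elif matches:
            # Inner and left outer both emit one merged row per match.
            result.extend({**row, **match} for match in matches)
        elif kind == "left_outer":
            # Unmatched left rows survive, with blanks for the right columns.
            result.append({**row, **{column: None for column in right_columns}})
    return result


inner = join(orders, customers, "customer_id", "inner")
left_outer = join(orders, customers, "customer_id", "left_outer")
left_anti = join(orders, customers, "customer_id", "left_anti")
```

&lt;p&gt;Here the inner join returns orders 101 and 102, the left outer join also keeps order 103 with a blank name, and the left anti join returns only order 103, which makes it handy for finding records that have no match.&lt;/p&gt;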

&lt;p&gt;Another important concept is the &lt;strong&gt;Star Schema&lt;/strong&gt;, where a central &lt;strong&gt;fact table&lt;/strong&gt; is connected to multiple &lt;strong&gt;dimension tables&lt;/strong&gt;. This design is preferred because it is simple, fast, and optimized for analytical queries. In contrast, the &lt;strong&gt;Snowflake Schema&lt;/strong&gt; &lt;em&gt;normalizes dimension tables&lt;/em&gt; into additional related tables. While it reduces data redundancy, it introduces more relationships, making the model slightly more complex but useful for highly structured datasets.&lt;/p&gt;

&lt;p&gt;The final step is &lt;strong&gt;dashboard creation&lt;/strong&gt;, where visualizations such as charts, KPIs, maps, slicers, and tables are combined into an interactive report. The real value of Power BI isn't just building reports; it is telling a story that drives action.&lt;/p&gt;

&lt;p&gt;If you could build one Power BI dashboard to solve a real-world problem, what would it be and why?&lt;/p&gt;

</description>
      <category>analytics</category>
      <category>datascience</category>
      <category>microsoft</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>How Excel is Used in Real-World Data Analysis</title>
      <dc:creator>Annah Okero</dc:creator>
      <pubDate>Mon, 08 Jun 2026 17:22:16 +0000</pubDate>
      <link>https://dev.to/annah_the_analyst/how-excel-is-used-in-real-world-data-analysis-1c68</link>
      <guid>https://dev.to/annah_the_analyst/how-excel-is-used-in-real-world-data-analysis-1c68</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fog5oasdn289sjl2k0rwx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fog5oasdn289sjl2k0rwx.png" alt=" " width="757" height="491"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Microsoft Excel is one of the most widely used tools for working with data. It allows users to organize, clean, analyze, and visualize information in a simple spreadsheet format. Whether you are tracking sales, managing inventory, or analyzing survey results, Excel provides practical tools that help turn raw data into meaningful insights.&lt;br&gt;
Excel is central to real-world data analysis. It helps managers &lt;strong&gt;keep track of inventory&lt;/strong&gt; levels and make informed purchasing decisions. Finance teams use it to create reports, track spending, and examine revenue patterns. Marketing teams use it to monitor customer engagement, assess campaign performance, and spot trends that can improve future campaigns.&lt;/p&gt;

&lt;h2&gt;
  
  
  Excel Features and Formulas
&lt;/h2&gt;

&lt;p&gt;One feature I have learned is &lt;strong&gt;Data Validation&lt;/strong&gt;, which reduces data entry errors by restricting cells to predefined lists and rules. This keeps data accurate and consistent. Another useful feature is &lt;strong&gt;Conditional Formatting&lt;/strong&gt;, which automatically highlights important information such as low stock levels, duplicate values, overdue tasks, and outliers. This makes it easier to identify issues at a glance.&lt;br&gt;
I have also learned several formulas that support data analysis. The &lt;strong&gt;SUM&lt;/strong&gt; function calculates totals quickly, while &lt;strong&gt;COUNTIF&lt;/strong&gt; counts records that meet specific conditions. The &lt;strong&gt;DATEDIF&lt;/strong&gt; function helps calculate the number of days, months, or years between two dates, making it useful for tracking inventory age, project timelines, or customer activity periods.&lt;/p&gt;
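
&lt;p&gt;For readers who think in code, the snippet below is a rough Python sketch of what these three functions compute, not Excel syntax. The inventory records, the reorder level of 10, and the fixed reference date are hypothetical, and the last step mirrors only the day-count mode of DATEDIF.&lt;/p&gt;

```python
from datetime import date

# Hypothetical inventory records.
stock = [
    {"item": "Notebook", "units": 40, "received": date(2026, 1, 15)},
    {"item": "Pen", "units": 5, "received": date(2026, 3, 1)},
    {"item": "Backpack", "units": 8, "received": date(2026, 5, 20)},
]
REORDER_LEVEL = 10
today = date(2026, 6, 8)  # fixed date so the result is reproducible

# Like SUM: total units on hand.
total_units = sum(row["units"] for row in stock)

# Like COUNTIF: how many items are at or below the reorder level.
low_stock_count = sum(1 for row in stock if REORDER_LEVEL >= row["units"])

# Like DATEDIF with the "d" unit: whole days between two dates.
age_in_days = {row["item"]: (today - row["received"]).days for row in stock}
```

&lt;p&gt;In this sample the total is 53 units, two items are at or below the reorder level, and the notebooks have been in stock for 144 days.&lt;/p&gt;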

&lt;h2&gt;
  
  
  In Conclusion…
&lt;/h2&gt;

&lt;p&gt;Learning Excel has changed the way I see data and its role in everyday decision-making. I now understand that data is not just something to be stored, but something that can be used to monitor performance and guide actions. For example, a business owner can use Excel dashboards to track inventory levels, monitor sales trends, and identify fast-moving or slow-moving products. Instead of requiring someone to review hundreds of records manually, Excel presents key information in a clear, visual format, making it easier to decide when to restock products and manage inventory effectively. This has helped me appreciate how data can be transformed into practical insights that support real-world business operations.&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
