<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Paulet Wairagu</title>
    <description>The latest articles on DEV Community by Paulet Wairagu (@pauletart).</description>
    <link>https://dev.to/pauletart</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1027631%2F566d21f6-0d69-46e1-a34a-4d0ed530615d.jpeg</url>
      <title>DEV Community: Paulet Wairagu</title>
      <link>https://dev.to/pauletart</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/pauletart"/>
    <language>en</language>
    <item>
      <title>Cloud : My AWS CCP Study plan</title>
      <dc:creator>Paulet Wairagu</dc:creator>
      <pubDate>Thu, 09 Apr 2026 17:19:46 +0000</pubDate>
      <link>https://dev.to/pauletart/cloud-my-aws-ccp-study-plan-56ip</link>
      <guid>https://dev.to/pauletart/cloud-my-aws-ccp-study-plan-56ip</guid>
      <description>&lt;p&gt;I come from a technical background; data, SQL, automation and dashboards but cloud is one area I’ve only _tinkered _with so far. Nothing fully structured, nothing end-to-end.&lt;/p&gt;

&lt;p&gt;That changes now.&lt;/p&gt;

&lt;p&gt;It’s April 9th, and my goal is simple:&lt;br&gt;
👉 Sit for the AWS Certified Cloud Practitioner (CCP) exam in the last week of April.&lt;/p&gt;

&lt;p&gt;This isn’t just about passing an exam. It’s about finally bringing all my scattered cloud knowledge into one structured foundation I can actually build on.&lt;/p&gt;

&lt;p&gt;Even better — I’m doing this 100% with free resources, and if you’re on the same path, feel free to follow along or use this as your guide.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;📚 My Study Plan&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;1. Core Learning (Main Course)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Andrew Brown’s AWS CCP Course (FreeCodeCamp)&lt;/em&gt;&lt;br&gt;
This is the backbone.&lt;/p&gt;

&lt;p&gt;Anyone who knows Andrew knows he doesn’t miss — his cloud content is detailed, practical, and built for real understanding.&lt;br&gt;
The 14-hour course is honestly a gem, and I’ll be using it to cover all core concepts end-to-end.&lt;/p&gt;

&lt;p&gt;  &lt;iframe src="https://www.youtube.com/embed/NhDYbskXRgc"&gt;
  &lt;/iframe&gt;
&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Reinforcement (Interactive Learning)&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;AWS Skill Builder&lt;br&gt;
This is where I’ll reinforce concepts in a lighter, more interactive way.&lt;/p&gt;

&lt;p&gt;I like that it breaks things down simply and even gamifies parts of the learning.&lt;br&gt;
I’ll also be using their official exam prep plan to stay aligned with what AWS actually expects.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Practice &amp;amp; Testing
YouTube practice questions (I’ll keep adding links as I go)
Repetition of weak areas
Simulating exam-style thinking&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This is where things move from “I understand” to “I can pass.”&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Notes &amp;amp; Organization
Notion will be my command center&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;I’ll use it to:&lt;/p&gt;

&lt;p&gt;Track progress&lt;br&gt;
Write summarized notes&lt;br&gt;
Store key concepts and cheat sheets&lt;br&gt;
Collect useful links/resources&lt;br&gt;
🎯 Why This Approach?&lt;/p&gt;

&lt;p&gt;I’ve realized something important:&lt;br&gt;
I don’t lack ability — I’ve just lacked structure.&lt;/p&gt;

&lt;p&gt;I’ve explored AWS before, but in pieces. This plan is about pulling everything together into one clear, focused sprint.&lt;/p&gt;

&lt;p&gt;⏳ Timeline&lt;br&gt;
Start: April 9&lt;br&gt;
Exam Target: Last week of April&lt;br&gt;
Approach: Focused, daily consistency over perfection&lt;br&gt;
🤝 If You’re Also Studying…&lt;/p&gt;

&lt;p&gt;If you’re preparing for AWS CCP too:&lt;/p&gt;

&lt;p&gt;Feel free to use this plan&lt;br&gt;
Share resources&lt;br&gt;
Drop practice questions&lt;br&gt;
Let’s make this easier together&lt;/p&gt;

&lt;p&gt;I’ll keep updating this with useful links and insights as I go.&lt;/p&gt;

</description>
      <category>cloudcomputing</category>
      <category>aws</category>
      <category>cloud</category>
    </item>
    <item>
      <title>Cloud : My AWS CCP Study plan</title>
      <dc:creator>Paulet Wairagu</dc:creator>
      <pubDate>Thu, 09 Apr 2026 17:14:11 +0000</pubDate>
      <link>https://dev.to/pauletart/cloud-my-aws-ccp-study-plan-en2</link>
      <guid>https://dev.to/pauletart/cloud-my-aws-ccp-study-plan-en2</guid>
      <description>&lt;p&gt;I come from a technical background; data, SQL, automation and dashboards but cloud is one area I’ve only _tinkered _with so far. Nothing fully structured, nothing end-to-end.&lt;/p&gt;

&lt;p&gt;That changes now.&lt;/p&gt;

&lt;p&gt;It’s April 9th, and my goal is simple:&lt;br&gt;
👉 Sit for the AWS Certified Cloud Practitioner (CCP) exam in the last week of April.&lt;/p&gt;

&lt;p&gt;This isn’t just about passing an exam. It’s about finally bringing all my scattered cloud knowledge into one structured foundation I can actually build on.&lt;/p&gt;

&lt;p&gt;Even better — I’m doing this 100% with free resources, and if you’re on the same path, feel free to follow along or use this as your guide.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;📚 My Study Plan&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;1. Core Learning (Main Course)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Andrew Brown’s AWS CCP Course (FreeCodeCamp)&lt;/em&gt;&lt;br&gt;
This is the backbone.&lt;/p&gt;

&lt;p&gt;Anyone who knows Andrew knows he doesn’t miss — his cloud content is detailed, practical, and built for real understanding.&lt;br&gt;
The 14-hour course is honestly a gem, and I’ll be using it to cover all core concepts end-to-end.&lt;/p&gt;

&lt;p&gt;  &lt;iframe src="https://www.youtube.com/embed/NhDYbskXRgc"&gt;
  &lt;/iframe&gt;
&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Reinforcement (Interactive Learning)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;AWS Skill Builder&lt;/em&gt;&lt;br&gt;
This is where I’ll reinforce concepts in a lighter, more interactive way.&lt;/p&gt;

&lt;p&gt;I’ll also be using their official exam prep plan to stay aligned with what AWS actually expects.&lt;/p&gt;


&lt;div class="crayons-card c-embed text-styles text-styles--secondary"&gt;
    &lt;div class="c-embed__content"&gt;
      &lt;div class="c-embed__body flex items-center justify-between"&gt;
        &lt;a href="https://skillbuilder.aws/learning-plan/8UUCEZGNX4/exam-prep-plan-aws-certified-cloud-practitioner-clfc02--english/1J2VTQSGU2" rel="noopener noreferrer" class="c-link fw-bold flex items-center"&gt;
          &lt;span class="mr-2"&gt;skillbuilder.aws&lt;/span&gt;
          

        &lt;/a&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;


&lt;p&gt;I like that it breaks things down simply and even gamifies parts of the learning.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Practice &amp;amp; Testing&lt;/strong&gt;&lt;br&gt;
YouTube practice questions (I’ll keep adding links as I go)&lt;br&gt;
Repetition of weak areas&lt;br&gt;
Simulating exam-style thinking&lt;/p&gt;

&lt;p&gt;This is where things move from “I understand” to “I can pass.”&lt;/p&gt;

&lt;p&gt;&lt;em&gt;4. Notes &amp;amp; Organization&lt;/em&gt;&lt;br&gt;
Notion will be my command center&lt;/p&gt;

&lt;p&gt;I’ll use it to:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Track progress&lt;/li&gt;
&lt;li&gt;Write summarized notes&lt;/li&gt;
&lt;li&gt;Store key concepts and cheat sheets&lt;/li&gt;
&lt;li&gt;Collect useful links/resources&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;🎯 Why This Approach?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I’ve realized something important:&lt;br&gt;
I don’t lack ability, I’ve just lacked structure.&lt;/p&gt;

&lt;p&gt;I’ve explored AWS before, but in pieces. This plan is about pulling everything together into one clear, focused sprint.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;&lt;strong&gt;⏳ Timeline&lt;/strong&gt;&lt;br&gt;
Start: April 9&lt;br&gt;
Exam Target: Last week of April&lt;br&gt;
Approach: Focused, daily consistency over perfection&lt;br&gt;
🤝 If You’re Also Studying…&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;If you’re preparing for AWS CCP too:&lt;/p&gt;

&lt;p&gt;Feel free to use this plan&lt;br&gt;
Share resources&lt;br&gt;
Drop practice questions&lt;br&gt;
Let’s make this easier together&lt;/p&gt;

&lt;p&gt;I’ll keep updating this with useful links and insights as I go.&lt;/p&gt;

</description>
      <category>cloudcomputing</category>
      <category>aws</category>
      <category>cloud</category>
    </item>
    <item>
      <title>SQL - Basics</title>
      <dc:creator>Paulet Wairagu</dc:creator>
      <pubDate>Tue, 13 Jan 2026 07:34:23 +0000</pubDate>
      <link>https://dev.to/pauletart/sql-basics-1cpm</link>
      <guid>https://dev.to/pauletart/sql-basics-1cpm</guid>
      <description>&lt;p&gt;Data is everywhere and is often stored in different files and formats. As data grows, managing it using files alone becomes inefficient. This creates the need for a &lt;em&gt;database&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;A &lt;em&gt;database&lt;/em&gt; is an organized collection of data that allows information to be stored, retrieved, and managed efficiently. Databases are beneficial because they provide:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Large and scalable storage&lt;/li&gt;
&lt;li&gt;Secure and controlled access to data&lt;/li&gt;
&lt;li&gt;Data consistency and integrity&lt;/li&gt;
&lt;li&gt;Faster querying and data management&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;A Database Management System (DBMS) is a software system that allows users to create, manage, secure, and manipulate databases. Examples include &lt;em&gt;MySQL, PostgreSQL, Oracle, SQL Server, and SQLite&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Structured Query Language (SQL)&lt;/em&gt; is the standard language used to communicate with relational databases. SQL allows users to:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Create databases and tables&lt;/li&gt;
&lt;li&gt;Insert, update, and delete data&lt;/li&gt;
&lt;li&gt;Retrieve data using queries&lt;/li&gt;
&lt;li&gt;Control access and permissions&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Several types of databases exist, each designed for specific use cases:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Relational Database
Stores data in predefined tables made up of rows and columns.
Example: &lt;em&gt;MySQL, PostgreSQL&lt;/em&gt;
&lt;em&gt;Best for structured data and complex queries.&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;Key-Value Database
Stores data as key–value pairs.
Example: &lt;em&gt;Redis, DynamoDB&lt;/em&gt;
&lt;em&gt;Best for fast lookups and caching.&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;Column-Based (Column-Oriented) Database
Stores data grouped by columns rather than rows.
Example: &lt;em&gt;Apache Cassandra, Google BigQuery&lt;/em&gt;
&lt;em&gt;Best for analytics and large-scale data processing.&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;Graph Database
Stores data as nodes and relationships.
Example: &lt;em&gt;Neo4j&lt;/em&gt;
&lt;em&gt;Best for relationship-heavy data such as social networks.&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;Document Database
Stores data in document formats such as JSON or BSON.
Example:_ MongoDB_
&lt;em&gt;Best for semi-structured or flexible data.&lt;/em&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Databases exist within a structure.&lt;br&gt;
A database server hosts and manages databases.&lt;br&gt;
Within a database, data is organized using a schema.&lt;br&gt;
A schema defines the structure of the database and contains:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Tables&lt;/li&gt;
&lt;li&gt;Views&lt;/li&gt;
&lt;li&gt;Indexes&lt;/li&gt;
&lt;li&gt;Stored procedures and other objects&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Tables consist of:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Rows (records): individual entries&lt;/li&gt;
&lt;li&gt;Columns (fields): attributes of the data&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Types of SQL Commands (With Examples)&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;DDL – Data Definition Language
Used to define and modify database structures.
&lt;code&gt;
CREATE TABLE students (
student_id INT PRIMARY KEY,
name VARCHAR(50),
age INT
);&lt;/code&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;code&gt;ALTER TABLE students ADD email VARCHAR(100);&lt;/code&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;DML – Data Manipulation Language
Used to insert, update, and delete data.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;code&gt;INSERT INTO students VALUES (1, 'Alice', 20);&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;code&gt;UPDATE students&lt;br&gt;
SET age = 21&lt;br&gt;
WHERE student_id = 1;&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;code&gt;DELETE FROM students&lt;br&gt;
WHERE student_id = 1;&lt;/code&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;DQL – Data Query Language
Used to retrieve data from the database.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;code&gt;SELECT * FROM students;&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;code&gt;SELECT name, age&lt;br&gt;
FROM students&lt;br&gt;
WHERE age &amp;gt; 18;&lt;/code&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;DCL – Data Control Language
Used to control access and permissions.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;code&gt;GRANT SELECT ON students TO user1;&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;code&gt;REVOKE SELECT ON students FROM user1;&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;It is also worth noting the Coding Order in SQL (Logical Query Processing Order).&lt;br&gt;
Although SQL is written in one order, it is &lt;em&gt;executed logically&lt;/em&gt; in the following sequence:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;FROM&lt;/li&gt;
&lt;li&gt;WHERE&lt;/li&gt;
&lt;li&gt;GROUP BY&lt;/li&gt;
&lt;li&gt;HAVING&lt;/li&gt;
&lt;li&gt;SELECT&lt;/li&gt;
&lt;li&gt;ORDER BY&lt;/li&gt;
&lt;li&gt;LIMIT&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;code&gt;SELECT department, COUNT(*)&lt;br&gt;
FROM employees&lt;br&gt;
WHERE salary &amp;gt; 50000&lt;br&gt;
GROUP BY department&lt;br&gt;
HAVING COUNT(*) &amp;gt; 3&lt;br&gt;
ORDER BY department;&lt;/code&gt;&lt;/p&gt;

</description>
      <category>programming</category>
      <category>sql</category>
      <category>database</category>
      <category>week1</category>
    </item>
    <item>
      <title>THINKING GAME DOCUMENTARY: MY REVIEW</title>
      <dc:creator>Paulet Wairagu</dc:creator>
      <pubDate>Wed, 26 Nov 2025 18:47:57 +0000</pubDate>
      <link>https://dev.to/pauletart/thinking-game-documentary-my-review-36c4</link>
      <guid>https://dev.to/pauletart/thinking-game-documentary-my-review-36c4</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F31ku2jttaxit6k2q4h7o.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F31ku2jttaxit6k2q4h7o.png" alt=" " width="800" height="201"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;So I finally watched AlphaGo, that documentary about the Google DeepMind AI that took on the world champion in Go, and honestly… I didn’t expect to enjoy it this much. I thought it would be one of those “tech bros hype themselves” films, but wueh, it’s actually deep.&lt;/p&gt;

&lt;p&gt;First things first,the game itself. Go is like chess on steroids. Watching those pros talk about it felt like watching athletes explaining how they breathe. The amount of strategy, intuition, and reading the board… it made me respect the game a lot more. And the way the documentary broke it down for normal people? Lovely. Even those of us who have never touched a Go board can follow the tension.&lt;/p&gt;

&lt;p&gt;Then there's Lee Sedol. Man, that guy carried the emotional weight of the whole thing. You feel the pressure on him — not just to win, but to defend human creativity. The guy literally said he felt like he was playing on behalf of everyone. That scene where he loses a game and walks out looking completely defeated? Your heart just sinks. Been there, that feeling of “I did everything and still lost.”&lt;/p&gt;

&lt;p&gt;And then AlphaGo. The AI itself almost feels like a character. Quiet, calculating, no hype, just vibes and probabilities. The wild part is when it makes those “impossible” moves that even the Go masters can’t understand. Move 37 especially — the commentators looked like they’d seen witchcraft. Even Sedol was like, “No human plays like that.” That’s the moment I realized AI isn’t just copying; sometimes it’s genuinely creating.&lt;/p&gt;

&lt;p&gt;But my favourite part is how the doc doesn’t frame it as “humans vs robots.” It shows how the match changed how humans think. After the loss, the pros started studying AlphaGo games and discovering new strategies. Like the AI unlocked creativity instead of killing it. That hit me because we’re in that same AI era now — people thinking AI is coming to take all jobs, yet here we are, learning new ways of thinking from it.&lt;/p&gt;

&lt;p&gt;Cinematography was also clean — the slow, quiet shots, the close-ups, the music. It’s not rushed or over-dramatic. Just calm, like meditation.&lt;/p&gt;

&lt;p&gt;If you like strategy, tech, psychology, or you just want to see a human fight for dignity against a machine, this is a solid watch. It’s not a hype documentary; it’s a thoughtful one. Emotion, tension, and a bit of “eish, surely, how is an algorithm beating a whole world champion?”&lt;/p&gt;

&lt;p&gt;Highly recommend.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>data</category>
      <category>datascience</category>
      <category>alphago</category>
    </item>
    <item>
      <title>The 5 Most Common Data Quality Issues (and How Analysts Can Fix Them)</title>
      <dc:creator>Paulet Wairagu</dc:creator>
      <pubDate>Mon, 24 Nov 2025 10:50:56 +0000</pubDate>
      <link>https://dev.to/pauletart/the-5-most-common-data-quality-issues-and-how-analysts-can-fix-them-3c7p</link>
      <guid>https://dev.to/pauletart/the-5-most-common-data-quality-issues-and-how-analysts-can-fix-them-3c7p</guid>
      <description>&lt;p&gt;Data analysts spend more time cleaning data than analyzing it. In fact, in most real-world projects, 60–80% of your time goes into preparing data for meaningful insights. &lt;br&gt;
Poor data quality leads to incorrect conclusions, broken dashboards, and bad decisions which is why understanding common issues and knowing how to fix them is a core skill for every analyst.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Here are the five most common data quality problems and practical steps to solve each one.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Missing or Null Values&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Missing data can distort metrics, create gaps in reports, or lead to inaccurate ML models.&lt;/p&gt;

&lt;p&gt;Causes:&lt;br&gt;
• Manual data entry errors&lt;br&gt;
• Incomplete integrations&lt;br&gt;
• System migration issues&lt;/p&gt;

&lt;p&gt;How to fix it:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Identify missingness patterns using COUNT(*) in SQL or df.isna().sum() in Python.&lt;/li&gt;
&lt;li&gt;Drop rows only when missing data is irrelevant.&lt;/li&gt;
&lt;li&gt;Impute using averages, medians, or domain logic.&lt;/li&gt;
&lt;li&gt;Use Power Query’s “Replace Errors” or “Fill Down” functions for structured fixes.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;2. Inconsistent Formatting&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;You’ve probably seen this: “Kenya”, “kenya”, “K E N Y A”, or mismatched date formats in the same column.&lt;/p&gt;

&lt;p&gt;Why it happens:&lt;br&gt;
• Different data sources&lt;br&gt;
• Manual inputs&lt;br&gt;
• Lack of data validation rules&lt;/p&gt;

&lt;p&gt;How to fix it:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Apply standard casing (upper/lower/title).&lt;/li&gt;
&lt;li&gt;Convert all dates to a unified ISO format (YYYY-MM-DD).&lt;/li&gt;
&lt;li&gt;Use Excel Power Query’s “Transform → Format” options.&lt;/li&gt;
&lt;li&gt;In SQL, standardize with functions like UPPER(), TRIM(), or TO_DATE().&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;3. Duplicate Records&lt;/strong&gt;&lt;br&gt;
Duplicates inflate counts, break KPIs, and cause incorrect aggregations.&lt;/p&gt;

&lt;p&gt;Why it happens:&lt;br&gt;
• Multiple data entry points&lt;br&gt;
• Poor primary key definition&lt;br&gt;
• System sync issues&lt;/p&gt;

&lt;p&gt;How to fix it:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Identify duplicates using ROW_NUMBER() windows in SQL.&lt;/li&gt;
&lt;li&gt;Use Power Query’s “Remove Duplicates”.&lt;/li&gt;
&lt;li&gt;Implement unique IDs early in the pipeline.&lt;/li&gt;
&lt;li&gt;In Python, use df.drop_duplicates().&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;4. Outliers and Incorrect Values&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Some values are valid extreme cases; others are simply errors (like a customer aged 600).&lt;/p&gt;

&lt;p&gt;Why it happens:&lt;br&gt;
• Typographical errors&lt;br&gt;
• Faulty sensors or scraping issues&lt;br&gt;
• Incorrect units (meters vs. feet)&lt;/p&gt;

&lt;p&gt;How to fix it:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Visualize distributions using box plots or histograms.&lt;/li&gt;
&lt;li&gt;Apply domain thresholds or rule-based logic.&lt;/li&gt;
&lt;li&gt;Use interquartile ranges or z-scores for statistical outlier detection.&lt;/li&gt;
&lt;li&gt;Create automated validations in Power BI or SQL.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;5. Mixed Granularity&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data at different levels combined into one column or table — e.g., weekly and monthly data in the same dataset.&lt;/p&gt;

&lt;p&gt;Why it happens:&lt;br&gt;
• Data integration from multiple systems&lt;br&gt;
• Poorly designed source tables&lt;/p&gt;

&lt;p&gt;How to fix it:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Split datasets by granularity before analysis.&lt;/li&gt;
&lt;li&gt;Create dimensional tables for dates, products, etc.&lt;/li&gt;
&lt;li&gt;Aggregate or disaggregate consistently before joining.&lt;/li&gt;
&lt;li&gt;Use a proper star schema when possible.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>etl</category>
      <category>data</category>
      <category>datascience</category>
    </item>
  </channel>
</rss>
