Lawrence Murithi

Posted on Feb 3

Schemas and Data Modelling in Power BI

#dataengineering #schema #datamodelling

Introduction

Power BI is a business intelligence tool used to turn raw data into meaningful reports and dashboards. It allows organizations to analyze data, track performance, and make informed decisions. However, the quality of insights produced by Power BI depends heavily on how the data is structured behind the scenes. Good visuals and advanced calculations cannot compensate for poorly designed data.

What are Schemas and Data Modelling

Data modelling is the process of organizing data into tables and defining how those tables relate to each other. A schema is the structure or design of this data model.
Schemas and data modelling define how data is organized, connected, and interpreted in Power BI. They determine how tables relate to each other, how filters flow across the model, and how calculations are performed.

Characteristics of a data model.

A good data model:

Makes reports faster and more responsive
Produces correct totals and calculations
Is easy to understand and maintain
Works naturally with DAX measures and visuals.

A bad data model:

Slows down reports
Produces wrong or inconsistent numbers
Forces complex and hard-to-read DAX formulas
Confuses report users and developers

This article explains schemas and data modelling in Power BI, focusing on:

Star schema
Snowflake schema
Fact and dimension tables
Relationships
Importance of good modelling for performance and accurate reporting

Understanding Data Modelling in Power BI

Data modelling happens after data is loaded from sources such as Excel, databases, or cloud systems. The model is built in the Model view, where tables and relationships are defined.
A Power BI data model usually includes:

Fact tables (business events or measurements)
Dimension tables (descriptive information)
Relationships between tables
A schema design (such as star or snowflake)

Fact Tables and Dimension Tables

Fact tables store measurable business data (i.e) numerical data that you want to analyze and measure.
Characteristics of fact tables:

Very large, with many rows
Contain numeric values used in calculations
Contain keys that link to dimension tables
Grow over time as new transactions are added.

Examples of fact data:

Sales amount
Quantity sold
Profit.

Dimension tables store descriptive information that helps explain the facts. Dimension tables are used for filtering, grouping, and slicing data in reports.

Characteristics of dimension tables:

Smaller than fact tables
Mostly text and categorical data
Used for filtering, grouping, and slicing
Change less frequently than facts.

Examples of dimension data:

Product name
Customer name
Region

Relationships in Power BI

Relationships in Power BI define how tables are connected and how data flows between them. A relationship is usually created between a key column in one table and a matching column in another table. These keys allow Power BI to link descriptive data from dimension tables to numerical data in fact tables. Relationships are mainly defined by cardinality, direction, and status.

Types of Relationships

One-to-Many - This is the most common and recommended relationship where one record in a dimension table matches many records in a fact table.
One-to-One - One row in one table matches one row in another table.
Many-to-One - Many rows in the fact table match one row in the dimension table.
Many-to-Many - multiple rows in one table match multiple rows in another table and is often used when there is no unique key.

Why Relationships Matter

Good relationships:

Ensure correct totals and aggregations
Control how slicers and filters behave
Improve report performance
Make DAX measures simpler and easier to maintain

On the contrary, poorly defined relationships often result in:

Wrong numbers
Missing data in visuals
Confusing filter behavior
Slow reports

Best Practices for Relationships in Power BI

Use one-to-many relationships wherever possible
Connect dimension tables to fact tables, not dimension to dimension
Use numeric surrogate keys instead of text
Avoid unnecessary many-to-many relationships
Use single-direction filtering by default
Keep the model simple and clear

Star Schema

A star schema is the recommended data model in Power BI and consists of:

One central fact table
Multiple dimension tables connected directly to the fact table. A star schema structure looks like a star, with the fact table in the center and dimension tables branching out around it.

Example:

        Date
         |
Product — Sales — Customer
         |
       Region

Benefits of Star Schema

Simple and easy to understand
Faster query performance
Fewer relationships
Easier DAX calculations
Better compatibility with Power BI’s engine.

Power BI can process queries more efficiently because dimension tables are not connected to each other.

Snowflake Schema

A snowflake schema is a more complex version of the star schema. In this structure:

Dimension tables are normalized
Dimension tables are connected to other dimension tables.

Example:

Sales → Product → Category → Department

Benefits of Snowflake Schema

Reduces data redundancy
Useful for very large or complex dimensions

Challenges of Snowflake Schema:

More complex relationships
Slower performance due to extra joins
More complex DAX measures
Harder for users to understand

Why Good Data Modelling is Critical

Performance - Power BI uses an in-memory engine. A clean star schema reduces joins and improves query speed. Poor models can cause reports to load slowly or even fail.
Accurate Reporting - Correct relationships and proper table design ensure that filters and totals behave correctly. Bad modelling often leads to duplicated values or missing data.
Simpler DAX - DAX formulas are easier to write and maintain when the model is simple. Complex schemas often require complicated formulas, which are harder to debug.
Better User Experience - Business users prefer models that are easy to understand. Clear table names, logical relationships, and simple structures help users create reports without confusion.

Common Modelling Mistakes

Using many-to-many relationships unnecessarily
Mixing transactional and lookup data in one table
Using bi-directional relationships everywhere
Not creating a proper date dimension
Loading unnecessary columns

NB: Avoiding these mistakes improves both performance and reliability.

Conclusion

Good data modelling is the foundation of effective Power BI reporting. Visuals and calculations only work well when the underlying model is designed correctly. A clean star schema with clear fact and dimension tables leads to faster performance, accurate results, and reports that are easy to build and maintain.
Good data modelling improves performance, ensures accurate reporting, simplifies DAX, and makes reports easier to use. Without proper modelling, even the best visuals cannot deliver correct insights.

DEV Community

Schemas and Data Modelling in Power BI

Introduction

What are Schemas and Data Modelling

Characteristics of a data model.

Understanding Data Modelling in Power BI

Fact Tables and Dimension Tables

Relationships in Power BI

Types of Relationships

Why Relationships Matter

Best Practices for Relationships in Power BI

Star Schema

Benefits of Star Schema

Snowflake Schema

Benefits of Snowflake Schema

Challenges of Snowflake Schema:

Why Good Data Modelling is Critical

Common Modelling Mistakes

Conclusion

Top comments (0)