DEV Community

Cover image for How Excel is used in real-world data analysis: A reflection from week 1
Faith Mwai
Faith Mwai

Posted on

How Excel is used in real-world data analysis: A reflection from week 1

As part of my first week in the Data Science & Analytics course, I have been learning how Excel is used in real-world data analysis. Before this week, I mainly viewed Excel as a tool for storing data and performing simple calculations. However, after working with a human resources (HR) dataset containing employee records, I now understand that Excel is a powerful tool for data cleaning, analysis, and decision making.

Excel is a spreadsheet application that enables users to organize, clean, analyze, and visualize data. It is widely used by organizations because it provides practical tools for transforming raw data into meaningful insights.

One of the most important lessons I learned is that data cleaning is a critical part of data analysis. In the HR dataset we worked with, there were 877 employee records, but the data contained several quality issues. For example, there were missing values in fields such as salary, gender etc. There were also duplicate records which could lead to inaccurate analysis if not removed.

Another challenge in the dataset is inconsistent data entry. In the gender column, entries included values such as Female, female, Femle, Male, male, F and M. Using Excel's filtering and sorting tools, an analyst can quickly identify these inconsistencies and standardize them to improve data quality.

One Excel feature that I found particularly useful is conditional formatting. This feature allows users to highlight specific values automatically. For example, it can be used to identify missing data, highlight employees with low performance scores, or flag duplicates. This makes it easier to spot patterns and potential issues within datasets.

I also learned several Excel functions that support data analysis. The COUNTBLANK() function can be used to count missing values in a dataset, helping analysts assess data completeness. The Aggregate functions such as SUMIFS() function allows users to calculate totals based on multiple conditions such as determining the total salary expenditure for employees in a specific department.

Learning Excel has changed the way I see data. I now understand that meaningful analysis begins with clean and accurate data. Excel provides practical tools for identifying errors, removing duplicates, handling missing values, and extracting insights from data. This first week has shown me that Excel is not just a spreadsheet application; it is an essential tool for solving real world business problems and supporting data-driven decisions.

Top comments (0)