Data cleaning is one of the most crucial steps in any data science or analytics project. In this challenge, I worked on a real-world dataset from Kaggle with over 100,000 rows, performing various Pandas operations to clean, preprocess, and prepare it for further analysis.
๐ Dataset Details
For this challenge, I selected the E-commerce Sales Dataset from Kaggle containing around 120,000 rows and 12 columns.
It includes data such as:
๐งพ Order ID
๐ค Customer Name
๐ Product & Quantity
๐ฐ Sales & Discount
๐ Region
๐
Order Date
Before Cleaning:
Rows โ 120,000
Columns โ 12
File format โ .csv
โ๏ธ Tools & Environment
Python 3
Google Colab
Libraries: Pandas, NumPy, Matplotlib

Top comments (0)