DEV Community

Codes With Pankaj
Codes With Pankaj

Posted on

Reading CSV Files into Python with Pandas

Introduction

Python is a versatile programming language, and Pandas is a powerful data manipulation and analysis library that makes working with data a breeze. One common task in data analysis is reading data from a CSV (Comma-Separated Values) file. In this tutorial, we'll walk you through how to read a CSV file into Python using Pandas, along with a practical example.

Prerequisites

Before we get started, ensure you have Pandas installed. If you don't have it, you can install it using pip:

pip install pandas
Enter fullscreen mode Exit fullscreen mode

Reading a CSV File

Pandas provides a read_csv() function that makes reading CSV files a straightforward process. Here's a step-by-step guide on how to use it:

Step 1: Import the Pandas Library

Start by importing the Pandas library:

import pandas as pd
Enter fullscreen mode Exit fullscreen mode

Step 2: Load the CSV File

Use the read_csv() function to load your CSV file into a Pandas DataFrame. You need to provide the file path as an argument:

df = pd.read_csv('p4n.csv')
Enter fullscreen mode Exit fullscreen mode

Make sure to replace 'p4n.csv' with the actual path to your CSV file.

Step 3: Explore Your Data

Now that you've loaded the CSV file into a DataFrame, you can explore and manipulate the data. Here are a few common operations:

  • df.head(): View the first few rows of the DataFrame.
  • df.info(): Get information about the DataFrame, including data types.
  • df.describe(): Generate summary statistics for numerical columns.

Step 4: Access Data

You can access specific columns and rows in the DataFrame using Pandas' indexing and slicing methods. For example:

# Access a specific column
column_data = df['column_name']

# Access a specific row
row_data = df.loc[row_index]
Enter fullscreen mode Exit fullscreen mode

Example: Reading a CSV File

Let's put this into action with an example. Suppose we have a CSV file named 'sales_data.csv' containing sales data with columns 'Date', 'Product', 'Sales', and 'Profit'. Here's how we can read and explore this data:

import pandas as pd

# Load the CSV file
df = pd.read_csv('sales_data.csv')

# View the first 5 rows
print(df.head())

# Get basic info about the DataFrame
print(df.info())

# Summary statistics for numerical columns
print(df.describe())
Enter fullscreen mode Exit fullscreen mode

Sentry image

See why 4M developers consider Sentry, “not bad.”

Fixing code doesn’t have to be the worst part of your day. Learn how Sentry can help.

Learn more

Top comments (0)

Image of Docusign

🛠️ Bring your solution into Docusign. Reach over 1.6M customers.

Docusign is now extensible. Overcome challenges with disconnected products and inaccessible data by bringing your solutions into Docusign and publishing to 1.6M customers in the App Center.

Learn more