Convert A Pandas Dataframe With String of Dict To Columns

#python #datascience #data #pandas

Let's take a look at how we can convert a string column where the data is in a dictionary format to pandas dataframe columns.

Read In Data

We're going to read in a CSV and display the column that holds our data. Pandas reports this column's dtype as object, and we can't access the values inside each dictionary because every row element is actually a string.
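
Here is a minimal sketch of that starting point. The CSV contents and the column names "id" and "attributes" are made up for illustration, and the file is read from an in-memory string so the example runs on its own; with a real file you would pass its path to pd.read_csv instead.

```python
import io

import pandas as pd

# Hypothetical CSV contents; "id" and "attributes" are illustrative names only.
csv_text = """id,attributes
1,"{'color': 'red', 'size': 3}"
2,"{'color': 'blue', 'size': 5}"
"""

# Read the CSV from an in-memory buffer (stand-in for a file path).
df = pd.read_csv(io.StringIO(csv_text))

print(df.dtypes)

# Each element looks like a dictionary but is really a plain string.
first_value = df.loc[0, "attributes"]
print(type(first_value))  # <class 'str'>
```

Trying something like first_value["color"] at this point raises a TypeError, because string indexing expects an integer position rather than a key.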

Using AST To Evaluate Strings

Let's use Python's built-in ast module to evaluate each string as a Python literal with ast.literal_eval. This turns our values into real dictionaries. We'll use the pandas .apply() method to apply this function to each element in the column.
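
A short sketch of this step, assuming the same made-up "attributes" column as a stand-in for your own data:

```python
import ast

import pandas as pd

# Hypothetical data: each "attributes" value is a string that looks like a dict.
df = pd.DataFrame(
    {
        "id": [1, 2],
        "attributes": [
            "{'color': 'red', 'size': 3}",
            "{'color': 'blue', 'size': 5}",
        ],
    }
)

# ast.literal_eval safely parses a string containing a Python literal
# (dict, list, number, string, ...) without executing arbitrary code.
df["attributes"] = df["attributes"].apply(ast.literal_eval)

print(type(df.loc[0, "attributes"]))  # <class 'dict'>
print(df.loc[0, "attributes"]["color"])  # red
```

Note that ast.literal_eval raises an error on strings that are not valid Python literals, so columns with missing or malformed values may need cleaning first.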

Normalize Into New Columns

Using the pandas function json_normalize, we can convert our dictionary values into columns in a new dataframe, which we will merge into the original later. You don't have to assign the json_normalize output to a new dataframe; I just like how it comes out.

We now have a new dataframe whose columns are the keys from our dictionaries, with the matching values in each row.
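
As a rough sketch, again with made-up data, the normalization could look like this. The name expanded is just an illustrative choice for the new dataframe.

```python
import pandas as pd

# Hypothetical data: the "attributes" column already holds real dictionaries.
df = pd.DataFrame(
    {
        "id": [1, 2],
        "attributes": [
            {"color": "red", "size": 3},
            {"color": "blue", "size": 5},
        ],
    }
)

# json_normalize takes a list of dicts and returns one column per key.
expanded = pd.json_normalize(df["attributes"].tolist())

print(expanded)
print(list(expanded.columns))  # ['color', 'size']
```

The result gets a fresh 0..n-1 index, which lines up with the original dataframe only if that dataframe also uses the default index.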

Combining Data

Once you have your new columns, you can either assign them back as columns in the original dataframe or merge them into a larger dataframe using pd.merge().
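
Both options might be sketched as follows. The data and the names with_columns and merged are illustrative, and the sketch assumes both dataframes share the default integer index so rows line up by position.

```python
import pandas as pd

# Hypothetical data: original dataframe plus the normalized dictionary columns.
df = pd.DataFrame(
    {
        "id": [1, 2],
        "attributes": [
            {"color": "red", "size": 3},
            {"color": "blue", "size": 5},
        ],
    }
)
expanded = pd.json_normalize(df["attributes"].tolist())

# Option 1: assign each new column back onto a copy of the original dataframe.
with_columns = df.copy()
for col in expanded.columns:
    with_columns[col] = expanded[col]

# Option 2: merge on the index, dropping the now-redundant dictionary column.
merged = pd.merge(
    df.drop(columns="attributes"),
    expanded,
    left_index=True,
    right_index=True,
)

print(merged)
```

If the original dataframe has a custom index, one way to keep the rows aligned is to set expanded.index = df.index before combining.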
