DEV Community

Tankala Ashok
Tankala Ashok

Posted on

1

My First Billion (of Rows) in DuckDB | By João Pedro

When you want to process 450Gb/1billion rows of data we think in all the directions like PySpark, Bigquery and etc. If someone says it can be processed with one Python package(DuckDB) without using/installing any fancy tools can you believe it? That’s what João Pedro did and explained in this article.

My First Billion (of Rows) in DuckDB | by João Pedro | May, 2024 | Towards Data Science

First Impressions of DuckDB handling 450Gb in a real project

favicon towardsdatascience.com

Top comments (0)

AWS Q Developer image

Your AI Code Assistant

Generate and update README files, create data-flow diagrams, and keep your project fully documented. Built to handle large projects, Amazon Q Developer works alongside you from idea to production code.

Get started free in your IDE

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay