DEV Community

Alex Spinov
Alex Spinov

Posted on

DuckDB Has a Free Analytical Database — Run SQL on CSV, Parquet, and JSON Without a Server

Loading a 5GB CSV into PostgreSQL to run one GROUP BY query is absurd. DuckDB runs analytical SQL directly on files. No server. No import.

What You Get Free

MIT licensed:

  • Zero-server — embedded, no daemon
  • Read files directly — CSV, Parquet, JSON, Excel
  • Columnar storage — optimized for analytics
  • SQL standard — PostgreSQL-compatible
  • Fast — vectorized, parallel processing
  • In-process — Python, Node.js, Java, R
  • Extensions — HTTP, PostgreSQL scanner, S3

Quick Start

import duckdb

result = duckdb.sql("""
    SELECT category, COUNT(*) as count, AVG(price) as avg_price
    FROM 'sales.csv'
    GROUP BY category
    ORDER BY count DESC
""").df()

# Remote Parquet
duckdb.sql("SELECT * FROM 'https://example.com/data.parquet' LIMIT 10")
Enter fullscreen mode Exit fullscreen mode

What You Can Build

1. Data analysis — explore files without loading into a database.
2. ETL — transform data between formats with SQL.
3. Embedded analytics — add queries to any app.
4. Log analysis — query JSON logs with SQL.
5. Data lake — read S3 Parquet directly.


Need data pipeline help? Email spinov001@gmail.com

More free tiers: 76+ Free APIs Every Developer Should Bookmark

Top comments (0)