Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
How I forced Python standard libraries to process and serialize production server crashes into Parquet locally
Syed Amman qadir bukhari
Syed Amman qadir bukhari
Syed Amman qadir bukhari
Follow
Jun 8
How I forced Python standard libraries to process and serialize production server crashes into Parquet locally
#
python
#
dataengineering
#
devops
#
opensource
Comments
Add Comment
1 min read
From Clean CSVs to Production‑Shaped Data: A Practical Guide for Academic ML and Data Engineering
Jitendra Devabhaktuni
Jitendra Devabhaktuni
Jitendra Devabhaktuni
Follow
Jun 8
From Clean CSVs to Production‑Shaped Data: A Practical Guide for Academic ML and Data Engineering
#
ai
#
database
#
datascience
#
dataengineering
Comments
Add Comment
5 min read
I Built a Write-Ahead Log in Pure Python and Finally Understood How Databases Survive Crashes
Haji Rufai
Haji Rufai
Haji Rufai
Follow
Jun 8
I Built a Write-Ahead Log in Pure Python and Finally Understood How Databases Survive Crashes
#
python
#
database
#
dataengineering
#
programming
Comments
Add Comment
7 min read
Architecture Over Alerts: How We Cut BigQuery Costs by 57%($12M) for a Fortune 500
Pratik Dhanave
Pratik Dhanave
Pratik Dhanave
Follow
Jun 8
Architecture Over Alerts: How We Cut BigQuery Costs by 57%($12M) for a Fortune 500
#
bigquery
#
finops
#
cloudarchitecture
#
dataengineering
Comments
Add Comment
4 min read
I Built a Columnar File Format in Pure Python — a tiny, readable Parquet
Haji Rufai
Haji Rufai
Haji Rufai
Follow
Jun 8
I Built a Columnar File Format in Pure Python — a tiny, readable Parquet
#
python
#
dataengineering
#
database
#
programming
Comments
Add Comment
6 min read
Running Apache Airflow + Docker for Free Using GitHub Codespaces
Tanmay
Tanmay
Tanmay
Follow
Jun 8
Running Apache Airflow + Docker for Free Using GitHub Codespaces
#
dataengineering
#
docker
#
python
#
apacheairflow
Comments
Add Comment
1 min read
Why Big Tech is Migrating from Traditional Databases to NewSQL
Lê Đình Phú
Lê Đình Phú
Lê Đình Phú
Follow
Jun 8
Why Big Tech is Migrating from Traditional Databases to NewSQL
#
bigdata
#
dataengineering
#
database
#
sql
1
reaction
Comments
Add Comment
1 min read
Organizing How to Use AWS Lake Formation
Aki
Aki
Aki
Follow
for
AWS Community Builders
Jun 8
Organizing How to Use AWS Lake Formation
#
aws
#
dataengineering
3
reactions
Comments
Add Comment
9 min read
ETL Pipeline: Fetching Real-Time News Data with Python and Postgres
Gathuru_M
Gathuru_M
Gathuru_M
Follow
Jun 7
ETL Pipeline: Fetching Real-Time News Data with Python and Postgres
#
dataengineering
#
api
#
etl
#
beginners
1
reaction
Comments
Add Comment
4 min read
I built a mini columnar storage engine in pure Python — and finally understood why Parquet is so fast
Haji Rufai
Haji Rufai
Haji Rufai
Follow
Jun 7
I built a mini columnar storage engine in pure Python — and finally understood why Parquet is so fast
#
dataengineering
#
python
#
database
#
performance
Comments
Add Comment
6 min read
One command from your laptop to Kubernetes — no CI pipeline
Alisson Rosa
Alisson Rosa
Alisson Rosa
Follow
Jun 7
One command from your laptop to Kubernetes — no CI pipeline
#
airflow
#
go
#
kubernetes
#
dataengineering
Comments
Add Comment
7 min read
Why I Don’t Let the LLM Decide Issue State
Ahmad Humayun
Ahmad Humayun
Ahmad Humayun
Follow
Jun 6
Why I Don’t Let the LLM Decide Issue State
#
ai
#
python
#
dataengineering
#
marketinganalytics
Comments
Add Comment
5 min read
Your Scraper Collected 50 Rows. There Were 4,000.
Alex Spinov
Alex Spinov
Alex Spinov
Follow
Jun 6
Your Scraper Collected 50 Rows. There Were 4,000.
#
webscraping
#
python
#
dataengineering
#
pagination
Comments
Add Comment
7 min read
Deeper into Dataform 3: Auditing Dataform
Ben Watson
Ben Watson
Ben Watson
Follow
Jun 6
Deeper into Dataform 3: Auditing Dataform
#
dataform
#
dataengineering
#
gcp
#
bigquery
Comments
Add Comment
2 min read
How I Broke Down My ETL Pipeline Project Into Smaller Engineering Exercises
Tanmay
Tanmay
Tanmay
Follow
Jun 6
How I Broke Down My ETL Pipeline Project Into Smaller Engineering Exercises
#
dataengineering
#
database
#
etl
#
sql
Comments
Add Comment
2 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account