DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Is AWS Glue Data Catalog Sufficient as a Data Catalog? Organizing Its Design, Limitations, and Complementary Strategies

Is AWS Glue Data Catalog Sufficient as a Data Catalog? Organizing Its Design, Limitations, and Complementary Strategies

7
Comments
10 min read
🤖 Feature Pipeline — Where Your Raw Data Becomes AI Fuel🤖

🤖 Feature Pipeline — Where Your Raw Data Becomes AI Fuel🤖

13
Comments
2 min read
The Vinted Arbitrage War: Building a Scraper That Doesn't Get IP-Banned

The Vinted Arbitrage War: Building a Scraper That Doesn't Get IP-Banned

Comments 1
9 min read
Building a Real-Time Data Pipeline: Streaming TCP Socket Data to PostgreSQL with Node.js

Building a Real-Time Data Pipeline: Streaming TCP Socket Data to PostgreSQL with Node.js

Comments
3 min read
I built pq - the jq of Parquet. Here's why data engineers need a better CLI

I built pq - the jq of Parquet. Here's why data engineers need a better CLI

2
Comments
1 min read
The Ultimate Databricks Data Engineer Associate Exam Guide for AWS Engineers

The Ultimate Databricks Data Engineer Associate Exam Guide for AWS Engineers

1
Comments
45 min read
PECOS Data Extraction Pipeline - DevOps Documentation

PECOS Data Extraction Pipeline - DevOps Documentation

3
Comments
7 min read
SnowPro Core Roadmap

SnowPro Core Roadmap

2
Comments 1
13 min read
Feeding the Black Box: Engineering a Data Pipeline for Meta's Deep Learning Algorithms

Feeding the Black Box: Engineering a Data Pipeline for Meta's Deep Learning Algorithms

3
Comments
3 min read
The Ultimate Guide to Databricks Data Engineer Associate Exam: Everything You Need to Know

The Ultimate Guide to Databricks Data Engineer Associate Exam: Everything You Need to Know

1
Comments
40 min read
COALESCE in SQL: the “Don’t Let NULL Win” Function You’ll Use Everywhere

COALESCE in SQL: the “Don’t Let NULL Win” Function You’ll Use Everywhere

Comments
2 min read
✅ Benefits of the FTI Architecture — The Cleanest Way to Build Production ML Systems✅

✅ Benefits of the FTI Architecture — The Cleanest Way to Build Production ML Systems✅

37
Comments 5
2 min read
I built an "Agentic" SQL Generator because LLMs are bad at syntax.

I built an "Agentic" SQL Generator because LLMs are bad at syntax.

Comments
3 min read
Architecting and Operating Geospatial Workflows with Dagster

Architecting and Operating Geospatial Workflows with Dagster

Comments
8 min read
Databricks Starter Kit

Databricks Starter Kit

1
Comments
7 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.