Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Automating Serverless Data Ingestion: How to Connect External APIs to BigQuery using Python and Cloud Functions
Amine Laatfa
Amine Laatfa
Amine Laatfa
Follow
Jan 1
Automating Serverless Data Ingestion: How to Connect External APIs to BigQuery using Python and Cloud Functions
#
googlecloud
#
bigquery
#
marketing
#
dataengineering
Comments
Add Comment
12 min read
Why Data SLAs Fail — and How to Enforce Them with a Unified Reliability Framework
Baharath Bathula
Baharath Bathula
Baharath Bathula
Follow
Jan 1
Why Data SLAs Fail — and How to Enforce Them with a Unified Reliability Framework
#
dataengineering
#
aws
#
machinelearning
#
analytics
Comments
Add Comment
2 min read
Building Streaming Iceberg Tables for Real-Time Logistics Analytics
Eliana Lam
Eliana Lam
Eliana Lam
Follow
Nov 29 '25
Building Streaming Iceberg Tables for Real-Time Logistics Analytics
#
analytics
#
dataengineering
#
architecture
#
opensource
Comments
Add Comment
4 min read
The Great Table Format Debate: A Deep Dive into Apache Iceberg, Delta Lake, and Apache Hudi
Data Tech Bridge
Data Tech Bridge
Data Tech Bridge
Follow
Dec 31 '25
The Great Table Format Debate: A Deep Dive into Apache Iceberg, Delta Lake, and Apache Hudi
#
database
#
dataengineering
#
iceberg
#
apachehudi
1
 reaction
Comments
Add Comment
18 min read
Amazon Kinesis vs Amazon MSK: The Complete Guide for Stream Processing on AWS
Data Tech Bridge
Data Tech Bridge
Data Tech Bridge
Follow
Dec 30 '25
Amazon Kinesis vs Amazon MSK: The Complete Guide for Stream Processing on AWS
#
architecture
#
aws
#
dataengineering
Comments
Add Comment
29 min read
Day 8: Accelerating Spark Joins - Broadcast, Shuffle Optimization & Skew Handling
Sandeep
Sandeep
Sandeep
Follow
Dec 9 '25
Day 8: Accelerating Spark Joins - Broadcast, Shuffle Optimization & Skew Handling
#
dataengineering
#
python
#
spark
#
bigdata
Comments
Add Comment
2 min read
Mastering Serverless Data Pipelines: AWS Step Functions Best Practices for 2026
Jubin Soni
Jubin Soni
Jubin Soni
Follow
Dec 30 '25
Mastering Serverless Data Pipelines: AWS Step Functions Best Practices for 2026
#
aws
#
serverless
#
stepfunctions
#
dataengineering
Comments
Add Comment
5 min read
2025 Year in Review: Apache Iceberg, Polaris, Parquet, and Arrow
Alex Merced
Alex Merced
Alex Merced
Follow
Dec 29 '25
2025 Year in Review: Apache Iceberg, Polaris, Parquet, and Arrow
#
architecture
#
bigdata
#
opensource
#
dataengineering
Comments
Add Comment
6 min read
A Stranger In a New Town: CsvPath metadata fields
David Kershaw
David Kershaw
David Kershaw
Follow
Nov 25 '25
A Stranger In a New Town: CsvPath metadata fields
#
metadata
#
dataengineering
#
csv
#
datascience
Comments
Add Comment
6 min read
Unified Data Fabric: Serverless Spark on ROSA Integrating with AWS Glue Catalog
Marco Gonzalez
Marco Gonzalez
Marco Gonzalez
Follow
Dec 29 '25
Unified Data Fabric: Serverless Spark on ROSA Integrating with AWS Glue Catalog
#
serverless
#
kubernetes
#
aws
#
dataengineering
8
 reactions
Comments
1
 comment
39 min read
Interesting links - November 2025
Robin Moffatt
Robin Moffatt
Robin Moffatt
Follow
Dec 17 '25
Interesting links - November 2025
#
data
#
dataengineering
#
kafka
#
flink
Comments
Add Comment
19 min read
đź’€ RIP Copy-Paste: Google NotebookLM Just Killed Manual Data Entry
Siddhesh Surve
Siddhesh Surve
Siddhesh Surve
Follow
Dec 29 '25
đź’€ RIP Copy-Paste: Google NotebookLM Just Killed Manual Data Entry
#
ai
#
productivity
#
google
#
dataengineering
Comments
Add Comment
3 min read
dupl
Query Filter
Query Filter
Query Filter
Follow
Nov 25 '25
dupl
#
sql
#
dataengineering
#
backend
#
database
Comments
Add Comment
1 min read
Apache Dev List Digest: Iceberg, Polaris, Arrow & Parquet (Nov 18–24, 2025)
Alex Merced
Alex Merced
Alex Merced
Follow
Nov 24 '25
Apache Dev List Digest: Iceberg, Polaris, Arrow & Parquet (Nov 18–24, 2025)
#
data
#
dataengineering
#
opensource
#
resources
Comments
Add Comment
5 min read
Building a Realistic Banking Dummy Data Generator with Bad-Data Simulation
Benjamin Ibrulj
Benjamin Ibrulj
Benjamin Ibrulj
Follow
Dec 29 '25
Building a Realistic Banking Dummy Data Generator with Bad-Data Simulation
#
dataengineering
#
python
#
opensource
#
sql
1
 reaction
Comments
Add Comment
1 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account