Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Building a Production-Ready Serverless App on Google Cloud (Part 1: Architecture)
Patricio Navarro
Patricio Navarro
Patricio Navarro
Follow
for
Google Developer Experts
Mar 31
Building a Production-Ready Serverless App on Google Cloud (Part 1: Architecture)
#
serverless
#
dataengineering
#
cloud
#
tutorial
14
 reactions
Comments
Add Comment
5 min read
Data Pipeline Architecture: From Messy CSVs to Clean Database
Max Klein
Max Klein
Max Klein
Follow
Mar 2
Data Pipeline Architecture: From Messy CSVs to Clean Database
#
python
#
dataengineering
#
tutorial
#
database
Comments
Add Comment
5 min read
Lightweight ETL on AWS Lambda Using DuckDB and Snowflake Connector
Aki
Aki
Aki
Follow
for
AWS Community Builders
Apr 4
Lightweight ETL on AWS Lambda Using DuckDB and Snowflake Connector
#
aws
#
snowflake
#
dataengineering
6
 reactions
Comments
Add Comment
6 min read
Epistemic Control Systems: Anchoring on Kafka
Abubakar
Abubakar
Abubakar
Follow
Apr 4
Epistemic Control Systems: Anchoring on Kafka
#
architecture
#
dataengineering
#
distributedsystems
#
systemdesign
Comments
Add Comment
4 min read
Building an Incremental Zoho Desk to BigQuery Pipeline: Lessons from the Trenches
Blessing Angus
Blessing Angus
Blessing Angus
Follow
Feb 28
Building an Incremental Zoho Desk to BigQuery Pipeline: Lessons from the Trenches
#
dataengineering
#
zohodeskapi
#
bigquery
#
elt
1
 reaction
Comments
Add Comment
7 min read
Stop Manually Entering Medical Data: How to Automate PDF Lab Reports with LayoutParser & OCR
Beck_Moulton
Beck_Moulton
Beck_Moulton
Follow
Mar 1
Stop Manually Entering Medical Data: How to Automate PDF Lab Reports with LayoutParser & OCR
#
machinelearning
#
python
#
dataengineering
#
healthtech
1
 reaction
Comments
Add Comment
3 min read
Shopify Automation: How I Managed an 80,000-Product Catalog with Python & Pandas
Niccolò Colombini
Niccolò Colombini
Niccolò Colombini
Follow
Apr 3
Shopify Automation: How I Managed an 80,000-Product Catalog with Python & Pandas
#
automation
#
dataengineering
#
productivity
#
python
Comments
Add Comment
3 min read
Synthetic Data and the Privacy Problem: Beyond Alice and Bob
Aaron Wiegel
Aaron Wiegel
Aaron Wiegel
Follow
Mar 4
Synthetic Data and the Privacy Problem: Beyond Alice and Bob
#
dataengineering
#
testing
1
 reaction
Comments
Add Comment
10 min read
how i use cursor and ai agents to write dbt tests and documentation
Philip Hern
Philip Hern
Philip Hern
Follow
Apr 3
how i use cursor and ai agents to write dbt tests and documentation
#
dbt
#
cursor
#
ai
#
dataengineering
1
 reaction
Comments
Add Comment
2 min read
dbt + OpenLineage #1: Why dbt-ol Is a Post-Processor (Not a Plugin) — and Why It Matters
Byron Hsieh
Byron Hsieh
Byron Hsieh
Follow
Mar 4
dbt + OpenLineage #1: Why dbt-ol Is a Post-Processor (Not a Plugin) — and Why It Matters
#
dbt
#
openlineage
#
dataengineering
#
python
Comments
Add Comment
7 min read
PardoX 0.3.1: The GPU Awakening and the Conquest of the Universal Backend
Alberto Cardenas
Alberto Cardenas
Alberto Cardenas
Follow
Mar 1
PardoX 0.3.1: The GPU Awakening and the Conquest of the Universal Backend
#
showdev
#
backend
#
dataengineering
#
performance
1
 reaction
Comments
Add Comment
19 min read
Feed Rescue: Converting Raw Ulta Scrapes into Google Merchant Center XML
Erika S. Adkins
Erika S. Adkins
Erika S. Adkins
Follow
Feb 28
Feed Rescue: Converting Raw Ulta Scrapes into Google Merchant Center XML
#
webscraping
#
python
#
node
#
dataengineering
1
 reaction
Comments
Add Comment
5 min read
the future of data engineering workflows with ai
Philip Hern
Philip Hern
Philip Hern
Follow
Apr 3
the future of data engineering workflows with ai
#
dataengineering
#
ai
#
workflow
#
future
1
 reaction
Comments
Add Comment
2 min read
100 Spark Scenario Based Interview Questions and Answers
Hannah Usmedynska
Hannah Usmedynska
Hannah Usmedynska
Follow
Apr 3
100 Spark Scenario Based Interview Questions and Answers
#
career
#
data
#
dataengineering
#
interview
1
 reaction
Comments
1
 comment
24 min read
ETL Pipeline: The 6-Phase Pattern That Cuts Debugging From Hours to Minutes
Kunwar Jhamat
Kunwar Jhamat
Kunwar Jhamat
Follow
Mar 4
ETL Pipeline: The 6-Phase Pattern That Cuts Debugging From Hours to Minutes
#
etl
#
dataengineering
#
programming
#
architecture
1
 reaction
Comments
Add Comment
5 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account