Skip to content
Navigation menu
Search
Powered by
Search
Algolia
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Optimizing Large-Scale Data Processing in Python: A Guide to Parallelizing CSV Operations
pawan deore
pawan deore
pawan deore
Follow
Dec 1 '24
Optimizing Large-Scale Data Processing in Python: A Guide to Parallelizing CSV Operations
#
webdev
#
python
#
csv
#
dataengineering
1
reaction
Comments
Add Comment
3 min read
Jupyter Notebooks in Docker
Hassan Aftab
Hassan Aftab
Hassan Aftab
Follow
Nov 29 '24
Jupyter Notebooks in Docker
#
datascience
#
docker
#
dataengineering
#
programming
8
reactions
Comments
1
comment
3 min read
🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines
SANKET PATIL
SANKET PATIL
SANKET PATIL
Follow
Nov 29 '24
🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines
#
dataengineering
#
azuredatafactory
#
apipagination
#
devops
4
reactions
Comments
1
comment
3 min read
SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.
Danwycliff Ndwiga
Danwycliff Ndwiga
Danwycliff Ndwiga
Follow
Oct 30 '24
SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.
#
sql
#
database
#
data
#
dataengineering
Comments
Add Comment
1 min read
ACID Properties in Databases: What Happens Without Them?
Meqdad Darwish
Meqdad Darwish
Meqdad Darwish
Follow
Nov 27 '24
ACID Properties in Databases: What Happens Without Them?
#
database
#
data
#
dataengineering
#
sql
5
reactions
Comments
Add Comment
6 min read
🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs
adriens
adriens
adriens
Follow
Nov 25 '24
🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs
#
ai
#
datascience
#
database
#
dataengineering
1
reaction
Comments
8
comments
6 min read
Data Architecture Best Practices
DQOps
DQOps
DQOps
Follow
Nov 23 '24
Data Architecture Best Practices
#
data
#
dataengineering
#
dataquality
#
datascience
1
reaction
Comments
Add Comment
6 min read
My Journey into Data AI and Machine Learning
Lusanda Ndlovu
Lusanda Ndlovu
Lusanda Ndlovu
Follow
Oct 20 '24
My Journey into Data AI and Machine Learning
#
softwaredevelopment
#
ai
#
machinelearning
#
dataengineering
Comments
Add Comment
1 min read
🚀 Unlock the Power of ORC File Format 📊
Pratik Barjatiya
Pratik Barjatiya
Pratik Barjatiya
Follow
Nov 22 '24
🚀 Unlock the Power of ORC File Format 📊
#
dataengineering
#
bigdata
#
datascience
#
data
5
reactions
Comments
Add Comment
1 min read
Designing robust and scalable relational databases: A series of best practices.
Pedro H Goncalves
Pedro H Goncalves
Pedro H Goncalves
Follow
Nov 19 '24
Designing robust and scalable relational databases: A series of best practices.
#
database
#
advanced
#
optimization
#
dataengineering
10
reactions
Comments
5
comments
17 min read
From Data to Decisions: How Machine Learning Works in 2025
Vikas76
Vikas76
Vikas76
Follow
Nov 20 '24
From Data to Decisions: How Machine Learning Works in 2025
#
machinelearning
#
datascience
#
data
#
dataengineering
3
reactions
Comments
Add Comment
3 min read
Why Data Security is Broken and How to Fix it?
Lulu Cheng
Lulu Cheng
Lulu Cheng
Follow
for
jarrid.xyz
Oct 15 '24
Why Data Security is Broken and How to Fix it?
#
security
#
automation
#
devops
#
dataengineering
1
reaction
Comments
Add Comment
5 min read
From ETL and ELT to Reverse ETL
luminousmen
luminousmen
luminousmen
Follow
Oct 15 '24
From ETL and ELT to Reverse ETL
#
dataengineering
#
bigdata
#
data
Comments
Add Comment
4 min read
OLAP (Online Analytical Processing)
Pranav Bakare
Pranav Bakare
Pranav Bakare
Follow
Nov 16 '24
OLAP (Online Analytical Processing)
#
datascience
#
database
#
data
#
dataengineering
5
reactions
Comments
Add Comment
3 min read
The Future of Agentic Systems Podcast
1:42:26
Daniel Davis
Daniel Davis
Daniel Davis
Follow
for
TrustGraph
Nov 15 '24
The Future of Agentic Systems Podcast
#
ai
#
aiops
#
opensource
#
dataengineering
6
reactions
Comments
1
comment
1 min read
What is Data Engineering?
Norton Augusto Herrero dos Santos
Norton Augusto Herrero dos Santos
Norton Augusto Herrero dos Santos
Follow
Oct 12 '24
What is Data Engineering?
#
dataengineering
#
datascience
Comments
Add Comment
1 min read
Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg Tables
Alex Merced
Alex Merced
Alex Merced
Follow
Nov 15 '24
Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg Tables
#
database
#
dataengineering
#
datascience
1
reaction
Comments
Add Comment
13 min read
One Off to One Data Platform: The Unscalable Data Platform [Part 1]
Lulu Cheng
Lulu Cheng
Lulu Cheng
Follow
for
jarrid.xyz
Nov 14 '24
One Off to One Data Platform: The Unscalable Data Platform [Part 1]
#
data
#
architecture
#
devops
#
dataengineering
2
reactions
Comments
Add Comment
3 min read
What are the major advantages of a cloud warehouse solution over an on-premises data warehouse solution?
Hana Sato
Hana Sato
Hana Sato
Follow
Nov 13 '24
What are the major advantages of a cloud warehouse solution over an on-premises data warehouse solution?
#
data
#
datascience
#
dataengineering
Comments
1
comment
5 min read
Databricks vs. Hadoop: Which Platform is Best for Predictive Analytics?
Hana Sato
Hana Sato
Hana Sato
Follow
Nov 13 '24
Databricks vs. Hadoop: Which Platform is Best for Predictive Analytics?
#
dataengineering
#
analytics
#
tutorial
#
ai
2
reactions
Comments
1
comment
7 min read
End-to-End ETL and Sales Dashboard on WWI dataset in Microsoft Fabric
abdulmaleek mubaraq
abdulmaleek mubaraq
abdulmaleek mubaraq
Follow
Oct 8 '24
End-to-End ETL and Sales Dashboard on WWI dataset in Microsoft Fabric
#
tutorial
#
dataengineering
#
devto
#
analytics
Comments
Add Comment
7 min read
The Ultimate Data Engineering Roadmap: From Beginner to Pro
Akhilesh Pratap Shahi
Akhilesh Pratap Shahi
Akhilesh Pratap Shahi
Follow
Nov 10 '24
The Ultimate Data Engineering Roadmap: From Beginner to Pro
#
dataengineering
#
datascience
#
computerscience
#
machinelearning
6
reactions
Comments
1
comment
8 min read
Data Analysis: The Unsung Hero of Modern Business
Milcah03
Milcah03
Milcah03
Follow
Oct 7 '24
Data Analysis: The Unsung Hero of Modern Business
#
datascience
#
dataengineering
#
writing
#
datastructures
Comments
Add Comment
2 min read
Intro to SQL using Apache Iceberg and Dremio
Alex Merced
Alex Merced
Alex Merced
Follow
Nov 8 '24
Intro to SQL using Apache Iceberg and Dremio
#
database
#
sql
#
dataengineering
#
datascience
4
reactions
Comments
Add Comment
22 min read
5 Best ETL Tools: A Comprehensive Comparison Guide
Sourabh Gupta
Sourabh Gupta
Sourabh Gupta
Follow
Oct 28 '24
5 Best ETL Tools: A Comprehensive Comparison Guide
#
etl
#
datascience
#
dataengineering
#
learning
1
reaction
Comments
Add Comment
3 min read
loading...
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account