#dataengineering
Data Engineering with Scala: Mastering Real-Time Data Processing with Apache Flink and Google Pub/Sub
Geazi Anc
Oct 18 '24
#dataengineering #scala #datascience #flink
7 reactions
15 min read
Clear Link Between DevSecOps and Data Engineering
Regnard Raquedan
Sep 13 '24
#dataengineering #devops #devsecops #cloud
1 min read
Still Using SQL, Python, & Excel for Data Deduplication? Here's Why You Need Better Tools.
Farah Kim
Oct 17 '24
#algorithms #ai #dataengineering
5 reactions
4 min read
Building a Big Data Playground Sandbox for Learning
Abdullah Haggag
Oct 17 '24
#dataengineering #bigdata #opensource
6 reactions
5 min read
Capture Browser XHR/Fetch API Response Automatically into JSON Files
Dendi Handian
Sep 12 '24
#help #dataengineering #chrome #javascript
1 min read
The True Cost of Poor Data Quality: Why It Matters and How to Improve It
Mark Yu
Oct 16 '24
#database #datascience #dataengineering #management
3 reactions
6 min read
Explaining the History of Data Lakehouse
Pavol Z. Kutaj
Oct 14 '24
#lakehouse #dataengineering #warehouse
2 min read
Building a User-Friendly, Budget-Friendly Alternative to dbt Cloud
Marco Porracin
Sep 8 '24
#dbt #dataengineering #opensource #datascience
1 min read
O que é Engenharia de Dados? (What Is Data Engineering?)
Norton Augusto Herrero dos Santos
Oct 12 '24
#dataengineering #datascience
3 reactions
1 min read
How SQL Spatial Data Solves Real-World Problems
Nuthan Kishore
Sep 7 '24
#firstpost #spatialdata #dataengineering
6 min read
Explaining CDC (Change Data Capture)
Pavol Z. Kutaj
Oct 11 '24
#databricks #dataengineering
1 min read
Handling Outliers 101: Why the IQR Method is Your Go-To Tool
allan-pg
Oct 10 '24
#python #datascience #dataengineering #data
2 reactions
3 min read
Go vs Python for File Processing: A Performance and Architecture Perspective
Nico Bistolfi
Oct 9 '24
#python #go #performance #dataengineering
2 reactions
2 comments
5 min read
Exploring Data Operations with PySpark, Pandas, DuckDB, Polars, and DataFusion in a Python Notebook
Alex Merced
Oct 7 '24
#python #database #datascience #dataengineering
4 reactions
13 min read
Secure Data Stack: Navigating Adoption Challenges of Data Encryption
Lulu Cheng for jarrid.xyz
Sep 3 '24
#security #dataengineering #encryption #infosec
1 reaction
5 min read
Analyzing Airbnb Listings in Chicago: A Power BI Dashboard Project
Raj Tiwari
Oct 7 '24
#datascience #dataengineering #data
1 reaction
4 min read
Python 101: Introduction to Python as a Data Analytics Tool
Gichuki Edwin
Oct 7 '24
#python #analytics #datascience #dataengineering
3 min read
Ultimate Directory of Apache Iceberg Resources
Alex Merced
Oct 5 '24
#database #dataengineering #datascience #elasticsearch
1 reaction
14 min read
Understanding OLTP and Choosing the Right Database
Chetan Gupta
Oct 4 '24
#dataengineering #mongodb #postgressql #mysql
1 reaction
6 min read
Change Data Capture (CDC) when there is no CDC
Alex Merced
Oct 4 '24
#database #dataengineering #postgres
1 reaction
11 min read
The Ultimate Guide to Data Engineering
Milcah
Aug 27 '24
#dataengineering #data
2 min read
Evolution of Data Sharding Towards Automation and Flexibility
Apache Doris
Aug 27 '24
#opensource #dataengineering #database #automation
15 min read
Data Showdown: OLAP vs. OLTP – The Battle of Real-Time and Analytics Titans
Chetan Gupta
Sep 29 '24
#bigdata #dataengineering #understanding #database
5 min read
Serverless PDF Processing with AWS Lambda and Textract
Olga Shabalina for AWS Community Builders
Sep 28 '24
#cloudcomputing #serverless #lambda #dataengineering
10 reactions
2 comments
9 min read
The Simplest Data Architecture
Aram Panasenco
Sep 25 '24
#data #architecture #dataengineering #analytics
1 reaction
21 min read