Skip to content
Navigation menu
Search
Powered by
Search
Algolia
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
All About Parquet Part 06 - Encoding in Parquet | Optimizing for Storage
Alex Merced
Alex Merced
Alex Merced
Follow
Oct 21 '24
All About Parquet Part 06 - Encoding in Parquet | Optimizing for Storage
#
database
#
datascience
#
dataengineering
3
reactions
Comments
Add Comment
6 min read
All About Parquet Part 08 - Reading and Writing Parquet Files in Python
Alex Merced
Alex Merced
Alex Merced
Follow
Oct 21 '24
All About Parquet Part 08 - Reading and Writing Parquet Files in Python
#
database
#
datascience
#
dataengineering
#
data
18
reactions
Comments
Add Comment
5 min read
All About Parquet Part 04 - Schema Evolution in Parquet
Alex Merced
Alex Merced
Alex Merced
Follow
Oct 21 '24
All About Parquet Part 04 - Schema Evolution in Parquet
#
database
#
datascience
#
dataengineering
4
reactions
Comments
1
comment
5 min read
All About Parquet Part 03 - Parquet File Structure | Pages, Row Groups, and Columns
Alex Merced
Alex Merced
Alex Merced
Follow
Oct 21 '24
All About Parquet Part 03 - Parquet File Structure | Pages, Row Groups, and Columns
#
database
#
datascience
#
dataengineering
3
reactions
Comments
Add Comment
5 min read
From a Unified Bronze Layer to Multiple Silver Layers: Streamlining Data Transformation in Databricks Unity Catalog
prakhyatkarri
prakhyatkarri
prakhyatkarri
Follow
Oct 20 '24
From a Unified Bronze Layer to Multiple Silver Layers: Streamlining Data Transformation in Databricks Unity Catalog
#
databricks
#
unitycatalog
#
medallionarchitecture
#
dataengineering
2
reactions
Comments
Add Comment
5 min read
*Mastering Informatica Intelligent Cloud Services (IICS) for Cloud Data Integration*
Rodolfo Mendivil
Rodolfo Mendivil
Rodolfo Mendivil
Follow
Oct 18 '24
*Mastering Informatica Intelligent Cloud Services (IICS) for Cloud Data Integration*
#
iics
#
data
#
etl
#
dataengineering
1
reaction
Comments
Add Comment
3 min read
Data Engineering with Scala: Mastering Real-Time Data Processing with Apache Flink and Google Pub/Sub
Geazi Anc
Geazi Anc
Geazi Anc
Follow
Oct 18 '24
Data Engineering with Scala: Mastering Real-Time Data Processing with Apache Flink and Google Pub/Sub
#
dataengineering
#
scala
#
datascience
#
flink
7
reactions
Comments
Add Comment
15 min read
Clear Link Between DevSecOps and Data Engineering
Regnard Raquedan
Regnard Raquedan
Regnard Raquedan
Follow
Sep 13 '24
Clear Link Between DevSecOps and Data Engineering
#
dataengineering
#
devops
#
devsecops
#
cloud
Comments
Add Comment
1 min read
Still Using SQL, Python, & Excel for Data Deduplication? Here's Why You Need Better Tools.
Farah Kim
Farah Kim
Farah Kim
Follow
Oct 17 '24
Still Using SQL, Python, & Excel for Data Deduplication? Here's Why You Need Better Tools.
#
algorithms
#
ai
#
dataengineering
5
reactions
Comments
Add Comment
4 min read
Building a Big Data Playground Sandbox for Learning
Abdullah Haggag
Abdullah Haggag
Abdullah Haggag
Follow
Oct 17 '24
Building a Big Data Playground Sandbox for Learning
#
dataengineering
#
bigdata
#
opensource
6
reactions
Comments
Add Comment
5 min read
Capture Browser XHR/Fetch API Response Automatically into JSON Files
Dendi Handian
Dendi Handian
Dendi Handian
Follow
Sep 12 '24
Capture Browser XHR/Fetch API Response Automatically into JSON Files
#
help
#
dataengineering
#
chrome
#
javascript
Comments
Add Comment
1 min read
The True Cost of Poor Data Quality: Why It Matters and How to Improve It
Mark Yu
Mark Yu
Mark Yu
Follow
Oct 16 '24
The True Cost of Poor Data Quality: Why It Matters and How to Improve It
#
database
#
datascience
#
dataengineering
#
management
3
reactions
Comments
Add Comment
6 min read
From ETL and ELT to Reverse ETL
luminousmen
luminousmen
luminousmen
Follow
Oct 15 '24
From ETL and ELT to Reverse ETL
#
dataengineering
#
bigdata
#
data
Comments
1
comment
4 min read
Explaining the History of Data Lakehouse
Pavol Z. Kutaj
Pavol Z. Kutaj
Pavol Z. Kutaj
Follow
Oct 14 '24
Explaining the History of Data Lakehouse
#
lakehouse
#
dataengineering
#
warehouse
Comments
Add Comment
2 min read
Building a User-Friendly, Budget-Friendly Alternative to dbt Cloud
Marco Porracin
Marco Porracin
Marco Porracin
Follow
Sep 8 '24
Building a User-Friendly, Budget-Friendly Alternative to dbt Cloud
#
dbt
#
dataengineering
#
opensource
#
datascience
Comments
Add Comment
1 min read
O que é Engenharia de Dados?
Norton Augusto Herrero dos Santos
Norton Augusto Herrero dos Santos
Norton Augusto Herrero dos Santos
Follow
Oct 12 '24
O que é Engenharia de Dados?
#
dataengineering
#
datascience
3
reactions
Comments
Add Comment
1 min read
How SQL Spatial Data Solves Real-World Problems
Nuthan Kishore
Nuthan Kishore
Nuthan Kishore
Follow
Sep 7 '24
How SQL Spatial Data Solves Real-World Problems
#
firstpost
#
spatialdata
#
dataengineering
Comments
Add Comment
6 min read
Explaining CDC (Change Data Capture)
Pavol Z. Kutaj
Pavol Z. Kutaj
Pavol Z. Kutaj
Follow
Oct 11 '24
Explaining CDC (Change Data Capture)
#
databricks
#
dataengineering
Comments
Add Comment
1 min read
Handling Outliers 101: Why the IQR Method is Your Go-To Tool
allan-pg
allan-pg
allan-pg
Follow
Oct 10 '24
Handling Outliers 101: Why the IQR Method is Your Go-To Tool
#
python
#
datascience
#
dataengineering
#
data
2
reactions
Comments
Add Comment
3 min read
Go vs Python for File Processing: A Performance and Architecture Perspective
Nico Bistolfi
Nico Bistolfi
Nico Bistolfi
Follow
Oct 9 '24
Go vs Python for File Processing: A Performance and Architecture Perspective
#
python
#
go
#
performance
#
dataengineering
2
reactions
Comments
2
comments
5 min read
Exploring Data Operations with PySpark, Pandas, DuckDB, Polars, and DataFusion in a Python Notebook
Alex Merced
Alex Merced
Alex Merced
Follow
Oct 7 '24
Exploring Data Operations with PySpark, Pandas, DuckDB, Polars, and DataFusion in a Python Notebook
#
python
#
database
#
datascience
#
dataengineering
4
reactions
Comments
Add Comment
13 min read
Secure Data Stack: Navigating Adoption Challenges of Data Encryption
Lulu Cheng
Lulu Cheng
Lulu Cheng
Follow
for
jarrid.xyz
Sep 3 '24
Secure Data Stack: Navigating Adoption Challenges of Data Encryption
#
security
#
dataengineering
#
encryption
#
infosec
1
reaction
Comments
Add Comment
5 min read
Data Analysis: The Unsung Hero of Modern Business
Milcah03
Milcah03
Milcah03
Follow
Oct 7 '24
Data Analysis: The Unsung Hero of Modern Business
#
datascience
#
dataengineering
#
writing
#
datastructures
Comments
Add Comment
2 min read
Analyzing Airbnb Listings in Chicago: A Power BI Dashboard Project
Raj Tiwari
Raj Tiwari
Raj Tiwari
Follow
Oct 7 '24
Analyzing Airbnb Listings in Chicago: A Power BI Dashboard Project
#
datascience
#
dataengineering
#
data
1
reaction
Comments
Add Comment
4 min read
Python 101: Introduction to Python as a Data Analytics Tool
Gichuki Edwin
Gichuki Edwin
Gichuki Edwin
Follow
Oct 7 '24
Python 101: Introduction to Python as a Data Analytics Tool
#
python
#
analytics
#
datascience
#
dataengineering
Comments
Add Comment
3 min read
loading...
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account