Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Reading CSVs with varying column counts that pandas cannot read using DuckDB
nk_Enuke
nk_Enuke
nk_Enuke
Follow
Aug 16 '25
Reading CSVs with varying column counts that pandas cannot read using DuckDB
#
dataengineering
#
duckdb
#
csv
#
programming
1
 reaction
Comments
Add Comment
3 min read
Working with Apache to automate collection of Weather data for Kenya’s major Agricultural Areas
Hillary Onyango
Hillary Onyango
Hillary Onyango
Follow
Jul 26 '25
Working with Apache to automate collection of Weather data for Kenya’s major Agricultural Areas
#
dataengineering
#
dataanalytics
#
apacheairflow
#
luxdev
Comments
Add Comment
5 min read
Building a Data Career: The Skills That Truly Matter
DataLane
DataLane
DataLane
Follow
Aug 14 '25
Building a Data Career: The Skills That Truly Matter
#
datascience
#
dataengineering
#
career
#
sql
10
 reactions
Comments
Add Comment
5 min read
You Can't Trust COUNT and SUM: Scalable Data Validation with Merkle Trees
Andrey
Andrey
Andrey
Follow
Aug 14 '25
You Can't Trust COUNT and SUM: Scalable Data Validation with Merkle Trees
#
dataengineering
#
dataquality
#
datamonitoring
#
sql
2
 reactions
Comments
1
 comment
8 min read
Unable to emit metadata to DataHub GMS with Airflow - a solution
Ivica Kolenkaš
Ivica Kolenkaš
Ivica Kolenkaš
Follow
Aug 14 '25
Unable to emit metadata to DataHub GMS with Airflow - a solution
#
airflow
#
dataengineering
#
datahub
#
data
Comments
Add Comment
4 min read
Snowflake RBAC 101 – Episode 2: Role Hierarchies & Least Privilege
Vinicius Fagundes
Vinicius Fagundes
Vinicius Fagundes
Follow
Aug 14 '25
Snowflake RBAC 101 – Episode 2: Role Hierarchies & Least Privilege
#
dataengineering
#
dataprivacy
#
datawarehouse
#
snowflake
Comments
Add Comment
1 min read
Lightweight ETL with AWS Glue Python Shell, DuckDB, and PyIceberg
Aki
Aki
Aki
Follow
for
AWS Community Builders
Aug 13 '25
Lightweight ETL with AWS Glue Python Shell, DuckDB, and PyIceberg
#
aws
#
iceberg
#
dataengineering
#
duckdb
5
 reactions
Comments
1
 comment
7 min read
PyIceberg on AWS Lambda: Comparing GlueCatalog and REST Catalog Access Methods
Aki
Aki
Aki
Follow
for
AWS Community Builders
Aug 13 '25
PyIceberg on AWS Lambda: Comparing GlueCatalog and REST Catalog Access Methods
#
aws
#
iceberg
#
dataengineering
2
 reactions
Comments
Add Comment
3 min read
The Rise of Real-Time Data: Why Batch Might Be Fading
Milcah03
Milcah03
Milcah03
Follow
Aug 2 '25
The Rise of Real-Time Data: Why Batch Might Be Fading
#
ai
#
kafka
#
airflow
#
dataengineering
10
 reactions
Comments
Add Comment
3 min read
📚 A Complete Guide to Data Science Courses: How to Choose, What to Learn, and Where to Begin
Virat Kohli
Virat Kohli
Virat Kohli
Follow
Jul 10 '25
📚 A Complete Guide to Data Science Courses: How to Choose, What to Learn, and Where to Begin
#
datascience
#
dataengineering
#
database
#
machinelearning
Comments
Add Comment
5 min read
Engineering with SOLID, DRY, KISS, YAGNI and GRASP
Andrey
Andrey
Andrey
Follow
Aug 13 '25
Engineering with SOLID, DRY, KISS, YAGNI and GRASP
#
dataengineering
#
softwareengineering
#
softwaredevelopment
#
software
1
 reaction
Comments
Add Comment
16 min read
Three Formats Walk into a Lakehouse: Iceberg, Delta and Hudi in a Local Setup You Can Run on Your Laptop
Olga Braginskaya
Olga Braginskaya
Olga Braginskaya
Follow
Aug 12 '25
Three Formats Walk into a Lakehouse: Iceberg, Delta and Hudi in a Local Setup You Can Run on Your Laptop
#
dataengineering
#
data
#
tutorial
#
learning
10
 reactions
Comments
4
 comments
16 min read
Which is Best for Real Time Dashboards: Airbyte, Fivetran, or Estuary
Sourabh Gupta
Sourabh Gupta
Sourabh Gupta
Follow
Aug 12 '25
Which is Best for Real Time Dashboards: Airbyte, Fivetran, or Estuary
#
dataengineering
#
etl
#
elt
1
 reaction
Comments
Add Comment
6 min read
Personal Picks: Data Product News (July 9, 2025)
Sagara
Sagara
Sagara
Follow
Jul 9 '25
Personal Picks: Data Product News (July 9, 2025)
#
dataengineering
#
lakehouse
#
snowflake
Comments
Add Comment
6 min read
Key Concepts Every Data Engineer Should Master
George
George
George
Follow
Aug 11 '25
Key Concepts Every Data Engineer Should Master
#
datascience
#
dataengineering
#
olap
#
partitioning
4
 reactions
Comments
Add Comment
6 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account