DEV Community

# data

Posts

Introduction to Data Engineering Concepts |2| Understanding Data Sources and Ingestion

1
Comments
4 min read
Introduction to Data Engineering Concepts |3| ETL vs ELT – Understanding Data Pipelines

Comments
4 min read
Introduction to Data Engineering Concepts |4| Batch Processing Fundamentals

1
Comments
4 min read
When Small Parquet Files Become a Big Problem (and How I Ended Up Writing a Compactor in PyArrow)

17
Comments 2
5 min read
What is Geo-Redundancy? A Comprehensive Guide

Comments
3 min read
How Data Mining is Shaping the Future of Algorithmic Trading

Comments
4 min read
Cleaning data in PostgreSQL.

1
Comments
7 min read
Choosing the Right Dataset for Your Image Classification Project

1
Comments
2 min read
What is Synthetic Data?

Comments
1 min read
Introduction to ARIMA: How I Gained Intuition Behind it

Comments
7 min read
How to train LLM faster

4
Comments
3 min read
CDs to DNA: The Future of Data Storage 🧬

4
Comments 5
3 min read
Top 10 tools to build and deploy your next GenAI Application

8
Comments
3 min read
Excel For Data Analysis: A Comprehensive Guide To Mastering Data Insights

Comments
4 min read
Top 5 Cloud Data Management Challenges and How to Overcome Them

Comments
4 min read
Interesting links - March 2025

Comments
7 min read
How I Automated Crypto Price Tracking with Apache Airflow & CoinGecko

4
Comments 3
2 min read
Building a Multilingual Business Assistant for Kenya

1
Comments
3 min read
My Journey from Web2 Data Analytics to Web3 On-Chain Analysis

Comments
1 min read
Oracle AI Database 26ai : Group By and Having using Column Aliases

Comments
2 min read
CRISP-DM (Cross-Industry Standard Process for Data Mining)

5
Comments
5 min read
Was It Really Necessary? Well, Yes

Comments
2 min read
Beginner’s Guide to Using Variables in Python

4
Comments 1
4 min read
How Data Science and Analytics Are Revolutionizing Today’s Industries.

2
Comments 2
4 min read
Rerun: Revolutionizing Data Visualization for Modern Projects

Comments
2 min read