DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Compression algorithms in Parquet Java

Compression algorithms in Parquet Java

3
Comments 2
7 min read
Top 10 Web Scraping Tools in 2025 (Free & Paid Options)

Top 10 Web Scraping Tools in 2025 (Free & Paid Options)

8
Comments 4
5 min read
When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability

When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability

Comments
5 min read
Goodbye Kafka: Build a Low-Cost User Analysis System

Goodbye Kafka: Build a Low-Cost User Analysis System

Comments
5 min read
The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀

The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀

1
Comments
4 min read
Introduction to Hadoop:)

Introduction to Hadoop:)

6
Comments
10 min read
Big Data Trends That Will Impact Your Business In 2025

Big Data Trends That Will Impact Your Business In 2025

5
Comments
6 min read
The Heart of DolphinScheduler: In-Depth Analysis of the Quartz Scheduling Framework

The Heart of DolphinScheduler: In-Depth Analysis of the Quartz Scheduling Framework

8
Comments
3 min read
SQL Filtering and Sorting with Real-life Examples

SQL Filtering and Sorting with Real-life Examples

1
Comments
4 min read
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

1
Comments 8
5 min read
Big Data

Big Data

Comments
1 min read
Introduction to Data lakes: The future of big data storage

Introduction to Data lakes: The future of big data storage

5
Comments
2 min read
Construyendo una aplicación con Change Data Capture (CDC) utilizando Debezium, Kafka y NiFi

Construyendo una aplicación con Change Data Capture (CDC) utilizando Debezium, Kafka y NiFi

1
Comments
3 min read
The Apache Iceberg™ Small File Problem

The Apache Iceberg™ Small File Problem

13
Comments
3 min read
System Design 09 - Data Partitioning: Dividing to Conquer Big Data

System Design 09 - Data Partitioning: Dividing to Conquer Big Data

Comments
2 min read
How AWS Handles the 5 Vs of Big Data: Updated for 2025.

How AWS Handles the 5 Vs of Big Data: Updated for 2025.

11
Comments
3 min read
Introduction to Messaging Systems with Kafka

Introduction to Messaging Systems with Kafka

1
Comments
16 min read
Best Practices for Data Security in Big Data Projects

Best Practices for Data Security in Big Data Projects

Comments
6 min read
🚀 Unlock the Power of ORC File Format 📊

🚀 Unlock the Power of ORC File Format 📊

5
Comments
1 min read
SeaTunnel-Powered Data Integration: How 58 Group Handles Over 500 Billion+ Data Points Daily

SeaTunnel-Powered Data Integration: How 58 Group Handles Over 500 Billion+ Data Points Daily

5
Comments 2
5 min read
5 Big Data Use Cases that Retailers Fail to Use for Actionable Insights

5 Big Data Use Cases that Retailers Fail to Use for Actionable Insights

Comments
3 min read
Introduction to Big Data Analysis

Introduction to Big Data Analysis

8
Comments
13 min read
Understanding Star Schema vs. Snowflake Schema

Understanding Star Schema vs. Snowflake Schema

7
Comments
1 min read
Why Scala is the Best Choice for Big Data Applications: Advantages Over Java and Python

Why Scala is the Best Choice for Big Data Applications: Advantages Over Java and Python

Comments
6 min read
Processando 20 milhões de registros em menos de 5 segundos com Apache Hive.

Processando 20 milhões de registros em menos de 5 segundos com Apache Hive.

8
Comments
8 min read
loading...