DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Building an Efficient and Cost-Effective Business Data Analytics System with Databend Cloud

Building an Efficient and Cost-Effective Business Data Analytics System with Databend Cloud

Comments
7 min read
🚀 Kyuubi + Apache Spark: Big Data, Smarter Execution

🚀 Kyuubi + Apache Spark: Big Data, Smarter Execution

Comments 1
1 min read
build-my-own-datalake: Part 1

build-my-own-datalake: Part 1

Comments
4 min read
No le temas a AWS LakeFormation

No le temas a AWS LakeFormation

Comments
2 min read
Implementing Real-Time Data Processing Using Apache Flink

Implementing Real-Time Data Processing Using Apache Flink

Comments
3 min read
Boost Your Data Transfer Speed by 100x with Arrow Flight SQL in Just 3 Minutes

Boost Your Data Transfer Speed by 100x with Arrow Flight SQL in Just 3 Minutes

Comments
5 min read
Optimizing Data Lake Storage Architectures for High-Volume, High-Velocity Data

Optimizing Data Lake Storage Architectures for High-Volume, High-Velocity Data

Comments
4 min read
The Future of Big Data: Key Trends Shaping 2025

The Future of Big Data: Key Trends Shaping 2025

Comments
1 min read
Designing a Scalable Shuffle Service for Big Data on AWS

Designing a Scalable Shuffle Service for Big Data on AWS

Comments
3 min read
From Overload to Insight: Big Data in Mastery of Web Applications

From Overload to Insight: Big Data in Mastery of Web Applications

Comments
5 min read
Exploring Data Integration and the Evolution of Apache SeaTunnel Architecture

Exploring Data Integration and the Evolution of Apache SeaTunnel Architecture

Comments
4 min read
Analyzing billing information using BigQuery

Analyzing billing information using BigQuery

Comments
3 min read
DuckDB vs. ClickHouse Local: A Comparative Analysis for Analytical Workloads

DuckDB vs. ClickHouse Local: A Comparative Analysis for Analytical Workloads

1
Comments
4 min read
Data Transformation

Data Transformation

1
Comments
1 min read
AWS Athena

AWS Athena

Comments
3 min read
Business Intelligence, Data Analytics, and Predictive Analytics – A comparative analysis for decision-makers

Business Intelligence, Data Analytics, and Predictive Analytics – A comparative analysis for decision-makers

Comments
4 min read
Using DolphinScheduler API to Achieve Efficient Batch Workflow Import and Script Deployment

Using DolphinScheduler API to Achieve Efficient Batch Workflow Import and Script Deployment

6
Comments
3 min read
Data formats - how and when

Data formats - how and when

Comments
3 min read
The two versions of Parquet

The two versions of Parquet

2
Comments
5 min read
How to Load Datasets Efficiently in Pandas: A Complete Guide

How to Load Datasets Efficiently in Pandas: A Complete Guide

8
Comments 2
4 min read
Vector search using Alibaba Cloud inference API and semantic text

Vector search using Alibaba Cloud inference API and semantic text

Comments
10 min read
Reliability in Data-Intensive Applications

Reliability in Data-Intensive Applications

3
Comments 1
3 min read
Using Apache Parquet to Optimize Data Handling in a Real-Time Ad Exchange Platform

Using Apache Parquet to Optimize Data Handling in a Real-Time Ad Exchange Platform

2
Comments
3 min read
Mastering SQL for Data Engineering: Advanced Queries, Optimization, and Data Modeling Best Practices

Mastering SQL for Data Engineering: Advanced Queries, Optimization, and Data Modeling Best Practices

Comments
4 min read
MapReduce Simplified: Understand Distributed Processing with the Same Logic as SQL

MapReduce Simplified: Understand Distributed Processing with the Same Logic as SQL

1
Comments
4 min read
loading...