DEV Community

# bigdata

Posts

From Snowflake to Databend: Leading Game Platform replaced Snowflake with Databend Cloud for real-time Data Cloud

Comments
4 min read
Java for Data Analysis: Building a Data Analyzer with Apache Spark That Competes with Python

1
Comments
6 min read
🚀 Hey DEV Community! Let’s Build Something Awesome Together

2
Comments
1 min read
🎉 Apache Ambari 3.0.0 Released: A New Chapter in Hadoop Cluster Management

4
Comments 2
3 min read
Apache PySpark

5
Comments
1 min read
Is Storage-Computing Separation Really Necessary? From the Architectural Debate to the Practical Analysis of Doris

1
Comments
4 min read
Building an Efficient and Cost-Effective Business Data Analytics System with Databend Cloud

Comments
7 min read
build-my-own-datalake: Starting from PoC

Comments
5 min read
The two versions of Parquet

2
Comments
5 min read
How to Load Datasets Efficiently in Pandas: A Complete Guide

8
Comments 2
4 min read
Using Apache Parquet to Optimize Data Handling in a Real-Time Ad Exchange Platform

2
Comments
3 min read
Mastering SQL for Data Engineering: Advanced Queries, Optimization, and Data Modeling Best Practices

Comments
4 min read
MapReduce Simplified: Understand Distributed Processing with the Same Logic as SQL

2
Comments
4 min read
How to Calculate the Return on Investment for Data Analytics

1
Comments
5 min read
5 Game-Changing Habits to Master Your Data Science Journey

6
Comments
4 min read