DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Building Apache Pinot and Presto

Building Apache Pinot and Presto

2
Comments
4 min read
O que é dark data?

O que é dark data?

10
Comments
1 min read
Apache-Spark introduction for SQL developers

Apache-Spark introduction for SQL developers

2
Comments
7 min read
Learning Big Data - Step by Step

Learning Big Data - Step by Step

2
Comments
1 min read
SeaTunnel Connector Access Plan

SeaTunnel Connector Access Plan

4
Comments
12 min read
Entrepreneurs must learn from Lord Ganesha!!!

Entrepreneurs must learn from Lord Ganesha!!!

6
Comments
2 min read
What is Big Data? Characteristics, types, and technologies

What is Big Data? Characteristics, types, and technologies

1
Comments
11 min read
Why we don’t use Spark

Why we don’t use Spark

7
Comments
7 min read
Top Skills You Need in Testing Big Data projects

Top Skills You Need in Testing Big Data projects

Comments
3 min read
Design Pattern of Streaming Enrichment

Design Pattern of Streaming Enrichment

3
Comments
6 min read
Data Lake vs Data Warehouse

Data Lake vs Data Warehouse

9
Comments
3 min read
Spark tip: Disable Coalescing Post Shuffle Partitions for compute intensive tasks

Spark tip: Disable Coalescing Post Shuffle Partitions for compute intensive tasks

3
Comments 3
3 min read
Stream Processing Introduction

Stream Processing Introduction

2
Comments 1
6 min read
How to run Amazon EMR Serverless with --packages flag

How to run Amazon EMR Serverless with --packages flag

8
Comments 2
6 min read
The Relational DBs (RDB)

The Relational DBs (RDB)

12
Comments 2
4 min read
The story behind Apache SeaTunnel’s evolving from a data integration component to an enterprise-level service

The story behind Apache SeaTunnel’s evolving from a data integration component to an enterprise-level service

5
Comments
12 min read
Big Data Vs Small Data

Big Data Vs Small Data

7
Comments 1
2 min read
Learning Workflow Schedulers (Oozie)

Learning Workflow Schedulers (Oozie)

2
Comments
5 min read
There will be 175 Zettabytes of data in the world by 2025. Where will we store it?

There will be 175 Zettabytes of data in the world by 2025. Where will we store it?

18
Comments 2
1 min read
How discord manage 300M socket connection

How discord manage 300M socket connection

13
Comments
2 min read
Here is why you need a message broker

Here is why you need a message broker

57
Comments 4
7 min read
How to filter columns in HBase Shell

How to filter columns in HBase Shell

5
Comments
3 min read
Visual task orchestration & Drag & Drop, Scaleph Data integration practice based on SeaTunnel

Visual task orchestration & Drag & Drop, Scaleph Data integration practice based on SeaTunnel

10
Comments
12 min read
The best Open-source lakehouse project, LakeSoul 2.0, supports snapshot, rollback, Flink, and Hive interconnection

The best Open-source lakehouse project, LakeSoul 2.0, supports snapshot, rollback, Flink, and Hive interconnection

9
Comments
5 min read
Creating a Subtitle Search Engine using the Stanford Parts of Speech Tagger

Creating a Subtitle Search Engine using the Stanford Parts of Speech Tagger

3
Comments
4 min read
loading...