DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Playing PyFlink in a Nutshell

Playing PyFlink in a Nutshell

8
Comments
5 min read
Podcast with Josh Long on Apache Pulsar and Spring

Podcast with Josh Long on Apache Pulsar and Spring

3
Comments
1 min read
Playing PyFlink from Scratch

Playing PyFlink from Scratch

2
Comments
4 min read
Optimizing massive MongoDB inserts, load 50 million records faster by 33%!

Optimizing massive MongoDB inserts, load 50 million records faster by 33%!

16
Comments 1
12 min read
Docker Alternatives That Can Boost Your Productivity

Docker Alternatives That Can Boost Your Productivity

1
Comments
4 min read
Building Apache Pinot and Presto

Building Apache Pinot and Presto

2
Comments
4 min read
O que é dark data?

O que é dark data?

10
Comments
1 min read
Apache-Spark introduction for SQL developers

Apache-Spark introduction for SQL developers

2
Comments
7 min read
Learning Big Data - Step by Step

Learning Big Data - Step by Step

2
Comments
1 min read
SeaTunnel Connector Access Plan

SeaTunnel Connector Access Plan

4
Comments
12 min read
What is Big Data? Characteristics, types, and technologies

What is Big Data? Characteristics, types, and technologies

1
Comments
11 min read
Why we don’t use Spark

Why we don’t use Spark

7
Comments
7 min read
Entrepreneurs must learn from Lord Ganesha!!!

Entrepreneurs must learn from Lord Ganesha!!!

6
Comments
2 min read
Top Skills You Need in Testing Big Data projects

Top Skills You Need in Testing Big Data projects

Comments
3 min read
Design Pattern of Streaming Enrichment

Design Pattern of Streaming Enrichment

3
Comments
6 min read
Data Lake vs Data Warehouse

Data Lake vs Data Warehouse

9
Comments
3 min read
Spark tip: Disable Coalescing Post Shuffle Partitions for compute intensive tasks

Spark tip: Disable Coalescing Post Shuffle Partitions for compute intensive tasks

3
Comments 3
3 min read
Stream Processing Introduction

Stream Processing Introduction

2
Comments 1
6 min read
How to run Amazon EMR Serverless with --packages flag

How to run Amazon EMR Serverless with --packages flag

8
Comments 2
6 min read
The Relational DBs (RDB)

The Relational DBs (RDB)

12
Comments 2
4 min read
The story behind Apache SeaTunnel’s evolving from a data integration component to an enterprise-level service

The story behind Apache SeaTunnel’s evolving from a data integration component to an enterprise-level service

5
Comments
12 min read
Big Data Vs Small Data

Big Data Vs Small Data

7
Comments 1
2 min read
Learning Workflow Schedulers (Oozie)

Learning Workflow Schedulers (Oozie)

2
Comments
5 min read
There will be 175 Zettabytes of data in the world by 2025. Where will we store it?

There will be 175 Zettabytes of data in the world by 2025. Where will we store it?

18
Comments 2
1 min read
How discord manage 300M socket connection

How discord manage 300M socket connection

13
Comments
2 min read
loading...