Skip to content
Navigation menu
Search
Powered by
Search
Algolia
Log in
Create account
DEV Community
Close
#
bigdata
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Building Apache Pinot and Presto
ChunTing Wu
ChunTing Wu
ChunTing Wu
Follow
Oct 10 '22
Building Apache Pinot and Presto
#
bigdata
#
eventdriven
#
tutorial
#
programming
2
reactions
Comments
Add Comment
4 min read
O que é dark data?
Rita Carolina
Rita Carolina
Rita Carolina
Follow
for
Feministech
Oct 6 '22
O que é dark data?
#
bigdata
#
braziliandevs
#
darkdata
10
reactions
Comments
Add Comment
1 min read
Apache-Spark introduction for SQL developers
Cesar Mostacero
Cesar Mostacero
Cesar Mostacero
Follow
Sep 29 '22
Apache-Spark introduction for SQL developers
#
apachespark
#
dataengineering
#
beginners
#
bigdata
2
reactions
Comments
Add Comment
7 min read
Learning Big Data - Step by Step
Areeba Farooq
Areeba Farooq
Areeba Farooq
Follow
Sep 27 '22
Learning Big Data - Step by Step
#
bigdata
#
aws
#
hive
#
programming
2
reactions
Comments
Add Comment
1 min read
SeaTunnel Connector Access Plan
Apache SeaTunnel
Apache SeaTunnel
Apache SeaTunnel
Follow
Sep 20 '22
SeaTunnel Connector Access Plan
#
connectordevelopment
#
bigdata
#
datascience
#
programming
4
reactions
Comments
Add Comment
12 min read
Entrepreneurs must learn from Lord Ganesha!!!
Arpit Shrivastava
Arpit Shrivastava
Arpit Shrivastava
Follow
Sep 1 '22
Entrepreneurs must learn from Lord Ganesha!!!
#
bigdata
#
webdev
#
beginners
#
startup
6
reactions
Comments
Add Comment
2 min read
What is Big Data? Characteristics, types, and technologies
Hunter Johnson
Hunter Johnson
Hunter Johnson
Follow
for
Educative
Sep 7 '22
What is Big Data? Characteristics, types, and technologies
#
datascience
#
database
#
bigdata
#
tutorial
1
reaction
Comments
Add Comment
11 min read
Why we don’t use Spark
Karel Vanden Bussche
Karel Vanden Bussche
Karel Vanden Bussche
Follow
for
Lighthouse
Sep 7 '22
Why we don’t use Spark
#
python
#
spark
#
googlecloud
#
bigdata
7
reactions
Comments
Add Comment
7 min read
Top Skills You Need in Testing Big Data projects
Renee Betina Esperas
Renee Betina Esperas
Renee Betina Esperas
Follow
Aug 31 '22
Top Skills You Need in Testing Big Data projects
#
testing
#
bigdata
Comments
Add Comment
3 min read
Design Pattern of Streaming Enrichment
ChunTing Wu
ChunTing Wu
ChunTing Wu
Follow
Aug 29 '22
Design Pattern of Streaming Enrichment
#
eventdriven
#
bigdata
#
architecture
#
programming
3
reactions
Comments
Add Comment
6 min read
Data Lake vs Data Warehouse
Muhammad Rameez
Muhammad Rameez
Muhammad Rameez
Follow
Aug 28 '22
Data Lake vs Data Warehouse
#
datascience
#
lake
#
difference
#
bigdata
9
reactions
Comments
Add Comment
3 min read
Spark tip: Disable Coalescing Post Shuffle Partitions for compute intensive tasks
Artem Plotnikov
Artem Plotnikov
Artem Plotnikov
Follow
Aug 26 '22
Spark tip: Disable Coalescing Post Shuffle Partitions for compute intensive tasks
#
spark
#
performance
#
bigdata
#
machinelearning
3
reactions
Comments
3
comments
3 min read
Stream Processing Introduction
ChunTing Wu
ChunTing Wu
ChunTing Wu
Follow
Aug 22 '22
Stream Processing Introduction
#
eventdriven
#
bigdata
#
tutorial
#
architecture
2
reactions
Comments
1
comment
6 min read
How to run Amazon EMR Serverless with --packages flag
Neylson Crepalde
Neylson Crepalde
Neylson Crepalde
Follow
for
AWS Community Builders
Aug 18 '22
How to run Amazon EMR Serverless with --packages flag
#
aws
#
bigdata
#
spark
#
emrserverless
8
reactions
Comments
2
comments
6 min read
The Relational DBs (RDB)
Augusto Valdivia
Augusto Valdivia
Augusto Valdivia
Follow
for
AWS Community Builders
Aug 14 '22
The Relational DBs (RDB)
#
database
#
aws
#
terraform
#
bigdata
12
reactions
Comments
2
comments
4 min read
The story behind Apache SeaTunnel’s evolving from a data integration component to an enterprise-level service
Apache SeaTunnel
Apache SeaTunnel
Apache SeaTunnel
Follow
Aug 10 '22
The story behind Apache SeaTunnel’s evolving from a data integration component to an enterprise-level service
#
bigdata
#
service
5
reactions
Comments
Add Comment
12 min read
Big Data Vs Small Data
Muhammad Rameez
Muhammad Rameez
Muhammad Rameez
Follow
Aug 5 '22
Big Data Vs Small Data
#
bigdata
#
smalldata
#
hadoop
#
datamining
7
reactions
Comments
1
comment
2 min read
Learning Workflow Schedulers (Oozie)
Ruikai Li
Ruikai Li
Ruikai Li
Follow
Jul 29 '22
Learning Workflow Schedulers (Oozie)
#
bigdata
#
datascience
#
dataengineering
2
reactions
Comments
Add Comment
5 min read
There will be 175 Zettabytes of data in the world by 2025. Where will we store it?
Augusto Valdivia
Augusto Valdivia
Augusto Valdivia
Follow
for
AWS Community Builders
Jul 18 '22
There will be 175 Zettabytes of data in the world by 2025. Where will we store it?
#
awsdatabases
#
terraform
#
bigdata
#
aws
18
reactions
Comments
2
comments
1 min read
How discord manage 300M socket connection
Abdulrahman S.
Abdulrahman S.
Abdulrahman S.
Follow
Jul 15 '22
How discord manage 300M socket connection
#
discord
#
algorithms
#
programming
#
bigdata
13
reactions
Comments
Add Comment
2 min read
Here is why you need a message broker
Memphis.dev team
Memphis.dev team
Memphis.dev team
Follow
for
Memphis.dev
Jul 7 '22
Here is why you need a message broker
#
beginners
#
architecture
#
opensource
#
bigdata
57
reactions
Comments
4
comments
7 min read
How to filter columns in HBase Shell
DataPotion
DataPotion
DataPotion
Follow
Jul 8 '22
How to filter columns in HBase Shell
#
database
#
nosql
#
bigdata
5
reactions
Comments
Add Comment
3 min read
Visual task orchestration & Drag & Drop, Scaleph Data integration practice based on SeaTunnel
Apache SeaTunnel
Apache SeaTunnel
Apache SeaTunnel
Follow
Jul 8 '22
Visual task orchestration & Drag & Drop, Scaleph Data integration practice based on SeaTunnel
#
datascience
#
bigdata
10
reactions
Comments
Add Comment
12 min read
The best Open-source lakehouse project, LakeSoul 2.0, supports snapshot, rollback, Flink, and Hive interconnection
DMetaSoul
DMetaSoul
DMetaSoul
Follow
Jul 8 '22
The best Open-source lakehouse project, LakeSoul 2.0, supports snapshot, rollback, Flink, and Hive interconnection
#
opensource
#
bigdata
#
database
#
datascience
9
reactions
Comments
Add Comment
5 min read
Creating a Subtitle Search Engine using the Stanford Parts of Speech Tagger
Paul Preibisch
Paul Preibisch
Paul Preibisch
Follow
Jun 2 '22
Creating a Subtitle Search Engine using the Stanford Parts of Speech Tagger
#
bigdata
#
elasticsearch
#
programming
3
reactions
Comments
Add Comment
4 min read
loading...
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account