DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Cleaning And Normalizing Data Using AWS Glue DataBrew

Cleaning And Normalizing Data Using AWS Glue DataBrew

14
Comments 3
9 min read
Introduction to Apache Spark, SparkQL, and Spark MLib.

Introduction to Apache Spark, SparkQL, and Spark MLib.

12
Comments
15 min read
Data Lake explained

Data Lake explained

6
Comments
4 min read
Introduction to Hive(A SQL layer above Hadoop)

Introduction to Hive(A SQL layer above Hadoop)

8
Comments
9 min read
Build a small TA-Lib container image

Build a small TA-Lib container image

3
Comments
2 min read
SPOTLIGHT: A GENTLE INTRODUCTION TO MACHINE LEARNING CONCEPTS IN PYTHON

SPOTLIGHT: A GENTLE INTRODUCTION TO MACHINE LEARNING CONCEPTS IN PYTHON

5
Comments
5 min read
How to choose a MongoDB shard key

How to choose a MongoDB shard key

8
Comments 1
3 min read
Big Data Open Source Frameworks

Big Data Open Source Frameworks

3
Comments
5 min read
Scala Vs Python Syntax Cheat Sheet

Scala Vs Python Syntax Cheat Sheet

4
Comments
5 min read
Scala For Beginners - Crash Course - Part 3

Scala For Beginners - Crash Course - Part 3

3
Comments
6 min read
Scala For Beginners - Crash Course - Part 2

Scala For Beginners - Crash Course - Part 2

3
Comments
6 min read
Scala For Beginners - Crash Course - Part 5

Scala For Beginners - Crash Course - Part 5

4
Comments
6 min read
Scala For Beginners - Crash Course - Part 4

Scala For Beginners - Crash Course - Part 4

3
Comments
4 min read
Django + Mongodb works slowly

Django + Mongodb works slowly

1
Comments
1 min read
Creating and running Spark Jobs in Scala on Cloud Dataproc !!!

Creating and running Spark Jobs in Scala on Cloud Dataproc !!!

7
Comments
3 min read
Getting started with Spark

Getting started with Spark

12
Comments 2
6 min read
The World Beyond the Docker! $$ :)

The World Beyond the Docker! $$ :)

5
Comments
2 min read
Airbyte: Data Integration / CDC Solution for Modern Data Teams!

Airbyte: Data Integration / CDC Solution for Modern Data Teams!

6
Comments
12 min read
Best extensions for JupyterLab!!

Best extensions for JupyterLab!!

7
Comments
3 min read
Vitess: Easy database deployment, clustering, and scaling!

Vitess: Easy database deployment, clustering, and scaling!

7
Comments
5 min read
Zero to Deployment and Evolution Data Catalog!

Zero to Deployment and Evolution Data Catalog!

4
Comments
6 min read
Build an analytics app with React and Cube.js

Build an analytics app with React and Cube.js

8
Comments
9 min read
Cardinality Counting in Redis

Cardinality Counting in Redis

3
Comments
4 min read
Cube Cloud Deep Dive: Mastering Pre-Aggregations

Cube Cloud Deep Dive: Mastering Pre-Aggregations

6
Comments
11 min read
When Big Data Goes Bad: Rehabilitating Data Quality

When Big Data Goes Bad: Rehabilitating Data Quality

Comments
9 min read
loading...