DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Quick use of CDC: A new demo from lakesoul makes it easier to set up the environment

Quick use of CDC: A new demo from lakesoul makes it easier to set up the environment

8
Comments
5 min read
Big Data in Cloud Computing - AWS

Big Data in Cloud Computing - AWS

14
Comments
2 min read
4 best opensource projects about big data you should try out

4 best opensource projects about big data you should try out

16
Comments 3
3 min read
A new unified streaming and batch table storage solution similar to iceberg/hudi/delta lake but with several new functions

A new unified streaming and batch table storage solution similar to iceberg/hudi/delta lake but with several new functions

8
Comments
2 min read
[OPINIÃO] Construindo uma Carreira como Data Engineer

[OPINIÃO] Construindo uma Carreira como Data Engineer

2
Comments
2 min read
Characteristics of Big Data

Characteristics of Big Data

4
Comments
8 min read
Apache Spark Unit Testing Strategies

Apache Spark Unit Testing Strategies

9
Comments
1 min read
NodeJS - Get data from Redash v6 API

NodeJS - Get data from Redash v6 API

6
Comments
2 min read
Building an Apache ECharts dashboard with React and Cube

Building an Apache ECharts dashboard with React and Cube

14
Comments
11 min read
[DICA] Adentre o universo da Engenharia de Dados com profissionais brasileiros que se tornaram referência na área!

[DICA] Adentre o universo da Engenharia de Dados com profissionais brasileiros que se tornaram referência na área!

6
Comments
2 min read
What are the best practices while using BigQuery?

What are the best practices while using BigQuery?

11
Comments
2 min read
Building a Bubble Dashboard with Cube

Building a Bubble Dashboard with Cube

9
Comments
14 min read
[ARTIGO] Data Warehouse, Data Lake e Data Lakehouse: Conceitos e Diferenças

[ARTIGO] Data Warehouse, Data Lake e Data Lakehouse: Conceitos e Diferenças

6
Comments
3 min read
Fast Multivalue Look-ups For Huge Data Sets

Fast Multivalue Look-ups For Huge Data Sets

6
Comments
6 min read
Dagster: The Best Free and Open-Source Alternative to Airflow With Python!

Dagster: The Best Free and Open-Source Alternative to Airflow With Python!

5
Comments
1 min read
What is the SingleStore and why should we use it?

What is the SingleStore and why should we use it?

12
Comments 2
3 min read
How to handle nested JSON with Apache Spark

How to handle nested JSON with Apache Spark

3
Comments
3 min read
Machine Learning Lifecycle Process

Machine Learning Lifecycle Process

45
Comments
4 min read
Quill- Most efficient Scala driver for Apache Cassandra and Spark

Quill- Most efficient Scala driver for Apache Cassandra and Spark

2
Comments
4 min read
Presenting ML-based COVID-19 Risk Assessment App Pandemonium

Presenting ML-based COVID-19 Risk Assessment App Pandemonium

4
Comments
3 min read
Cleaning And Normalizing Data Using AWS Glue DataBrew

Cleaning And Normalizing Data Using AWS Glue DataBrew

14
Comments 3
9 min read
Introduction to Apache Spark, SparkQL, and Spark MLib.

Introduction to Apache Spark, SparkQL, and Spark MLib.

12
Comments
15 min read
Data Lake explained

Data Lake explained

6
Comments
4 min read
Introduction to Hive(A SQL layer above Hadoop)

Introduction to Hive(A SQL layer above Hadoop)

8
Comments
9 min read
Build a small TA-Lib container image

Build a small TA-Lib container image

3
Comments
2 min read
loading...