DEV Community

# bigdata

Posts

Apache Spark, Hive, and Spring Boot — Testing Guide

18
Comments 4
18 min read
Design concept of a best opensource project about big data and data lakehouse

9
Comments
9 min read
How to prepare for the GCP Professional Data Engineer certification

35
Comments 6
8 min read
Details of 4 best opensource projects about big data you should try out(Ⅰ)

8
Comments
5 min read
HIVE installation on WSL

12
Comments
3 min read
How to create a DIY Inexpensive Cloud Data Lake

8
Comments
3 min read
Create a Hadoop playground with Docker Desktop on Windows in minutes

10
Comments
4 min read
Quick use of CDC: A new demo from lakesoul makes it easier to set up the environment

8
Comments
5 min read
Big Data in Cloud Computing - AWS

14
Comments
2 min read
4 best opensource projects about big data you should try out

16
Comments 3
3 min read
A new unified streaming and batch table storage solution similar to iceberg/hudi/delta lake but with several new functions

8
Comments
2 min read
[OPINION] Building a Career as a Data Engineer

2
Comments
2 min read
Characteristics of Big Data

4
Comments
8 min read
Apache Spark Unit Testing Strategies

9
Comments
1 min read
NodeJS - Get data from Redash v6 API

6
Comments
2 min read
Building an Apache ECharts dashboard with React and Cube

14
Comments
11 min read
[TIP] Enter the world of Data Engineering with Brazilian professionals who have become references in the field!

6
Comments
2 min read
What are the best practices while using BigQuery?

11
Comments
2 min read
Building a Bubble Dashboard with Cube

9
Comments
14 min read
[ARTICLE] Data Warehouse, Data Lake, and Data Lakehouse: Concepts and Differences

6
Comments
3 min read
Fast Multivalue Look-ups For Huge Data Sets

6
Comments
6 min read
Dagster: The Best Free and Open-Source Alternative to Airflow With Python!

5
Comments
1 min read
What is the SingleStore and why should we use it?

12
Comments 2
3 min read
How to handle nested JSON with Apache Spark

3
Comments
3 min read
Machine Learning Lifecycle Process

45
Comments
4 min read