DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
What's new in Apache Spark 3.3.0

What's new in Apache Spark 3.3.0

8
Comments 1
4 min read
A New One-stop AI development and production platform, AlphaIDE

A New One-stop AI development and production platform, AlphaIDE

10
Comments
4 min read
Solved a practical business problem when using Hudi: LakeSoul supports null field non-override semanticssemantics

Solved a practical business problem when using Hudi: LakeSoul supports null field non-override semanticssemantics

7
Comments
3 min read
What is the Lakehouse, the latest Direction of Big Data Architecture?

What is the Lakehouse, the latest Direction of Big Data Architecture?

9
Comments
10 min read
BigQuery transactions over multiple queries, with sessions

BigQuery transactions over multiple queries, with sessions

18
Comments 2
3 min read
Dynamic way doing ETL through Pyspark

Dynamic way doing ETL through Pyspark

16
Comments 2
4 min read
Auto discovering and auto actions in data monitoring or How to drink coffee instead of routine tasks

Auto discovering and auto actions in data monitoring or How to drink coffee instead of routine tasks

12
Comments
9 min read
Build a real-time machine learning sample library using the best open-source project about big data and data lakehouse, LakeSoul

Build a real-time machine learning sample library using the best open-source project about big data and data lakehouse, LakeSoul

11
Comments
7 min read
Leveraging Change Data Capture for Fraud Detection using Arcion Cloud

Leveraging Change Data Capture for Fraud Detection using Arcion Cloud

7
Comments
9 min read
How to prepare for the GCP Professional Data Engineer certification

How to prepare for the GCP Professional Data Engineer certification

35
Comments 9
8 min read
Apache Spark, Hive, and Spring Boot — Testing Guide

Apache Spark, Hive, and Spring Boot — Testing Guide

20
Comments 4
18 min read
Design concept of a best opensource project about big data and data lakehouse

Design concept of a best opensource project about big data and data lakehouse

9
Comments
9 min read
Details of 4 best opensource projects about big data you should try out(Ⅰ)

Details of 4 best opensource projects about big data you should try out(Ⅰ)

8
Comments
5 min read
HIVE installation on WSL

HIVE installation on WSL

13
Comments
3 min read
How to create a DIY Inexpensive Cloud Data Lake

How to create a DIY Inexpensive Cloud Data Lake

8
Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.