DEV Community

# spark

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Exploring the Netflix TV Shows and Movies Dataset with Spark

Exploring the Netflix TV Shows and Movies Dataset with Spark

Comments
2 min read
Gravitino 0.5.0: Expanding the horizon to Apache Spark, non-tabular data, and more!

Gravitino 0.5.0: Expanding the horizon to Apache Spark, non-tabular data, and more!

1
Comments
7 min read
Spark & Scala Cache Lessons from ETL Project

Spark & Scala Cache Lessons from ETL Project

2
Comments 1
3 min read
Adaptive Partition Estimation in Distributed Dataflows: A Machine Learning Approach for Spark

Adaptive Partition Estimation in Distributed Dataflows: A Machine Learning Approach for Spark

Comments
4 min read
Big Data Fundamentals: spark

Big Data Fundamentals: spark

Comments
6 min read
Building a Real-Time Healthcare Data Pipeline with Apache Spark: From SQS to Parquet (Part 2)

Building a Real-Time Healthcare Data Pipeline with Apache Spark: From SQS to Parquet (Part 2)

Comments
8 min read
Use DolphinScheduler to schedule Spark jobs

Use DolphinScheduler to schedule Spark jobs

1
Comments
6 min read
🚀 Docker + Spark on Kubernetes: Build Tiny Custom Executors in Minutes (2025)

🚀 Docker + Spark on Kubernetes: Build Tiny Custom Executors in Minutes (2025)

Comments
1 min read
Setting Up IOMete: A Cloud-Independent Data Platform Based on Spark

Setting Up IOMete: A Cloud-Independent Data Platform Based on Spark

1
Comments 1
6 min read
Building a YouTube Channel Analytics Dashboard with Airflow, Spark, and Grafana

Building a YouTube Channel Analytics Dashboard with Airflow, Spark, and Grafana

Comments
8 min read
Big Data Processing - Case Study 2 (Spark) 01:52

Big Data Processing - Case Study 2 (Spark)

Comments
1 min read
Complete Beginner's Guide: Building a Weather ETL Pipeline with PySpark

Complete Beginner's Guide: Building a Weather ETL Pipeline with PySpark

2
Comments 1
5 min read
Spark On Kubernetes

Spark On Kubernetes

1
Comments
4 min read
Big Data Processing - Case Study 4 (Spark) 02:36

Big Data Processing - Case Study 4 (Spark)

Comments
1 min read
Big Data Processing - Case Study 3 (Spark) 02:35

Big Data Processing - Case Study 3 (Spark)

Comments
1 min read
Big Data Processing - Case Study 1 (Spark) 01:32

Big Data Processing - Case Study 1 (Spark)

Comments
1 min read
How to treat secure data on lakehouse

How to treat secure data on lakehouse

1
Comments
3 min read
Tiny URL Design

Tiny URL Design

Comments
10 min read
Automatizando a Qualidade de Dados com DQX: Performance e praticidade

Automatizando a Qualidade de Dados com DQX: Performance e praticidade

Comments
5 min read
Azure Data Engineering Books from #Techtter YT Channel

Azure Data Engineering Books from #Techtter YT Channel

Comments
1 min read
AWS Glue vs AWS Lambda: Comparativa Serverless para Ingeniería de Datos en AWS

AWS Glue vs AWS Lambda: Comparativa Serverless para Ingeniería de Datos en AWS

1
Comments
4 min read
Big Boost for Flink & Spark SQL: Both Tools Just Got Updated!

Big Boost for Flink & Spark SQL: Both Tools Just Got Updated!

Comments
1 min read
Designing a Scalable Shuffle Service for Big Data on AWS

Designing a Scalable Shuffle Service for Big Data on AWS

Comments
3 min read
Study Notes 5.1.1-2 Introduction to Batch Processing & spark

Study Notes 5.1.1-2 Introduction to Batch Processing & spark

1
Comments
6 min read
Study Notes 5.6.1-2 Spark on cloud & local

Study Notes 5.6.1-2 Spark on cloud & local

1
Comments
7 min read
loading...