DEV Community

# spark

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Why I Built a Spark-Native LLM Evaluation Framework

Why I Built a Spark-Native LLM Evaluation Framework

Comments
9 min read
🔥 Day 5: Introduction to DataFrames - The Most Importantce of Spark API

🔥 Day 5: Introduction to DataFrames - The Most Importantce of Spark API

Comments
2 min read
Fixing PySpark on Windows: Downgrading from Python 3.13 to 3.11 (Complete Guide)

Fixing PySpark on Windows: Downgrading from Python 3.13 to 3.11 (Complete Guide)

Comments
3 min read
Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study

Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study

Comments
5 min read
Node4J — Call Java Code Natively from Node.js

Node4J — Call Java Code Natively from Node.js

1
Comments
3 min read
Day 12: UDF vs Pandas UDF

Day 12: UDF vs Pandas UDF

Comments
2 min read
Day 9: Spark SQL Deep Dive - Temp Views, Query Execution & Optimization Tips for Data Engineers

Day 9: Spark SQL Deep Dive - Temp Views, Query Execution & Optimization Tips for Data Engineers

Comments
2 min read
Exploring Brazilian E-commerce with Spark on Databricks Free Edition

Exploring Brazilian E-commerce with Spark on Databricks Free Edition

2
Comments
4 min read
Spark & Scala Cache Lessons from ETL Project

Spark & Scala Cache Lessons from ETL Project

2
Comments 1
3 min read
Exploring the Netflix TV Shows and Movies Dataset with Spark

Exploring the Netflix TV Shows and Movies Dataset with Spark

Comments
2 min read
Apache Spark vs. Apache Kafka: The "Brain" and the "Nervous System" of Big Data

Apache Spark vs. Apache Kafka: The "Brain" and the "Nervous System" of Big Data

5
Comments
3 min read
Fixing PySpark “Cannot run program python3” Error on Windows

Fixing PySpark “Cannot run program python3” Error on Windows

Comments
3 min read
End-to-End Real-Time Data Engineering on Databricks Using Spark Structured Streaming and Delta Lake

End-to-End Real-Time Data Engineering on Databricks Using Spark Structured Streaming and Delta Lake

1
Comments
1 min read
🚀 Day 1: Introduction to Apache Spark

🚀 Day 1: Introduction to Apache Spark

1
Comments
2 min read
🔥 Day 6: Essential PySpark DataFrame Transformations

🔥 Day 6: Essential PySpark DataFrame Transformations

Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.