DEV Community

Adventures in Machine Learning

Apache Spark Integration and Platform Execution for ML - ML 073

Apache Spark is a lightning-fast unified analytics engine for large-scale data processing and machine learning. In this episode, Ben and Michael unpack Spark by ping-ponging questions and answers, supplemented by various examples applicable to machine learning workflows.

In this Episode…

  1. How does Spark work?
  2. What makes Apache Spark effective?
  3. Dot repartition in Spark
  4. Parallel processing systems
  5. What is an aggregation in Spark sequel?
  6. Analytics with Spark
  7. What is MPP?
  8. Testing for production
  9. Spark algorithms

Sponsors

Sponsored By:

Episode source