Apache Spark Bitesize Series

Adi Polak ・1 min read

Want to learn Apache Spark?
Stream Processing, Analytics, and Machine Learning?

This blog post series is for you!

Apache Spark is an open-source, distributed, general-purpose cluster-computing framework. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

It's time to light up the Spark:

The Apache Spark bitesize series is built for busy people.

Each post will cover one of the three most important areas in working with data technologies and challenges today:

Stream Processing

Analytics

Machine Learning

Every post will take at most 2 minutes to read and is paired with longer tutorials and hands-on workshops for when you can spare the time.

Want to get updates on new bitesize posts?
Follow me here and on Twitter.

Have a question for me? Comment or send a DM.

Discussion (1)

gregory_kramer

I'm in! I have set a personal goal of getting this 'kitchen sink' example cooking for my own edification, and Spark is a piece of the puzzle. So, chipping away at Spark sounds like just what the doctor ordered!