DEV Community

Cover image for Data Engineering
Muhammed Jimoh
Muhammed Jimoh

Posted on

Data Engineering

Week5 of the Data Engineering Zoomcamp featuring Sejal Vaidya, Ankush Khanna, and Alexey Grigorev by #DataTalksClub.

Week 5 repo โžก๏ธ Github

๐Ÿ›  Tools
๐Ÿงต Apache Spark
๐Ÿงต Apache Hadoop
๐Ÿงต Google VMs

Apache Airflow

๐ŸนWeek 5 (Batch Processing) Summary:

๐ŸŽฏ Streaming vs. Batch Processing: Advantages and Disadvantages.
๐ŸŽฏ Theoretical and Practical understanding of Batching Processing.
๐ŸŽฏ Bash scripting
๐ŸŽฏ Anatomy of Spark
๐ŸŽฏ RDDS
๐ŸŽฏ Connecting Spark to BigQuery
๐ŸŽฏ Setting up Dataproc cluster

Apache Spark

What a beautiful Monday to begin the week.

Top comments (0)