Week 5 of the Data Engineering Zoomcamp, featuring Sejal Vaidya, Ankush Khanna, and Alexey Grigorev, by #DataTalksClub.
Week 5 repo ➡️ GitHub
🛠 Tools
🧵 Apache Spark
🧵 Apache Hadoop
🧵 Google VMs
🔹 Week 5 (Batch Processing) Summary:
🎯 Streaming vs. Batch Processing: Advantages and Disadvantages
🎯 Theoretical and Practical Understanding of Batch Processing
🎯 Bash Scripting
🎯 Anatomy of Spark
🎯 RDDs
🎯 Connecting Spark to BigQuery
🎯 Setting Up a Dataproc Cluster
What a beautiful Monday to begin the week.