DEV Community

The GeekNarrator

Batch vs Realtime Stream Processing - A Deep Dive with Phil Fried from Estuary

In this video I talk to Philip Fried from Estuary about Batch vs Realtime Stream Processing. Philip brings a ton of experience in the world of data processing and has shared some of the best practices in implementing these systems. We dive deep into the world of data processing, covering batch and streaming systems, their challenges, tradeoffs and use cases. Chapters: 00:00 Batch vs Realtime Stream Processing 03:25 What is Batch and Reatlime processing? 18:29 How does Batch and Realtime compare in terms of Latency and Throughput? 27:24 Where is the cost saving coming from? Compute?Storage? or Network? 31:38 Moving from Batch to Stream processing 37:50 How is Idempotency implemented in Streaming systems? 48:50 How do we approach Schema evolution in Batch and Streaming systems? 57:16 Summary - key points to keep in mind Do checkout Estuary if you deal with a ton of data, and don't want to deal with the painful operations, infrastructure management, schema migrations etc and only want to focus on building highly scalable and resilient applications. References: Estuary: https://estuary.dev/ Flow documentation: https://docs.estuary.dev If you like this video please hit the like button, share it with your network (whoever works with a ton of data) and subscribe to the channel. Feel free to watch related episodes in the playlist:    • Distributed Syste...   Modern Databases:    • Modern Databases   Software Engineering:    • Software Engineering   Distributed Systems:    • Distributed Systems   Cheers, The GeekNarrator

Episode source