DEV Community

The GeekNarrator

Tim Berglund on Realtime Analytics with Apache Pinot

Hey Everyone, In the 43rd episode I speak with Tim Berglund on Realtime Analytics with Apache Pinot. Chapters: 00:00 Introduction 01:22 What do we mean by analytics and realtime analytics? 05:35 Can we define realtime in millis, seconds or minutes? 08:54 What is the fundamental difference between traditional analytics systems and Apache Pinot? 12:19 Was Kafka one of the reasons Apache Pinot could reach its full potential? 16:50 E-commerce Application example - How do I get my data in? 20:07 How is data stored (structured) on the disk? 23:31 Are joins available in Apache Pinot? 26:07 Joins vs pre-computing at ingestion 27:15 How is historical data ingested into Apache Pinot? 28:14 Types of indexes available in Apache Pinot 35:42 Do indexes cause write amplification? Is that a problem in Apache Pinot? 40:02 Point lookups in Apache Pinot 42:54 Anamoly Detection 45:51 Coming up in Apache Pinot Links: StarTree https://startree.ai/ Apache Pinot: https://pinot.apache.org/ Joins in Pinot: https://startree.ai/blog/apache-pinot... Apache Pinot Indexes: https://docs.pinot.apache.org/basics/... Other playlists: Distributed systems:    • Distributed Syste...   Modern Databases:    • Modern Databases   Serverless Architecture:    • Serverless Archit...   Software Engineering:    • Software Engineering   I hope you like the episode. Like, share and subscribe to the channel. Cheers, The GeekNarrator

Episode source