Apache Kafka is an open-source distributed system consisting of servers and clients, which is created by linkedin. Apache Kafka is used by thousands of the world's leading organizations for high-performance data pipelines, streaming analytics, data integration and many other vital applications. Apache Kafka act as an communicaiton/transportation tool between the source system to target system with the following use cases :
- Messaging systems
- Activity Tracking
- Application logs analysis
- Decoupling of system dependencies
- Integration with Big Data technologies like Spark, Flink, Storm, Hdoop.
Why apache kafka :
- Distributed , resilient architecture , fault tolerance .
- Horizontal scalability :
- can scale to 100s of brokers .
- can scale to millions of messages per second .
- High performance latency of less then 10ms in real time.
following are the companies name that uses apache kafka for real time solutions :
1) Netflix uses kafka to apply recommendations in real time while
you are watching TV shows .
2) Uber uses kafka to gather user , taxi and trip data in real
time to compute and forcast demand and compute pricing in real
time .
3) Linkedin uses kafka to prevent spam , collect user interactions
to make better connection recommendation in real time .
Top comments (0)