DEV Community

rajubora
rajubora

Posted on

Apache Kafka :

Apache Kafka is an open-source distributed system consisting of servers and clients, which is created by linkedin. Apache Kafka is used by thousands of the world's leading organizations for high-performance data pipelines, streaming analytics, data integration and many other vital applications. Apache Kafka act as an communicaiton/transportation tool between the source system to target system with the following use cases :

  1. Messaging systems
  2. Activity Tracking
  3. Application logs analysis
  4. Decoupling of system dependencies
  5. Integration with Big Data technologies like Spark, Flink, Storm, Hdoop.

Why apache kafka :

  1. Distributed , resilient architecture , fault tolerance .
  2. Horizontal scalability :
    • can scale to 100s of brokers .
    • can scale to millions of messages per second .
  3. High performance latency of less then 10ms in real time.

following are the companies name that uses apache kafka for real time solutions :

1) Netflix uses kafka to apply recommendations in real time while
you are watching TV shows .

2) Uber uses kafka to gather user , taxi and trip data in real
time to compute and forcast demand and compute pricing in real
time .
3) Linkedin uses kafka to prevent spam , collect user interactions
to make better connection recommendation in real time .

Top comments (0)