Saksham Paliwal

AWS Kinesis: What It Is and Why It Exists

You're building something.

Maybe it's a web app. Maybe it's an analytics dashboard. Maybe it's just a service that needs to log some events.

And then someone on your team says, "we should use Kinesis for this."

And you're like... what? Why? We already have databases. We have queues. We have S3. Why do we need another AWS service?

Yeah, I've been there.

Let me walk you through this.

Why does Kinesis even exist?

Here's the thing.

Around 2013, companies like Netflix and Amazon were dealing with a very specific problem. They had millions of users generating data constantly. Clicks, views, searches, purchases, errors, all happening at the same time.

They needed to process this data as it arrived. Not in batches. Not overnight. Right now.

Traditional databases? Too slow. They're built for storing and querying, not for handling thousands of writes per second from different sources.

Message queues like SQS? Better, but they're designed for job processing, not for streaming massive amounts of continuous data to multiple consumers at once.

So AWS built Kinesis.

It was inspired by Apache Kafka (which came out earlier), but made simpler and fully managed for AWS users.

The core idea was simple: give developers a way to ingest, buffer, and process real-time streaming data without managing servers or worrying about scale.

What actually is Kinesis?

Think of Kinesis as a super fast conveyor belt for data.

You put data onto the belt (producers send records). The belt moves continuously. Multiple teams can watch the belt and grab what they need (consumers read records). The belt keeps moving.

That's it.

More technically, Kinesis is a managed service that lets you collect, process, and analyze streaming data in real time.

It's not a database. It's not a queue in the traditional sense. It's a stream.

Wait, what's the difference between a stream and a queue?

Good question!!!

A queue like SQS is meant for one-to-one delivery. You send a message, one consumer picks it up, it's gone.

A stream like Kinesis is meant for one-to-many delivery. You send a record, it stays in the stream for a while (24 hours by default, up to 365 days if you configure it). Multiple consumers can read the same record independently.

Also, streams preserve order within a partition. Queues don't guarantee order unless you use FIFO queues with extra config.

Streams are for high-throughput, real-time data pipelines. Queues are for task distribution and decoupling services.

When do people actually use Kinesis?

Real-time analytics is a big one.

Let's say you're building a gaming app. You want to track every player action, analyze patterns, detect cheating, update leaderboards, all in real time. Kinesis can ingest millions of events per second and let different services consume that data simultaneously.

Log and event data collection is another common use case.

Instead of writing logs directly to S3 or CloudWatch (which can get expensive or slow), you stream logs to Kinesis. Then you can fan out to multiple destinations: one consumer writes to S3 for long-term storage, another sends to Elasticsearch for searching, another triggers Lambda functions for alerts.

IoT data ingestion also fits perfectly.

Thousands of devices sending sensor data every second? Kinesis handles it. You can process the data in real time, store it, run machine learning models on it, whatever you need.

Clickstream analysis for websites is super common too.

Every click, scroll, hover gets sent to Kinesis. Your analytics team reads the stream to build dashboards. Your recommendation engine reads the same stream to personalize content. Your data science team reads it to train models.

One stream, multiple consumers, all happening live.

The different flavors of Kinesis

AWS actually has a few different Kinesis services, which is confusing at first.

Kinesis Data Streams is the core service. This is what most people mean when they say "Kinesis." You manage capacity (shards), you control retention, you write producers and consumers.

Kinesis Data Firehose is the simpler version. You just point it at a destination (S3, Redshift, Elasticsearch, etc.), and it automatically delivers your streaming data there. No consumers to write. Great for simple ETL pipelines.
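To give you a feel for how little producer code Firehose needs, here's a minimal sketch with boto3. The delivery stream name is made up; it assumes someone already created a Firehose delivery stream pointed at S3.

```python
import json
import boto3

# Assumes a Firehose delivery stream named "signup-events-to-s3" already
# exists and is configured to deliver into an S3 bucket.
firehose = boto3.client("firehose")

event = {"type": "signup", "user_id": "u-123", "plan": "free"}

# One call per record. Firehose batches, buffers, and delivers to the
# destination for you; the trailing newline is added because Firehose
# doesn't insert record delimiters on its own.
firehose.put_record(
    DeliveryStreamName="signup-events-to-s3",
    Record={"Data": (json.dumps(event) + "\n").encode("utf-8")},
)
```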

Kinesis Data Analytics lets you run SQL queries on streaming data. Useful if you want to do transformations or aggregations in real time without writing code.

Kinesis Video Streams is for video, which is a whole different thing. Not relevant for most backend use cases.

For now, just know Data Streams exists. That's the foundation.

A super basic example

Let's say you're tracking user sign-ups.

You could write sign-up events directly to a database. But what if you also want to send a welcome email, update analytics, sync to a CRM, and trigger a Slack notification?

You'd have to call all those services from your sign-up endpoint. If one fails, you have to handle retries. If you add a new integration, you have to modify the sign-up code.

With Kinesis, you just write the sign-up event to the stream. Done.
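Here's roughly what that looks like with boto3. This is a sketch, not a drop-in implementation: the stream name `signups` and the event fields are placeholders.

```python
import json
import boto3

kinesis = boto3.client("kinesis")

signup_event = {
    "type": "user_signed_up",
    "user_id": "u-42",
    "email": "new.user@example.com",
}

# The sign-up endpoint does exactly one thing: put the event on the stream.
# PartitionKey decides which shard the record lands on; using the user ID
# keeps all events for the same user in order relative to each other.
kinesis.put_record(
    StreamName="signups",
    Data=json.dumps(signup_event).encode("utf-8"),
    PartitionKey=signup_event["user_id"],
)
```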

Then you have separate consumers: one Lambda function sends the email, another updates analytics, another syncs to your CRM, another posts to Slack.

Each consumer is independent. If one breaks, the others keep working. The stream keeps the data for 24 hours (or longer), so even if a consumer is down, it can catch up later.
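The email consumer, for example, can be a tiny Lambda function subscribed to the stream. A sketch, assuming the same event shape as the producer above and a hypothetical send_welcome_email helper:

```python
import base64
import json


def send_welcome_email(email: str) -> None:
    # Placeholder for whatever email service you actually use.
    print(f"Sending welcome email to {email}")


def handler(event, context):
    # Lambda delivers Kinesis records in batches; each record's data
    # arrives base64-encoded.
    for record in event["Records"]:
        payload = json.loads(base64.b64decode(record["kinesis"]["data"]))
        if payload.get("type") == "user_signed_up":
            send_welcome_email(payload["email"])
```

The analytics, CRM, and Slack consumers would look the same, just doing something different with the payload.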

Decoupled, scalable, resilient.

How does it actually work under the hood?

Kinesis Data Streams is made up of shards.

A shard is basically a unit of capacity. Each shard can handle 1 MB/sec of writes and 2 MB/sec of reads.
(On the write side that's also capped at 1,000 records per second per shard, whichever limit you hit first.)

If you need more throughput, you add more shards. AWS handles the infrastructure.
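Creating a stream is just picking a name and a shard count. A quick sketch (the name and count are arbitrary):

```python
import boto3

kinesis = boto3.client("kinesis")

# Two shards gives roughly 2 MB/sec of write capacity and 4 MB/sec of reads.
kinesis.create_stream(StreamName="clickstream", ShardCount=2)
```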

When you write a record to Kinesis, you specify a partition key. Kinesis hashes that key to decide which shard gets the record.

Records with the same partition key always go to the same shard, which means they're ordered relative to each other.

Consumers read from shards and process records in order within each shard.
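In practice most people consume through Lambda or the KCL rather than reading shards by hand, but the low-level API makes the model concrete. A sketch that reads one shard of the hypothetical `clickstream` stream from its oldest record, assuming the records contain JSON:

```python
import json
import boto3

kinesis = boto3.client("kinesis")

# Look up the stream's shards, then read the first one from the beginning.
shards = kinesis.list_shards(StreamName="clickstream")["Shards"]
iterator = kinesis.get_shard_iterator(
    StreamName="clickstream",
    ShardId=shards[0]["ShardId"],
    ShardIteratorType="TRIM_HORIZON",  # start at the oldest available record
)["ShardIterator"]

response = kinesis.get_records(ShardIterator=iterator, Limit=100)

# Records come back in the order they were written to this shard.
for record in response["Records"]:
    print(record["SequenceNumber"], json.loads(record["Data"]))
```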

You don't have to think about this too much at first, but it's good to know.

The catch (because there's always a catch)

Kinesis isn't free.

You pay per shard-hour, plus data ingestion and retrieval costs. If you're processing a lot of data, it adds up.

You also have to manage shard scaling. If your traffic spikes, you might need to manually increase shards (or set up auto-scaling).
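Resharding itself is one API call, though the splits and merges happen behind the scenes and take a little time. A sketch, with a placeholder stream name and target count:

```python
import boto3

kinesis = boto3.client("kinesis")

# Double capacity by going from 2 shards to 4. UNIFORM_SCALING is the only
# scaling type this API accepts.
kinesis.update_shard_count(
    StreamName="clickstream",
    TargetShardCount=4,
    ScalingType="UNIFORM_SCALING",
)
```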

There's also a learning curve.

Writing a producer is pretty straightforward. But building a reliable consumer that handles retries, checkpointing, and shard rebalancing? That takes some work. AWS provides the Kinesis Client Library (KCL) to help, but it's still more complex than, say, using SQS.

And if you don't need real-time processing, Kinesis might be overkill.

If batch processing once an hour is fine, just write to S3 and process later. Simpler, cheaper.

So when should I actually use it?

Use Kinesis when:

  • You need to process data in real time or near real time
  • You have multiple consumers that need the same data
  • You need to preserve order within a partition
  • You're dealing with high-throughput streaming data

Don't use Kinesis when:

  • Batch processing is good enough
  • You only have one consumer (use SQS instead)
  • You need long-term storage as the primary goal (use S3)
  • You're just getting started and want the simplest solution (start simple, add Kinesis later if you need it)

What about Kafka?

Yeah, Kafka is the open-source equivalent.

Kinesis is easier to set up (fully managed), but Kafka gives you more control and can be cheaper at scale if you run it yourself (or use a managed service like Confluent or MSK).

If you're already deep in the AWS ecosystem, Kinesis is probably easier.

If you need multi-cloud or have very specific requirements, Kafka might be better.

Honestly, the concepts are similar enough that learning one helps you understand the other.


It's one of those tools that makes way more sense once you hit a specific problem. You'll know when you need it because you'll be sitting there trying to process a flood of real-time events and thinking, "there has to be a better way."

That's when you reach for Kinesis.

Until then? Just know it exists. Know roughly what it does. And when the time comes, you'll know where to look.

You're doing great. Keep building, keep learning, and don't stress about knowing every AWS service by heart. Nobody does!!!
