Sarbik Betal
Task queues and why we need them

Cover photo: Β©Unsplash/Camille Chen

Some Background:

What is a task queue and why do you need it?

Analogy

Well, to answer that question, let's consider a scenario.
There's a restaurant with several employees (say 10): waiters, chefs, a cashier, a receptionist, a manager, and so on. Now recall what happens in a restaurant when you place your order.

  • You say what you want πŸ—£οΈ. (Request)
  • The waiter notes it down πŸ“„ and assures you that your food will be ready in a while πŸ›ŽοΈ. (Acknowledge)
  • The waiter passes your order to a chef πŸ§‘β€πŸ³, who adds it to the list of orders. (Enqueue)
  • The waiter then goes to take the order of another customer πŸ‘₯. (Next request)
  • Multiple chefs prepare food πŸ₯ͺ from the list of orders, one by one or several at a time βŒ›. (Process)
  • After a while, when your food is ready, the chef calls the waiter and hands over the food πŸ₯ͺ. (Dequeue)
  • The waiter comes and serves you the food πŸ˜‹. (Response)
  • The waiter then moves on to another customer. (Next request)

The waiter and the chef are decoupled from each other: the waiter takes orders and the chef prepares food independently.

Now imagine the same scenario, but where every employee is capable of doing every kind of job (taking orders, cooking, etc.).
In that case, the workflow would change to something like this.

  • A waiter arrives, takes your order πŸ“„ and tells you that your food will be ready.
  • The same waiter goes to the kitchen πŸƒ with your order and starts preparing it πŸ§‘β€πŸ³.
  • When the food is ready, the same waiter comes back πŸƒ and serves it to you πŸ₯ͺ.

You might not see much of a problem here. But think again: the restaurant has only 10 employees. What would happen if 20 or 25 customers were waiting to order food?
The former way of handling orders would deal with the pressure easily. The latter would simply break down 🚧, because if all the employees are busy preparing food for the first 10 customers, who πŸ‘» is going to take orders from the remaining ones? And if new customers are not attended to within a few minutes, they will surely leave 😠.

Where do we need them?

When we build web applications/services πŸ–₯️ that do heavy lifting on the server (anything that takes more than a few milliseconds) or run long jobs ⏱️, such as complex calculations, file handling or data analysis, as opposed to simple CRUD operations, we should use a task queue. You can think of it as asynchrony (like Promises or async/await in JS) taken to the next level. The API server enqueues the task, immediately sends the client some kind of acknowledgement, and moves on to the next request before the actual processing βš™οΈ happens (like the waiter). Another server (or the same server, which spins off a separate worker instance/process) checks the list πŸ“ƒ for pending tasks and processes them (like the chef). Once a job is done, the worker notifies the API server, which tells the client that the job is complete βœ”οΈ (through web-sockets, push notifications, emails or whatever implementation you can think of).
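The waiter/chef split just described can be sketched in a few lines of JavaScript. This is only an illustration, not production code: a plain array stands in for a real job queue (which would normally live in something like Redis), and `handleRequest`/`processNextJob` are hypothetical names.

```javascript
const queue = []; // stand-in for a real job queue (e.g. one backed by Redis)

// "Waiter": enqueue the job and acknowledge the client immediately.
function handleRequest(order) {
  const job = { id: Math.random().toString(36).slice(2, 9), order, status: 'queued' };
  queue.push(job); // enqueue
  return { job: 'conversion', id: job.id, status: 'ok' }; // immediate acknowledgement
}

// "Chef": runs independently of the request/response cycle.
function processNextJob() {
  const job = queue.shift(); // dequeue in FIFO order
  if (!job) return null;
  job.status = 'done'; // the heavy lifting would happen here
  return job;
}
```

Note that `handleRequest` returns before any processing happens; the client only gets an id it can use to check on the job later.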

If, instead, the API server processes the job in one go (like the restaurant in the second case), things get really sluggish ⏱️: the server takes your request, does the heavy lifting πŸ‹οΈ (which takes time) and only then responds. The client has to wait for the entire operation to complete, and the browser keeps loading 🐌 until the server sends the response. Any request that arrives in between has to wait for the server to finish the first one before it is even addressed. Now imagine the same situation with thousands of requests per second: everything would become painfully slow, and you can imagine the result would be a very bad UX πŸ™….
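For contrast, here is a sketch of the one-go approach, where the handler does the heavy lifting itself before responding. `heavyLifting` is a made-up stand-in for any slow, CPU-bound work; while it spins, this thread can serve no one else.

```javascript
// Simulate a slow, blocking computation (here: ~50 ms of busy work).
function heavyLifting(order) {
  const end = Date.now() + 50;
  while (Date.now() < end) { /* spin */ }
  return `processed ${order}`;
}

// The client waits here for the whole duration of the work.
function handleRequestBlocking(order) {
  return { status: 'done', result: heavyLifting(order) };
}
```

In a single-threaded Node.js server, every other request queues up behind this busy loop, which is exactly the "all employees are cooking" failure mode from the analogy.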

How do we make it work?

Before getting into the details of using a task queue, let me introduce some terms that will be used extensively throughout this series.

  • Queue - Like an actual queue, it groups similar jobs/tasks together, waiting to be processed by a worker in FIFO (first in, first out) order.
  • Jobs/Tasks - The objects that contain the actual details about the work waiting to be processed.
  • Publisher - The one who adds a task to a queue.
  • Consumer - Watches the job queue for pending jobs and sends them for processing.
  • Worker - The actual powerhouse that processes jobs and reports whether they succeeded or failed. The worker logic can live inside the consumer if you wish.
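The five terms above can be made concrete with a tiny in-memory sketch. Everything here is hypothetical and for illustration only; in a real system the queue would live in a broker such as Redis or RabbitMQ, not in a JavaScript array.

```javascript
class Queue {
  constructor() { this.jobs = []; }     // Queue: FIFO list of jobs
  add(job) { this.jobs.push(job); }     // called by the publisher
  take() { return this.jobs.shift(); }  // called by the consumer
}

const emailQueue = new Queue();

// Publisher: adds a task to the queue.
function publish(payload) {
  emailQueue.add({ payload, status: 'pending' }); // a job/task object
}

// Worker: does the actual processing and records success or failure.
function work(job) {
  try {
    job.result = `sent to ${job.payload.to}`; // pretend this is the heavy lifting
    job.status = 'succeeded';
  } catch (err) {
    job.status = 'failed';
    job.reason = err.message;
  }
  return job;
}

// Consumer: watches the queue and hands pending jobs to the worker.
function consumeOne() {
  const job = emailQueue.take();
  return job ? work(job) : null;
}
```

Here the worker logic is housed inside the consumer's call chain, which is the simpler of the two arrangements mentioned above.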

A simple task queue diagram
Working of a task queue. Β© Miguel Grinberg

Now that you have a basic overview, let's get into the details.

  • First we set up an API server with some endpoints which would respond to the client's HTTP requests.
  • The API server publishes the job to its respective queue and sends some kind of acknowledgement to the client like
{
  "job": "conversion",
  "id": "dcj32q3",
  "status": "ok"
}

or in case it fails

{
  "job": "conversion",
  "id": "dcj32q5",
  "status": "failed",
  "reason": "auth_failed"
}

and closes the connection.

  • A consumer watches and consumes the queue and sends the task for processing to a worker.
  • The worker processes the job (one or many at a time), reports the progress in between (if it wishes to) and dispatches an event once it is done with the job. You may note that the task can fail at this stage also, so it dispatches a success or a failure event which can be handled accordingly.
  • The API server queries the progress and reports it to the client (through web-sockets or polling XHR/Fetch requests) so that the application can show a nice progress bar in the UI.
  • It also listens for the success or failure events and sends a notification to the client.
  • The client can now request the resource through another API call and the server responds with the requested resource to the client and closes the connection.

This way the clients are assured immediately that

Hey, I'm working on your job. I'll notify you once it is done, in the meanwhile you can do some other stuff.

and no one has to keep waiting for long, while the server can efficiently handle more incoming requests.
The task queue essentially glues all these pieces (the API server and the workers) together, shifting the load from the API server to the workers and thus ensuring a much lower response time and less downtime.

Conclusion

Hurray! πŸŽ‰ You now hopefully understand the basics of a task queue, why we need one and what its advantages are ✨. If you think about it, this architecture is highly (horizontally) scalable: increased demand can be addressed by adding more worker processes.
I hope this post was helpful for beginners. If you liked this article, please show some love, give it a πŸ’— and stay tuned πŸ“» for more.
Please comment below if you have any questions or suggestions, and feel free to reach out to me πŸ˜„.

πŸ“ΈInstagram πŸ“¨Email πŸ‘¨β€πŸ’ΌLinkedIn πŸ‘¨β€πŸ’»Github

In the next article, we will see a step-by-step guide on how to set up a simple task queue in Node.js.

Top comments (9)

Joe Previte (he/him)

Fantastic analogy, Sarbik!

I've been wondering about task queues for a while and when/why you might need them. Not only did the analogy clearly explain it, but I now also understand when it would make sense to use one. Thanks for writing this!

Ankur Paul

Inspired by your tweet, I wanted to read this article and get the content out of it. Thanks for digging this out @jsjoeio

Andrei Gatej

Thank you for the article! It was a great read.

I have a small question, though. Referring to the diagram, does it mean that the server is a publisher, but also a sort of consumer? Because it publishes some tasks, but then it acts as a consumer when the worker has finished the work and pushes the result back into the queue, where the server will take care of it. Is this correct?

Thank you!

Sarbik Betal (Author)

Yes Andrei, you can think of it that way. If you look closely, you will notice there are two kinds of data flow happening here: one is the actual task and the other is the events. Different kinds of events are emitted for a task (task queued, task succeeded, task failed, task stalled, etc.). So the server acts as the publisher of the actual task and as the consumer of the task events that are published by the worker. Hope that clears your doubt πŸ˜„.

Andrei Gatej

Makes perfect sense! Thank you very much

Marissa B

Awesome explanation and very clear example with the restaurant! Where was this when I first learned it years ago? :P

Looking forward to the next in the series.

Sarbik Betal (Author)

Thank you 😊. I'll be posting the next article soon

Hasan Basri

Thanks, very useful!

Wassim Ben Jdida

A really great article.
I have a question: what is a message broker, and what exactly does it do? In simple terms, please; I searched on Google but didn't understand a word.
Thanks!!
