Matt

Posted on Feb 25, 2020 • Edited on Jan 3, 2021

Asynchronous HTTP requests in Python

#python #beginners #datascience

In python, you can make HTTP request to API using the requests module
or native urllib3 module.

However, requests and urllib3 are synchronous. It means that only one HTTP call can be made at a time in a single thread. Sometimes you have to make multiples HTTP call and synchronous code will perform baldy. To avoid this, you can use multi-threading or since python 3.4 asyncio module.

Test case

In order to show the difference of time between sync and async code, i made a script that read a file with 500 cities names and perform HTTP call to an API to retrieve information about location, population and so on from the city name.

Sync code performance

Here is the sync code version with the requests module

@timeit
def fetch_all(cities):
    responses = []
    with requests.session() as session:
        for city in cities:
            resp = session.get(f"https://geo.api.gouv.fr/communes?nom={city}&fields=nom,region&format=json&geometry=centr")
            responses.append(resp.json())
    return responses

Finished 'fetch_all' in 38.7053 secs

Async code performance

I used aiohttp module to make the async code as the requests module doesn't support asyncio for now.

async def fetch(session, url):
    """Execute an http call async
    Args:
        session: contexte for making the http call
        url: URL to call
    Return:
        responses: A dict like object containing http response
    """
    async with session.get(url) as response:
        resp = await response.json()
        return resp

async def fetch_all(cities):
    """ Gather many HTTP call made async
    Args:
        cities: a list of string 
    Return:
        responses: A list of dict like object containing http response
    """
    async with aiohttp.ClientSession() as session:
        tasks = []
        for city in cities:
            tasks.append(
                fetch(
                    session,
                    f"https://geo.api.gouv.fr/communes?nom={city}&fields=nom,region&format=json&geometry=centr",
                )
            )
        responses = await asyncio.gather(*tasks, return_exceptions=True)
        return responses
@timeit
def run(cities):
    responses = asyncio.run(fetch_all(cities))
    return responses

Finished 'run' in 3.0706 secs

Conclusion

As you can see, the async version is a lot faster than the sync version so if you run into a situation where your code is performing multiple I/O calls then you should consider concurrency to improve performance. However asynchronous version requires more work as you can see.

If you want to see the threading version that works with the requests module and also see how to implement automatic retry and caching on your API call check out this tutorial.

🚀 pgai Vectorizer: SQLAlchemy and LiteLLM Make Vector Search Simple

We built pgai Vectorizer to simplify embedding management for AI applications—without needing a separate database or complex infrastructure. Since launch, developers have created over 3,000 vectorizers on Timescale Cloud, with many more self-hosted.

Read full post →

Top comments (2)

Martin Breuss • Nov 10 '21 • Edited

Wondering the same as Alexander. Is there a feasible way to do this without any external libraries (e.g. using urllib.request for the HTTP calls, but making it async with asyncio)?

Most of what I can find online uses aiohttp, or httpx, which are both external libraries.

Alexander Whillas • Dec 15 '20

this uses aiohttp which is not a native python3 module. is it possible to do this with only in native python3?

DEV Community

Asynchronous HTTP requests in Python

Test case

Sync code performance

Async code performance

Conclusion

🚀 pgai Vectorizer: SQLAlchemy and LiteLLM Make Vector Search Simple

Top comments (2)

The Next Generation Developer Platform

Read next

Password Validator using html css and javascript

Resolving Client Secret Expiry Issues in Microsoft Graph Data Connect for SharePoint

FLUX: Breakthrough 1.58-bit Neural Network Compression Maintains Full Accuracy While Slashing Memory Use by 20x

AI Beats Humans at Fantasy Sports: Deep Learning System Shows 15% Better Team Selection