23 votes

My Flask application will receive a request, do some processing, and then make a request to a slow external endpoint that takes 5 seconds to respond. It looks like running Gunicorn with Gevent will allow it to handle many of these slow requests at the same time. How can I modify the example below so that the view is non-blocking?

import requests

@app.route('/do', methods = ['POST'])
def do():
    result = requests.get('slow api')
    return result.content

gunicorn server:app -k gevent -w 4
What do you expect would happen here? You can't return anything to the client if you haven't received it yet. – Wayne Werner
I was expecting to make it async, so that while one request is waiting for the super slow API, the CPU can be used to handle other incoming requests that may be going to other routes. (I assume this application will receive lots of other incoming requests.) – JLTChiu
That doesn't mean what you think it means. And Gunicorn should be handling this for you; you could test it just by adding a time.sleep(30) in there, I think. It's called the reactor pattern: Gunicorn allows the client to connect, then passes the request off to a worker. When the worker finishes, Gunicorn returns the worker's data to the client and puts the worker back in the pool. I'm not sure if it spins up a new worker when all the existing ones are busy, though. – Wayne Werner
I am still learning this, but I expect running Gunicorn should be something like gunicorn server:app -k gevent -w 4, though I am really not sure. – JLTChiu
@WayneWerner, do you mean that with the current code posted above, while a specific request is waiting for the slow API to respond, the CPU will be used to process other incoming requests to the application server? – JLTChiu

3 Answers

14 votes

If you're deploying your Flask application with gunicorn, it is already non-blocking. If a client is waiting on a response from one of your views, another client can make a request to the same view without a problem. There will be multiple workers to process multiple requests concurrently. No need to change your code for this to work. This also goes for pretty much every Flask deployment option.
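You can convince yourself of this without gunicorn at all. Here is a minimal sketch in which four threads stand in for four workers, each "handling" a request whose 0.2-second sleep is a stand-in for the 5-second external API; because the workers run concurrently, the total wall time is roughly one request's latency, not four:

```python
import threading
import time

SLOW_CALL = 0.2  # stand-in for the 5-second external API

def handle_request(results, i):
    time.sleep(SLOW_CALL)  # simulates requests.get('slow api') blocking
    results[i] = "done"

results = [None] * 4
start = time.monotonic()
workers = [threading.Thread(target=handle_request, args=(results, i))
           for i in range(4)]
for w in workers:
    w.start()
for w in workers:
    w.join()
elapsed = time.monotonic() - start

print(results)  # ['done', 'done', 'done', 'done']
print(elapsed)  # roughly SLOW_CALL, far less than 4 * SLOW_CALL
```

Gunicorn uses separate processes rather than threads, but the effect on slow views is the same: while one worker blocks on the slow call, the others keep serving requests.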

6 votes

First, a bit of background. A blocking socket is the default kind of socket: once you start reading, your app or thread does not regain control until data is actually read or you are disconnected. This is how python-requests operates by default. There is a spin-off called grequests that provides non-blocking reads.

The major mechanical difference is that send, recv, connect and accept can return without having done anything. You have (of course) a number of choices. You can check return code and error codes and generally drive yourself crazy. If you don’t believe me, try it sometime

Source: https://docs.python.org/2/howto/sockets.html

It also goes on to say:

There’s no question that the fastest sockets code uses non-blocking sockets and select to multiplex them. You can put together something that will saturate a LAN connection without putting any strain on the CPU. The trouble is that an app written this way can’t do much of anything else - it needs to be ready to shuffle bytes around at all times.

Assuming that your app is actually supposed to do something more than that, threading is the optimal solution
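The select-based multiplexing the HOWTO describes can be sketched with the stdlib selectors module. This is a toy example over a socketpair, not production code: both ends are made non-blocking, and the selector tells us when a read is guaranteed not to block:

```python
import selectors
import socket

# A connected pair of sockets; one end writes, the other reads.
left, right = socket.socketpair()
left.setblocking(False)
right.setblocking(False)

sel = selectors.DefaultSelector()
sel.register(right, selectors.EVENT_READ)

left.sendall(b"hello")  # queue data on one end

received = b""
for key, events in sel.select(timeout=1):  # wait until the other end is readable
    received = key.fileobj.recv(1024)      # guaranteed not to block here

sel.unregister(right)
left.close()
right.close()
print(received)  # b'hello'
```

This is exactly the "shuffle bytes around at all times" style the HOWTO warns about: efficient, but your code becomes an event loop rather than straight-line request handling.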

But do you want to add a whole lot of complexity to your view by having it spawn its own threads, particularly when gunicorn has async workers?

The asynchronous workers available are based on Greenlets (via Eventlet and Gevent). Greenlets are an implementation of cooperative multi-threading for Python. In general, an application should be able to make use of these worker classes with no changes.

and

Some examples of behavior requiring asynchronous workers: applications making long blocking calls (i.e., external web services)
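Per those docs, switching to the gevent worker class requires no code changes, only deployment configuration. A sketch of a gunicorn config file (the values here are illustrative assumptions, not recommendations; tune them to your hardware):

```python
# gunicorn.conf.py -- illustrative values only
worker_class = "gevent"    # cooperative greenlet workers instead of sync
workers = 4                # number of worker processes
worker_connections = 1000  # max concurrent greenlets per worker
```

Run it with `gunicorn -c gunicorn.conf.py server:app`, which is equivalent to the `-k gevent -w 4` command line in the question.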

So to cut a long story short: don't change anything, just let it be. If you make any changes at all, let them be to introduce caching. Consider using CacheControl, an extension recommended by the python-requests developers.
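To illustrate what caching buys you here (CacheControl itself wires into a requests session and honors HTTP Cache-Control headers; the helper below is a hypothetical stand-in using a plain TTL dict):

```python
import time

_cache = {}
TTL = 60.0  # seconds to reuse a response; hypothetical choice

def cached_fetch(url, fetch, now=time.monotonic):
    """fetch(url) is the slow call; its result is reused for TTL seconds."""
    entry = _cache.get(url)
    if entry is not None and now() - entry[0] < TTL:
        return entry[1]  # cache hit: skip the slow upstream call
    value = fetch(url)
    _cache[url] = (now(), value)
    return value

calls = []
def slow_fetch(url):
    calls.append(url)  # record each real upstream call
    return "payload"

print(cached_fetch("slow api", slow_fetch))  # "payload" -- real fetch
print(cached_fetch("slow api", slow_fetch))  # "payload" -- served from cache
print(len(calls))  # 1
```

If many requests hit the same slow endpoint, a cache like this turns a 5-second view into a near-instant one for everything after the first call.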

1 vote

You can use grequests. It allows other greenlets to run while the request is in flight. It is compatible with the requests library and returns requests.Response objects. The usage is as follows:

import grequests

@app.route('/do', methods = ['POST'])
def do():
    results = grequests.map([grequests.get('slow api')])
    return results[0].content

Edit: I added a test and saw that the response time didn't improve with grequests, because gunicorn's gevent worker already performs monkey-patching when it is initialized: https://github.com/benoitc/gunicorn/blob/master/gunicorn/workers/ggevent.py#L65