We should use aiohttp instead of requests (what about wikidata?) #22

Closed
Tracked by #14
alexgarel opened this issue Aug 25, 2022 · 4 comments · Fixed by #29

Comments

@alexgarel
Member

The requests library does not seem to be compatible with async frameworks: it blocks the thread when it should release it to the event loop.

This would make our server very unresponsive under a lot of concurrent requests.

We have to use either aiohttp or requests-futures instead (or any other popular library for that purpose).

We also have a problem with the wikidata lib, which uses urllib.request… we should see whether we can monkeypatch the code for that…
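A minimal sketch of the problem, using only the standard library: `blocking_fetch` is a hypothetical stand-in for a `requests` call (it holds the thread), while `yielding_fetch` stands in for an `aiohttp`-style call (it yields to the event loop).

```python
import asyncio
import time

async def blocking_fetch():
    # Stand-in for a requests call: time.sleep() holds the whole event loop.
    time.sleep(0.2)

async def yielding_fetch():
    # Stand-in for an aiohttp call: asyncio.sleep() yields control to the loop.
    await asyncio.sleep(0.2)

async def timed(coros):
    start = time.monotonic()
    await asyncio.gather(*coros)
    return time.monotonic() - start

blocking = asyncio.run(timed([blocking_fetch() for _ in range(5)]))
yielding = asyncio.run(timed([yielding_fetch() for _ in range(5)]))
print(f"blocking: {blocking:.2f}s, yielding: {yielding:.2f}s")
# The blocking version takes roughly 5 × 0.2 s; the yielding one roughly 0.2 s.
```

With blocking calls, five "concurrent" requests run one after another on the event loop; any other client waiting on the same server stalls in the meantime.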

@alexgarel
Member Author

A simple solution for wikidata is to use https://docs.python.org/3/library/asyncio-eventloop.html#asyncio.loop.run_in_executor
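A sketch of that approach; `fetch_wikidata_entity` is a hypothetical stand-in for the blocking wikidata client:

```python
import asyncio
from functools import partial

def fetch_wikidata_entity(qid):
    # Hypothetical stand-in for the synchronous wikidata client
    # (which uses urllib.request under the hood).
    return {"id": qid}

async def get_entity(qid):
    loop = asyncio.get_running_loop()
    # None = use the loop's default ThreadPoolExecutor, so the
    # blocking call no longer stalls the event loop.
    return await loop.run_in_executor(None, partial(fetch_wikidata_entity, qid))

result = asyncio.run(get_entity("Q42"))
print(result)  # {'id': 'Q42'}
```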

@alexgarel
Member Author

@sumit-158 I found a good lib: https://asyncer.tiangolo.com/ (it's at version 0.0.1, but as explained in its intro this is not a problem).

This will have lots of benefits:

  1. we can easily make sync code async (for wikidata) thanks to the asyncify helper (for the rest, I think we'd better use aiohttp)
  2. we can use the task-group pattern in our main API function, which will really speed things up (all knowledge panels will be computed in parallel).
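Both ideas can be sketched with the standard library alone: `asyncio.to_thread` (Python 3.9+) behaves much like asyncer's `asyncify`, and `asyncio.gather` gives the task-group effect. `sync_wikidata_lookup`, `panel_a`, and `panel_b` are hypothetical names:

```python
import asyncio

def sync_wikidata_lookup(qid):
    # Hypothetical blocking helper, like the wikidata lib.
    return f"label for {qid}"

async def panel_a():
    # Stands in for a panel built from an aiohttp request.
    await asyncio.sleep(0.1)
    return "panel_a"

async def panel_b():
    # asyncify-style wrapping of sync code; with asyncer this would be
    # await asyncify(sync_wikidata_lookup)(qid).
    return await asyncio.to_thread(sync_wikidata_lookup, "Q42")

async def main():
    # Task-group pattern: both panels are computed in parallel.
    return await asyncio.gather(panel_a(), panel_b())

results = asyncio.run(main())
print(results)  # ['panel_a', 'label for Q42']
```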

@sumit-158
Member

sumit-158 commented Sep 16, 2022

@alexgarel I was doing some tests (just for fun!) on the response timing with the possible methods, and I got these results.

Process time of the response (values are from the current main branch, i.e., a single facet):

  • Without async: 10.14 sec
  • With async and aiohttp, but without the task-group pattern (no parallelism): 8.71 sec
  • Same as above, with the task-group pattern (parallelism): 5.78 sec

I was also thinking that running it with Gunicorn might decrease compute time (I'm not so sure!).

@alexgarel
Member Author

@sumit-158 I guess the limiting factor here is our request time, and the longest might be wikidata, which needs two requests.
You could time the requests and log them using time.monotonic to see how much time is spent outside of our code.
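A minimal sketch of that timing approach; `timed_call` is a hypothetical helper:

```python
import logging
import time

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("timing")

def timed_call(name, func, *args, **kwargs):
    # Hypothetical helper: run func and log its wall-clock duration.
    # time.monotonic is immune to system clock adjustments, so it is
    # the right clock for measuring elapsed time.
    start = time.monotonic()
    try:
        return func(*args, **kwargs)
    finally:
        logger.info("%s took %.3f s", name, time.monotonic() - start)

value = timed_call("sum-demo", sum, [1, 2, 3])
print(value)  # 6
```

Wrapping each outgoing request this way shows whether latency is dominated by the external services or by our own code.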

Moving to Gunicorn is about handling a lot of concurrent requests, not about reducing the latency of a single request.

@sumit-158 sumit-158 moved this from 🆕 New to ✅ Done in Knowledge Panels for Facets Sep 28, 2022