Use concurrent futures to reduce blocking IO #37

Merged
7 commits merged into ryanhiebert:master from the patch-1 branch on Aug 20, 2018

Conversation

codingjoe
Collaborator

The queue inspection can currently be very slow, because a transport to the
message broker needs to be opened and messages are exchanged. Especially in a
multi-queue setup, these operations should happen concurrently.

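As a rough sketch of the idea (not this PR's actual code; inspect_queue and the queue names below are made up), inspecting each queue in its own thread brings the wall-clock time down to roughly that of the slowest single inspection instead of the sum of all of them:

import time
from concurrent.futures import ThreadPoolExecutor

def inspect_queue(name):
    # Stand-in for opening a transport to the broker and asking for the queue size.
    time.sleep(0.5)
    return name, 0

queues = ['celery', 'emails', 'reports', 'cleanup']

start = time.monotonic()
with ThreadPoolExecutor() as executor:
    sizes = dict(executor.map(inspect_queue, queues))
print(sizes, time.monotonic() - start)  # roughly 0.5 s here vs. ~2 s sequentially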
@codingjoe
Collaborator Author

Hi @ryanhiebert, I hope you like it. It should give this beauty a couple of extra horsepower. It currently takes us 3200 ms to process this view. I will give you an update on how the performance improves with concurrent IO.

@codingjoe
Collaborator Author

Sorry for all the commits, I was so lazy I coded it in a browser :P
Anyhow, I tested it. Works, and is faster.
Best! Joe

@codingjoe
Collaborator Author

/ping @ryanhiebert

@ryanhiebert
Owner

It looks really good to me, so I'm just thinking through to try and see if there are possible unintended consequences. Thank you for your work on this, and for reminding me about it.

Have you tried this code in your own application yet? I'd love some verification that it's working for somebody before I merge and release it.

-    install_requires=['six'],
+    install_requires=[
+        'six',
+        'futures; python_version == "2.7"',
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I didn't know about this syntax. This is cool, and helped me find this article: https://hynek.me/articles/conditional-python-dependencies/
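For reference, a sketch of how such environment markers look in a setup.py (the packages besides futures are just examples, and older setuptools/pip may need the extras-based fallback described in that article); pip evaluates each marker at install time and only pulls in the dependency when the condition holds:

from setuptools import setup

setup(
    name='example',
    install_requires=[
        'six',
        # Backport of concurrent.futures, only needed on Python 2.7.
        'futures; python_version == "2.7"',
        # Markers can also branch on the OS, interpreter version, and so on.
        'colorama; platform_system == "Windows"',
    ],
)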

with ThreadPoolExecutor() as executor:
    # Execute all procs in parallel to avoid blocking IO,
    # especially Celery, which needs to open a transport to AMQP.
    return list(executor.map(_run, procs.items()))
@ryanhiebert
Owner

Is there any downside to this? Are the app and connection used by Celery thread-safe? We use the same app that celery does to get the connection, etc. I don't want to end up flooding the RabbitMQ server with connections, and I'm just trying to think of other possible downsides. I'm always slow to use threads, because I don't use them very often and I don't have a lot of practice with thinking through possible issues with them.

@codingjoe
Collaborator Author

That is a good point. I recently had an issue with connection stacking regarding the DB even if the DB was not used, see revsys/django-health-check#182

I checked both amqp and librabbitmq (which uses rabbitmq-c), and both seem to be thread-safe.

The DB issue only happens in configuration edge cases, but I am happy to add the bit that ensures they are closed if you want me to.
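If it does turn out to be needed, a minimal sketch of what that bit could look like (the wrapper name is made up; _run and procs are the names from the diff above): each executor thread closes its Django database connections once its proc has been handled.

from django.db import connections

def close_connections_after(func):
    """Call func, then close this thread's Django DB connections."""
    def wrapper(item):
        try:
            return func(item)
        finally:
            connections.close_all()
    return wrapper

# Dropped into the hunk above, the last line would read:
#     return list(executor.map(close_connections_after(_run), procs.items()))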

@ryanhiebert
Owner

Does that mean that you have not yet used this branch in production? How will this affect other Procs, for queues other than Celery?

@codingjoe
Collaborator Author

The only other Proc we use is backed by Redis. I cannot vouch that this will not have any side effects on other third-party packages.
We are currently not using it in production, but I will set up a staging environment and stress-test the endpoint.

Can you assign the PR to me, so I don't forget? I am going on vacation on Friday, and I don't know if I can manage before I leave.

@ryanhiebert
Owner

@codingjoe : Have you been able to test it in a staging or production environment?

@ryanhiebert
Owner

Perhaps this needs to be a setting, so that if we find that there are bugs with race conditions, etc., it will only affect those who have opted into using this feature. It does seem like a pretty cool thing to do.

@ryanhiebert
Owner

I'd really like this to be behind a setting. @codingjoe : are you willing to add such a setting, so that we can be sure that things won't break for existing users, even if there's some weirdness in one of the backend types that isn't compatible with threads?

@codingjoe
Collaborator Author

That's a great idea, I'll do that. It will be disabled by default to ensure backwards compatibility.

@ryanhiebert
Owner

I have Slack periodically reminding me about this so it doesn't fall off my radar, and I'm passing that reminder along to you, @codingjoe. Do you have an idea of when you might be able to look at it, so I don't keep bugging you when you know you won't be able to get to it yet?

@codingjoe
Collaborator Author

@ryanhiebert I am at DjangoCon now and have plenty of time to pick up my pending contributions. I'll work on it right now.

@codingjoe
Collaborator Author

@ryanhiebert do you want this to be a Django setting? It's tricky, since it's implemented in the core library. I might need to inherit some parts in the contrib section to ensure it's working in Django properly.
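A rough sketch of one way this could look (only a sketch; the setting name HIREFIRE_USE_CONCURRENCY is made up, and _run/procs are the names from the diff above): the core helper takes a plain boolean, and the Django contrib layer reads an opt-in setting that defaults to off and passes it through.

from concurrent.futures import ThreadPoolExecutor

from django.conf import settings


def dump_procs(procs, concurrent=False):
    # Core library: sequential by default, threaded only when the caller opts in.
    if not concurrent:
        return [_run(item) for item in procs.items()]
    with ThreadPoolExecutor() as executor:
        return list(executor.map(_run, procs.items()))


def use_concurrency():
    # Django contrib layer: opt in via a setting, disabled by default.
    return getattr(settings, 'HIREFIRE_USE_CONCURRENCY', False)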

@codingjoe
Collaborator Author

@ryanhiebert ok, this should do it. BTW, I'd say the custom JSON encoder is not really needed anywhere, since you don't encode datetimes. You might want to consider dropping it.

@codingjoe
Collaborator Author

Oh, I also took the liberty of using a JsonResponse. It has been there since Django 1.8. I couldn't find an official list of supported versions, but the oldest Django version still supported by Django itself is 1.11, so this should be fine.
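For context, a minimal sketch of such a view (the names and data below are illustrative, not the actual code in this PR):

from django.http import JsonResponse

# Illustrative data; the real view builds this from the configured procs.
QUEUE_SIZES = {'celery': 3, 'emails': 0}

def hirefire_info(request):
    data = [{'name': name, 'quantity': size} for name, size in QUEUE_SIZES.items()]
    # JsonResponse has been in Django since 1.8; safe=False permits a top-level list.
    return JsonResponse(data, safe=False)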

@codingjoe
Collaborator Author

ping @ryanhiebert

@ryanhiebert
Owner

I'm sorry that I haven't responded on this. The design looks great. It turns out this is going to break my own usage of the library, because in my $WORK code I use that native_dump_procs function, and that's why I didn't get to it like I should have. Thank you for the ping, I needed it.

It looks really good, despite the hangup that has been slowing me down, and I think it's the right way to go.

@ryanhiebert ryanhiebert merged commit 956b79a into ryanhiebert:master Aug 20, 2018
@codingjoe codingjoe deleted the patch-1 branch August 21, 2018 11:40
codingjoe added a commit that referenced this pull request Oct 10, 2018
- Use concurrent futures to reduce blocking IO (#37)
- Add a test suite (#41)
- Fix RabbitMQ connection `TimeoutError` (#42).
  Acquire and dispose broker connection per request
@codingjoe
Collaborator Author

@ryanhiebert a little anecdote: this change, plus disabling worker pool inspection, improved performance on our site by 206x, from >3 s to <15 ms ;)

@ryanhiebert
Owner

Sweet! The worker inspection is what really kills the time, so that doesn't surprise me in the least. I'm not sure, though, that you necessarily gained much just from this change. The problem is that I don't get a reliable number, one that won't shut down running tasks, unless I use the worker inspection, so it's kind of a non-starter to skip it for my use cases.

@codingjoe
Collaborator Author

Yes, that is right. We run many queues, though, so concurrency alone cut it by 10x, and skipping inspection by another 20x. We don't really need the numbers to be perfect, since we only scale on queue size for slow, high-volume tasks. We scale high-speed queues based on message rate; we wrote our own Procs for that, and I'll port them upstream once I get the time. Message rate can be really helpful if you want to ensure a certain message acknowledgement time.
