
paho Rust slower than paho Python? #63

Closed
tobdub-snce opened this issue Jan 8, 2020 · 13 comments

@tobdub-snce

tobdub-snce commented Jan 8, 2020

I have an application implemented in both Rust and Python using the paho MQTT libraries for each language.
The app receives around 800 MQTT messages per second and triggers HTTP calls for a few of the messages based on some simple parsing.
The Rust version uses the futures API with tokio 0.2. The Python version runs on PyPy3.6 v7.3.0.
For some reason the Rust version uses 50% more CPU than the Python version (running on an AWS T3 instance). This was a bit surprising to me, as I expected the Rust version to consume fewer resources.
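For context, the receive path looks roughly like this (a trimmed-down sketch, not the real code; the broker URI, topic, and parsing logic are placeholders):

```rust
use futures::stream::StreamExt;
use paho_mqtt as mqtt;

#[tokio::main]
async fn main() -> mqtt::Result<()> {
    // Create the client and open a stream of incoming messages.
    let mut client = mqtt::AsyncClient::new("tcp://broker.example:1883")?;
    let mut stream = client.get_stream(100);

    client.connect(mqtt::ConnectOptions::new()).await?;
    client.subscribe("sensors/#", mqtt::QOS_0).await?;

    // ~800 msgs/sec arrive here; only a few trigger an HTTP call.
    while let Some(msg_opt) = stream.next().await {
        if let Some(msg) = msg_opt {
            if msg.payload_str().contains("alarm") {
                // fire the HTTP request (omitted)
            }
        }
    }
    Ok(())
}
```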

flamegraph.svg.gz

@fpagliughi
Contributor

Ouch! Yes, I'm with you. The Rust version should be more efficient. I really do want to put together a set of standard measurements for the Paho libraries so that we can get side-by-side comparisons of the performance and requirements of each: messages per second, memory use, CPU use, etc.

The one performance issue that I'm aware of is that there is more memory copying than might be necessary at the boundary between the Rust and underlying C library. Sometimes a buffer is copied in order to ensure Rust lifetime guarantees, and there might be places to improve on this. But I wouldn't imagine that it degrades performance to the degree you report.

The only thing I can think of is that some bugs have been filed recently against the C library reporting that it is "spinning" and using up a lot of CPU in some instances:
eclipse-paho/paho.mqtt.c#781

That could be related.

@tobdub-snce
Author

According to the flamegraph, a lot of time is spent in WebSocket_getch. It seems to be reading single chars from the TCP stream? But I guess that is on the C library side.

@fpagliughi
Contributor

Ah. (Sorry, I didn't have much time this morning to dig into the graph).
The WebSocket implementation is a fairly recent addition to the C lib, from a contribution about a year ago. On the Rust side it was awesome in that it came completely for free; it just worked. But if the performance is lagging, that would be worth looking into. Perhaps it's worth cross-posting this information on the Paho C repo as well.

@icraggs

icraggs commented Jan 9, 2020

I need to look at the WebSocket implementation, or someone does. It works to the extent that basic functionality operates, but there are issues that need addressing.

Also remember that the Python implementation has no disk persistence. You can turn that off in the C library if you want.

@tobdub-snce
Author

The code is not actually using WebSockets. It looks like WebSocket_getch just calls SSLSocket_getch. I also tried disabling SSL, but that was actually slightly slower...
Disk persistence is disabled (mqtt::PersistenceType::None) and the subscriptions are made with QoS 0.
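For reference, the client setup is roughly the following (a sketch; the URI and topic are placeholders):

```rust
use paho_mqtt as mqtt;

// Sketch of the client setup with disk persistence disabled and a QoS 0 subscription.
async fn setup_client() -> mqtt::Result<mqtt::AsyncClient> {
    let create_opts = mqtt::CreateOptionsBuilder::new()
        .server_uri("tcp://broker.example:1883")
        .persistence(mqtt::PersistenceType::None) // no disk persistence
        .finalize();

    let client = mqtt::AsyncClient::new(create_opts)?;
    client.connect(mqtt::ConnectOptions::new()).await?;

    // Subscription at QoS 0.
    client.subscribe("sensors/#", mqtt::QOS_0).await?;
    Ok(client)
}
```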

@icraggs

icraggs commented Jan 9, 2020

Ok.

WebSocket_getch() is where we wait for the next incoming packet to be delivered (the first byte of the MQTT packet). WebSocket_getdata() is where the rest of the packet will be read in. So I'd be surprised if the getch() call is using a lot of CPU time. Elapsed time?

@tobdub-snce
Author

tobdub-snce commented Jan 9, 2020

It should be CPU time; the graph was created using cargo-flamegraph.
The message payloads are around 100 bytes.
I created a new flamegraph in the cloud (the first was from my laptop) using an example based on https://github.com/eclipse/paho.mqtt.rust/blob/master/examples/futures_consume.rs
and the results look a bit different: WebSocket_getch is smaller, but still larger than WebSocket_getdata. Syscall overhead? Could the I/O be made buffered?
A fair amount of time seems to be spent in Rust futures as well; I will remeasure tomorrow using the non-futures version.
The Python version is still faster (with PyPy).
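To illustrate what I mean by buffered I/O (plain Rust std, nothing paho-specific): reading one byte at a time from a raw socket costs a syscall per byte, whereas a buffer amortizes it:

```rust
use std::io::{BufReader, Read};
use std::net::TcpStream;

fn read_first_byte(stream: TcpStream) -> std::io::Result<u8> {
    // BufReader pulls in a larger chunk with one read() syscall and
    // serves subsequent single-byte reads from memory.
    let mut reader = BufReader::with_capacity(4096, stream);
    let mut byte = [0u8; 1];
    reader.read_exact(&mut byte)?;
    Ok(byte[0])
}
```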

flamegraph.svg.gz

@tobdub-snce
Author

tobdub-snce commented Jan 10, 2020

I remeasured using the https://github.com/eclipse/paho.mqtt.rust/blob/master/examples/async_subscribe.rs example, and it is faster, almost as fast as the Python PyPy version.
The C lib's StackTrace seems to cause noticeable overhead (37.6% of CPU time); maybe that could be disabled for release builds or added as a feature flag?
flamegraph.svg.gz

@tobdub-snce
Author

tobdub-snce commented Jan 14, 2020

There is also some logging triggered by the C lib that it may be possible to disable (21% of CPU time with StackTrace enabled).

@tobdub-snce
Author

A PAHO_HIGH_PERFORMANCE CMake flag is now available in the C lib. It doubles the performance for my use case and makes the Rust version faster than the Python version. It would be great if that flag could be enabled by default, or exposed as a feature in the Rust lib.
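For anyone building the bundled C library through a build.rs that uses the cmake crate, passing the flag could look roughly like this (a sketch; the source path and extra defines are illustrative, not the crate's actual build script):

```rust
// build.rs sketch: forward PAHO_HIGH_PERFORMANCE to the C library's CMake build.
fn main() {
    let dst = cmake::Config::new("paho.mqtt.c")
        .define("PAHO_HIGH_PERFORMANCE", "on")
        .define("PAHO_BUILD_STATIC", "on")
        .build();
    println!("cargo:rustc-link-search=native={}/lib", dst.display());
    println!("cargo:rustc-link-lib=static=paho-mqtt3as");
}
```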

@fpagliughi
Contributor

Agreed. I pushed out v0.7 based on what had been sitting in the repo for months waiting on the upstream bug fixes. But I'm immediately jumping on the next release and will start testing this.

I was assuming I would just enable this in the build. I didn't imagine anyone not wanting to use it, but I suppose I can add an inverted feature to turn it off, just in case.

@fpagliughi added this to the v0.8 milestone Apr 28, 2020
@fpagliughi
Contributor

This is in the develop branch.

@fpagliughi added the "fix added" label (a fix was added to an unreleased branch) May 27, 2020
@fpagliughi
Contributor

Released in v0.8
