
Fix for crash when using the JACK backend and quickly reconfiguring #529

Merged · 1 commit · Aug 29, 2020
Conversation

hselasky
Contributor

Found some bugs here and there and managed to improve overall jitter with my audio card.

@corrados
Contributor

Thanks for your code. I'll add some comments to your new code.

src/client.h (review thread, resolved)
src/client.cpp (review thread, resolved)
src/client.cpp:
return;
}

// In common audio subsystems, there may be multiple audio sizes in use.
Contributor

Interesting concept. Is this Linux-specific or does this apply to all supported OSs?

Contributor Author

It applies to all OSes where Jamulus is not the only client of the sound card subsystem.

Usually sound mixers have a pre-defined, fixed interval for buffering.

Furthermore, USB hardware only supports small buffers in sizes of 2 ms, 3 ms, 4 ms and so on. When Jamulus specifies a 64-sample buffer, the USB hardware technically cannot program that! So you end up with audio bursts, which in turn affect the timing of the UDP packets, and I found this to be the root cause of all jitter problems during my tests with FreeBSD. Too bad Windows doesn't have a nanosecond clock; I'm not fully sure how this patch will work there. Maybe someone can help testing?

Contributor

I can do it when I am back at home in a week. If Windows does not support it, we should move your code in the Jack Sound.cpp file for now. Actually with my audio devices I tested, I did not have any higher jitter from the audio driver. What audio hardware are you using? And how did you measure the jitter?

Contributor Author

@hselasky · Aug 19, 2020

I have an X32-RACK audio device configured for 8-channel audio, of which I use 2 channels for Jamulus. I measured the jitter by using Wireshark and looking at the time between data packets going out on the wire. After this patch I saw a major improvement.

I'm disabling this for Win32, because I think the timer there is not good enough (milliseconds only).

src/main.cpp (review thread, resolved)
src/util.h (review thread, resolved)
@hselasky
Contributor Author

Thanks for your code. I'll add some comments to your new code.

Thank you!

@hselasky
Contributor Author

I added in one more patch. I hope it is not too much :-)

@WolfganP

Wouldn't switching from int to float add a lot of processing time? (Int arithmetic is usually much faster than float.)

@corrados
Contributor

I added in one more patch. I hope it is not too much :-)

Well, actually it is. I would prefer smaller pull requests.

@corrados
Contributor

Switch all Jamulus audio sample processing to use floats instead of...

Can you please give some more background information why you did this?

@hselasky
Contributor Author

Yes: the Opus codec supports float, and I thought it would be cheap to add support for more than 16-bit sound. Many of the professional audio devices you recommend for use with Jamulus do 24-bit audio. There is a distinct difference between 16-bit and 24-bit audio; when the audio level is low, 24 bits still give you a clear signal, and you simply don't lose bits.

On the server side, double is used for mixing today. This is overkill! Using float will give a slight performance gain.
On the client side, using float instead of int16_t will not have any significant performance impact. We are already multiplying with doubles, so I think we are good here too.

@hselasky
Contributor Author

Wouldn't switching from int to float add a lot of processing time? (Int arithmetic is usually much faster than float.)

The server is using "double" today. And using "float" is expected to reduce the processing time a bit.

@hselasky
Contributor Author

@corrados : If some of the commits are good to go, I can put them in a separate pull request, or maybe you can just push them separately and I'll rebase this pull request.

@corrados
Contributor

corrados commented Aug 21, 2020

Let's do things step-by-step and not all together.

1. Obvious little fix like in socket the public slot -> ok, should be merged -> done

2. Crash which you fixed by Mutex: The Mutex fixes the crash but not the root of the problem. I would like to see where the root of the problem is and fix that, since the way the code is right now it should not crash.

3. Delaying audio packets to minimize jitter: Interesting concept. Your code is right now in client.cpp and applies to all OSs. If you have an audio interface which is rock solid regarding timing, your code may introduce up to a block size of additional delay, so it may make things worse. But if you have an audio driver which gives us bursts, I agree that your code lowers the jitter, which makes smaller jitter buffers possible at the server. But this is a special case: none of the audio cards I have used with Jamulus show this bursty behaviour. I would not apply this algorithm by default for everyone.

4. Converting to float: This is a big change in the core of the Jamulus signal processing. I see your point with the 24 bits, but I don't think it gives you any noticeable improvement in the "jam session" use case; 24 bits may make sense in a studio environment. Have you done speed evaluations regarding OPUS working on float instead of short? Is it equally fast? I also have concerns about special functionality like the clipping indicator (which, I guess, will not work correctly with your new code) and, e.g., the fader group function, which works on relative levels and where I think you will need double precision to avoid error propagation. BTW, changing the current signal processing code to work on float instead of double in the server is a trivial and small change.
   Bottom line: I am not sure if Jamulus really needs this change.

@hselasky
Contributor Author

Will you merge #1?

@hselasky
Contributor Author

Regarding #3, what kind of audio cards are these? What interface do they use? What OS?

@hselasky
Contributor Author

Regarding #4, I don't see any noticeable increase in CPU usage; it's like 10 -> 11 % CPU for both server and client on my machine.

@corrados
Contributor

Will you merge #1?

yes

@corrados
Contributor

Regarding #3, what kind of audio cards are these? What interface do they use? What OS?

I have tried with Soundblaster Audigy on Lubuntu, Behringer UCA 202 on Windows and Mac, Lexicon Omega on Windows and Mac.

@hselasky
Contributor Author

Regarding #3, what kind of audio cards are these? What interface do they use? What OS?

I have tried with Soundblaster Audigy on Lubuntu, Behringer UCA 202 on Windows and Mac, Lexicon Omega on Windows and Mac.

Did you enable the jitter-measuring debug code for these setups in the audio callback?

BTW: I changed my patch a bit, to attack the UDP transmission instead of the AudioProcess routine. Encoding/decoding takes a variable amount of time, so to get that out of the equation, I've moved the delay around a bit.

@hselasky
Contributor Author

@corrados : Something else before I forget it:

celt/entdec.c: _this->nbits_total=EC_CODE_BITS+1
celt/entenc.c: _this->nbits_total=EC_CODE_BITS+1;

The first bit in every OPUS frame is unused! It may be possible to use this for something like a toggle...

@corrados
Contributor

Will you merge #1?

yes

done

@hselasky
Contributor Author

Will you merge #1?

yes

done

Thank you - just rebased my patches.

@corrados
Contributor

corrados commented Aug 21, 2020

Did you enable the jitter-measuring debug code for these setups in the audio callback?

I just did this for the Soundblaster Audigy card under Linux:
[plot: audio callback jitter, Soundblaster Audigy under Linux]
The spikes are caused by other processes interfering with Jamulus (causing x-runs). As you can see, the regular jitter is very small so there is no need for traffic shaping for that audio card.

Can you post such a plot for your audio hardware? I am curious how it looks in your case where you get bursts of audio blocks.

@corrados
Contributor

Regarding #4, I don't see any noticeable increase in CPU usage; it's like 10 -> 11 % CPU for both server and client on my machine.

I think we should not test it on a normal desktop CPU but on low end hardware like the Raspberry Pi Zero, see: #483 (comment)

@hselasky
Contributor Author

[plots: three jitter measurements from hselasky's setup]

@corrados : Can you test a USB-based audio device?

@corrados
Contributor

corrados commented Aug 21, 2020

Thanks for the plots. That is very interesting. I'll evaluate the jitter of my USB devices as soon as I am back at home. I'm now very curious what it will look like...

Can you please add which operating system you have used for these plots?

@hselasky
Contributor Author

Can you please add which operating system you have used for these plots?

I'm using FreeBSD with an XHCI USB controller.

@hselasky
Contributor Author

@corrados

I think we should not test it on a normal desktop CPU but on low end hardware like the Raspberry Pi Zero, see: #483

I tested on a RPI3 running FreeBSD:

Jamulus running on RPI3:
w/o patches: 10.11% CPU usage approx
w/ float patches: 8.96% CPU usage approx

However, a new problem arose:

ntpdate 0.freebsd.pool.ntp.org
22 Aug 11:12:14 ntpdate[4271]: adjust time server 192.36.143.130 offset -0.021860 sec

The RPI3 produces an effective sample rate of 48060 Hz, when there is no packet loss, compared to my other test computer, meaning the jitter buffer on the client "goes nuts" after a while :-(

I think we need some tuning parameter here, or calibration, for this to work better!

@hselasky
Contributor Author

@corrados : I have an idea how we can solve all of these jitter issues once and for all, by analyzing the timestamps on all packets received on the client :-) Let me work on it a bit.

The main problem I see is that we send 3 packets every 4 milliseconds for the smallest buffer size, or 3 packets every 8 milliseconds. There is a simple statistical remainder trick that can be used here, but we need a bigger (prime) number than 3, I think!

The goal is to not depend on a high-resolution timer; a millisecond timer, like the one available in Windows, would suffice.

Meanwhile I would like you to consider my floating point patch. I can put it first in the commit list to make merging easier!

@corrados
Contributor

corrados commented Aug 23, 2020

Meanwhile I would like you to consider my floating point patch. I can put it first in the commit list to make merging easier!

Yes, I need a pull request that only contains these changes. Then I can start reviewing your changes.

@hselasky
Contributor Author

@corrados : Once #535 is resolved, we can resume this one!

@corrados
Contributor

This is what I get on my Lexicon Omega under Windows:
[plot: jitter, Lexicon Omega under Windows]
So it gives me a jitter of about 1 ms.

@hselasky
Contributor Author

Something like that is expected. It likely means the USB stack in Windows is using a 1 ms audio buffer, which is required by the XHCI USB host controller.

@hselasky
Contributor Author

@corrados : I have the plans ready for a cool jitter correction algorithm. All I need is one bit per frame, and I see the first bit of every OPUS frame appears unused, so I'm asking for permission to use that bit for something!

@corrados
Contributor

Can you please give some more info on why you need the bit?

@hselasky
Contributor Author

@corrados : I want to implement a "noise-based" (not sure what you would call it) sequence number for every OPUS frame. That means there is, for example, a 13-bit frame number incrementing one by one; then, via a formula, only one of the 13 bits is OR'ed into every OPUS frame.

The reason we need a frame number is that it is very difficult to estimate the packet loss accurately when no timing information is provided with every UDP packet from the sender side.

Also, 48 kHz is not 48 kHz. Over time the clocks will drift (sender and receiver side), but if we had a sequence number, this clock drift could easily be eliminated by libsamplerate on the client side!

Without a sequence number it is impossible to know how many packets were dropped on the wire.

--HPS

@corrados
Contributor

corrados commented Aug 24, 2020

This goes in the wrong direction. The idea of Jamulus is to keep it simple. About 10 years back I also did tests with resampling, but that did not give any improvement. The idea is: if a packet is lost, it is lost. No need to track the packet number. You can read more about this in http://llcon.sourceforge.net/PerformingBandRehearsalsontheInternetWithJamulus.pdf

If you want to include a resampler, you have the problem that we are working on blocks of data. If you have a clock difference, you would have to resample and get, e.g., 129 samples out of the resampler instead of 128. But you have to transmit 128, so what do you do with the additional sample? You would have to introduce a new buffer, so you will have additional latency. The only meaningful clock correction would be hardware based (maybe GPS-based), but this is out of scope for the Jamulus software.

it is very difficult to estimate the packet loss

For what reason do you need the packet loss for improving the sound card jitter?

@hselasky
Contributor Author

I understand that Jamulus doesn't care about:

a) reordering of packets
b) lost packets

Using the OPUS codec to conceal jitter might be more time-clever than using libsamplerate.
OK, agreed, let's forget about libsamplerate.

For what reason do you need the packet loss for improving the sound card jitter?

When you know the exact clock rate difference, you can subtract or add it to the sound card jitter, to reduce the size of the jitter buffer needed.

Right now, in the "perfect" link scenario, all clock differences are accounted for as jitter. It is as if Jamulus needs packet loss to keep the jitter buffer down!

You asked about testing a Raspberry Pi, and what I found is that the system clock is not that accurate.

The RPI, for example, generates 751 64-sample OPUS packets per second, but the client only consumes 750 per second.
So every second with "perfect" transmission the jitter buffer grows by 1 packet. This also adds up for the latency, right?

Even for 1 second of continuous data, we could reduce the latency by 1.33 ms if we knew some sequence numbers here and there, which is possible by filling the first bit of every OPUS frame with a predictable "noise" pattern.

I think it is simply wrong to assume that "the internet connection" will always have packet drops.

@corrados
Contributor

When you know the exact clock rate difference, you can subtract or add it to the sound card jitter, to reduce the size of the jitter buffer needed.

Right, you could do that. But then you have to drop a packet from time to time to correct for the different clock rates. The jitter buffer just does exactly this anyway.

So every second with "perfect" transmission the jitter buffer grows by 1 packet. This also adds up for the latency, right?

This depends on the selected jitter buffer size. If the jitter buffer size is set so that it corrects 1 packet per second, then you have exactly the same behavior.

I think it is simply wrong to assume that "the internet connection" will always have packet drops.

Sure, you cannot say that the network always drops packets (do I do that in my paper? ;-) ). It depends on what type of network you have: there are more reliable connections and less reliable connections. E.g., over WLAN you will get packet loss in most cases.

Usually Jamulus users will have network issues caused by other devices overloading the network with updates, video streams, etc. In that case you will not be able to estimate the clock offset reliably. Only in the special case of a near-perfect network do you have a chance to estimate it and drop some audio blocks at the right time to compensate for the clock offset. But again, this is what the jitter buffer already does.

@corrados
Contributor

Here is the jitter I get if I use the Lexicon Omega under MacOS with a 128-sample block size:
[plot: jitter, 128-sample block size]
And this is with a 64-sample block size:
[plot: jitter, 64-sample block size]
As you can see, I have very low jitter.
When I do my Jamulus jam sessions, I usually use 64 samples under MacOS. That gives me very good performance.

@hselasky
Contributor Author

I see. Let me do a proof of concept on a branch of my own and see if it makes any difference, hopefully without breaking too many of the established concepts.

@hselasky
Contributor Author

over WLAN you will get packet loss in most cases.

Yeah. Been there, done that. It does not work, even with expensive devices like those from Ubiquiti.

@hselasky
Contributor Author

I reduced this pull request to only contain two small fixes, hopefully increasing the chance of it being merged.

@corrados
Contributor

corrados commented Aug 26, 2020

Could you please split the pull request further? That makes integration and testing easier. One pull request for each "topic", e.g. "replacing the precise timer for ping", would be great. Thanks.

@hselasky
Contributor Author

@corrados : Yes.

Need to ensure that JACK callbacks see the running flag as false after
being stopped. Use a QMutex for this.

Signed-off-by: Hans Petter Selasky <hps@selasky.org>
@hselasky hselasky changed the title Various fixes Fix for crash when using the JACK backend and quickly reconfiguring Aug 26, 2020
@corrados corrados merged commit 05783a6 into jamulussoftware:master Aug 29, 2020