Add bounds checks in SDL_qsort #10066

aikawayataro · 2024-06-20T01:37:26Z

Description

Updating qsort implementation fixed only part of the non-transitive compare issue.
Using such a compare function should be considered a user code bug, but I believe it's better not to crash the whole program.
I set up a fuzzer and found a few unchecked memory reads and writes. With the proposed changes, qsort will not crash with invalid compare functions.
Fuzzer source code: fuzzer.zip

madebr · 2024-06-20T12:58:18Z

These checks are insufficient.
When adding extra tests to testqsort.c (madebr@1ea94b0), ci fails:
https://github.com/madebr/SDL/actions/runs/9597768806/job/26467538455

Feel free to use the commit in this pr to verify the fixes.

aikawayataro · 2024-06-20T19:55:52Z

@madebr You're accessing a zero-sized allocated block at line 61 when arraylen=0 😁 there's no memory corruptions from qsort.
As for your test of non-transitive compare, this usecase, which is considered invalid and should not be tested.

DanielGibson · 2024-06-20T21:10:25Z

As for your test of non-transitive compare, this usecase, which is considered invalid and should not be tested.

It's not valid, but testing it to make sure it at least doesn't crash makes sense (that's what these changes are about, after all).
But you can't expect the data to be sorted afterwards, so verifying any order doesn't make sense (even the "incorrect" order after running qsort with invalid comparator on a given array might not be deterministic in the long term, when the qsort implementation is updated again; and it might even be different depending on wordsize and whatever potentially platform-dependent special cases that implementation handles).

(No idea what exactly goes wrong in the qsort test, unless I missed something the log only mentions an incorrect exit code?)

madebr · 2024-06-20T21:14:00Z

@madebr You're accessing a zero-sized allocated block at line 61 when arraylen=0 😁 there's no memory corruptions from qsort. As for your test of non-transitive compare, this usecase, which is considered invalid and should not be tested.

Whoops! Good catch.
I think my changes still make sense, with a fix for the bug you noticed.

    if (arraylen > 0) {
        prev = nums[0];
    }

aikawayataro · 2024-06-20T22:12:45Z

It's not valid, but testing it to make sure it at least doesn't crash makes sense (that's what these changes are about, after all).
But you can't expect the data to be sorted afterwards, so verifying any order doesn't make sense (even the "incorrect" order after running qsort with invalid comparator on a given array might not be deterministic in the long term, when the qsort implementation is updated again; and it might even be different depending on wordsize and whatever potentially platform-dependent special cases that implementation handles).

That's what I meant to say, we shouldn't test the order, just the function.

(No idea what exactly goes wrong in the qsort test, unless I missed something the log only mentions an incorrect exit code?)

Because of a bug in the test itself, there's a segfault.

I think my changes still make sense, with a fix for the bug you noticed.

Test runs just fine with your fix (there was a note about a non-existent bug I found, apologies).
Also your arraylen with invalid compare function is too big because it takes crazy amounts of cpu compared to test with valid one. I guess I will add other arraylen values that will cover the whole qsort without very large values. We should also test aligned and unaligned branches. I'll add a test with these remarks in mind.

aikawayataro · 2024-06-21T03:24:12Z

I refactored the test, but honestly I think it does not look reasonable. The qsort used is in fact 3 qsorts for different cases. To test them all requires a lot of hackery (ignore failing build I can't figure out right pointer type for const array pointer).
What I believe is that we should not use qsort_aligned and qsort_words but use only plain unaligned version, which works in any case.

…orting

slouken · 2024-08-05T03:01:37Z

What's the status of this PR? Is it something we still need?

aikawayataro · 2024-08-05T06:04:25Z

@slouken
The current state is more like a draft. I've introduced tests to test all three implementations of sorting, but it looks crude to me.
What I propose is to get rid of two other qsort implementations and keep only qsort_r_nonaligned. It will be easier to test a single implementation, and I really don't see reason to keep qsort_r_aligned and qsort_r_words.

slouken · 2024-10-06T18:58:22Z

@slouken The current state is more like a draft. I've introduced tests to test all three implementations of sorting, but it looks crude to me. What I propose is to get rid of two other qsort implementations and keep only qsort_r_nonaligned. It will be easier to test a single implementation, and I really don't see reason to keep qsort_r_aligned and qsort_r_words.

I would tend to agree, but before we do that, we should test performance in release mode on a modern processor to see if we get significant speedup from those modes.

aikawayataro · 2024-10-07T13:26:20Z

Ok, I will set up a benchmark for this

aikawayataro · 2024-10-08T06:49:01Z

I benchmarked it and here are the results:

gcc 14.2.1 -O3
items=5000
rounds=50000
================================
qsort_r_words vs qsort_r_nonaligned for int
qsort_r_words took 171652
qsort_r_nonaligned took 179765
diff=8113, 8.113000ms
qsort_r_words is faster
================================
qsort_r_aligned vs qsort_r_nonaligned for big_struct sizeof=128
qsort_r_aligned took 213069
qsort_r_nonaligned took 202701
diff=10368, 10.368000ms
qsort_r_nonaligned is faster

gcc 14.2.1 -O2
items=5000
rounds=50000
================================
qsort_r_words vs qsort_r_nonaligned for int
qsort_r_words took 165439
qsort_r_nonaligned took 217981
diff=52542, 52.542000ms
qsort_r_words is faster
================================
qsort_r_aligned vs qsort_r_nonaligned for big_struct sizeof=128
qsort_r_aligned took 332536
qsort_r_nonaligned took 874643
diff=542107, 542.107000ms
qsort_r_aligned is faster

CPU: 12th Gen Intel(R) Core(TM) i7-12700H

Well, it makes things harder, I guess.
It can be observed that the nonaligned version performs better under O3 than the aligned version, but it is much slower under O2 (>x2 slower). words version is slightly faster than the nonaligned version in both cases.

I only compared these 2 pairs because
qsort_r_words only useful when size of item is sizeof(int) and when items buffer aligned as int
qsort_r_aligned only useful when item size is sizeof(X) % sizeof(int) == 0 and items buffer aligned as int (almost always due to padding)
qsort_r_nonaligned will always work
So it narrows everything down to these two "competing" cases.

Benchmark code bench.c.tar.gz

slouken · 2024-10-08T15:13:37Z

So it sounds like it's worthwhile keeping all 3 cases. Thanks for the investigation!

Add bounds checks to SDL_qsort

cad2dd8

sezero requested review from icculus and slouken June 20, 2024 01:49

aikawayataro mentioned this pull request Jun 20, 2024

SDL_qsort stack buffer overflow with non-transitive comparison function #10055

Closed

slouken added this to the 3.2.0 milestone Jun 21, 2024

aikawayataro force-pushed the qsort-patch branch from 3422807 to 83dee6e Compare June 21, 2024 03:13

Refactor testqsort to test non-aligned, aligned, and non-transitive s…

b67d9c5

…orting

aikawayataro force-pushed the qsort-patch branch from 83dee6e to b67d9c5 Compare June 21, 2024 03:54

sezero requested a review from madebr June 22, 2024 06:38

slouken marked this pull request as draft August 6, 2024 15:11

slouken modified the milestones: 3.2.0, 3.x Oct 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add bounds checks in SDL_qsort #10066

Add bounds checks in SDL_qsort #10066

aikawayataro commented Jun 20, 2024

madebr commented Jun 20, 2024

aikawayataro commented Jun 20, 2024

DanielGibson commented Jun 20, 2024

madebr commented Jun 20, 2024

aikawayataro commented Jun 20, 2024 •

edited

Loading

aikawayataro commented Jun 21, 2024

slouken commented Aug 5, 2024

aikawayataro commented Aug 5, 2024

slouken commented Oct 6, 2024

aikawayataro commented Oct 7, 2024

aikawayataro commented Oct 8, 2024

slouken commented Oct 8, 2024

Add bounds checks in SDL_qsort #10066

Are you sure you want to change the base?

Add bounds checks in SDL_qsort #10066

Conversation

aikawayataro commented Jun 20, 2024

Description

madebr commented Jun 20, 2024

aikawayataro commented Jun 20, 2024

DanielGibson commented Jun 20, 2024

madebr commented Jun 20, 2024

aikawayataro commented Jun 20, 2024 • edited Loading

aikawayataro commented Jun 21, 2024

slouken commented Aug 5, 2024

aikawayataro commented Aug 5, 2024

slouken commented Oct 6, 2024

aikawayataro commented Oct 7, 2024

aikawayataro commented Oct 8, 2024

slouken commented Oct 8, 2024

aikawayataro commented Jun 20, 2024 •

edited

Loading