-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Median candidate analysis #10
Comments
One thing I'd like to note that the 'piecewise-linear' case in glidesort is taken care of by run analysis and merging, and thus is not a deciding factor for the quicksort pivot selection. Instead I wanted to minimize the number of cache lines that the recursive method touches. That's why I don't do even spacing. It is an interesting idea to change up the grouping though. I would venture to guess that |
The script tests all combinations, so if A cache line is 64 bytes, so 16 elements in the 4-byte case, and the smallest gap is between 1 and 8 elements, so groups of 3 will usually fit in one line and adjacent groups might share one? That's better than I realized; I hadn't worked through all the Yes, natural run detection catches exact piecewise-linear inputs (if the number of pieces is small enough, as it is here). I expect it's common to have inputs that follow the basic idea of "a value is close to neighboring values" without large natural runs: piecewise-linear with a little noise or a few swaps, or small ordered sections, say. |
Code simplicity translates to code size, and I'd like to minimize that where possible. There's definitely experiments to be had though. |
It occurred to me to try a narrower distribution, so I changed the 0, 4, 7 offsets to 1, 4, 6. It's a major improvement! Overall this distribution is much closer to even, so it makes sense that the performance would be more similar. I'm not sure whether this is applicable to the recursive case, but a reason to think so is that the median of 3 seems to do best with an even or very slightly narrower distribution. A way to look at this is that the ends of the list are likely to be atypical, and for a median we don't want a representative picture of the whole list, just a best guess at what's typical. Which seems like a sound principle even outside the piecewise-linear framework. Even distribution:
Glidesort based on 1, 4, 6 (approx 0.148, 0.195, 0.227, 0.523, 0.57, 0.602, 0.773, 0.82, 0.852):
Glidesort based on 1, 3.5, 6 (approx 0.148, 0.188, 0.227, 0.461, 0.5, 0.539, 0.773, 0.813, 0.852):
With this last balanced distribution, which would change |
Outer offsets of 1, 3.5, 6 combined with inner offsets of 0, 3.5, 7 do better still, so the better thing to do, going by this model, is try to evenly space the initial selection of three regions instead of putting the outer two at the edges. This kind of points to the fact that even spacing has a scaling factor of 3 instead of 8, so relative to scaling by 8 it keeps getting wider at each step. Regarding the scheme as a whole, here's the dilemma I see. It does take position into account, as otherwise you'd just pick a bunch of elements from the start of the array and be done with it. The reason for this has to be concern about spatial locality, right? But both empirically and intuitively, I'm finding that the fractal recursion doesn't do much better than a median of 3 at handling locality (it will filter noise fine of course). Tight median-of-3 at the lowest level or two to compute faster and save cache makes sense to me; for higher levels my still-tentative thinking is that it's best to go with more even spacing, and reordering or larger medians if possible. (I'll stop bothering you. Felt if I'm going to do this study I may as well share it...) |
Actually my main justification for spreading out our pivot sample over the input array is distribution drift. The elements at the start of the array might not follow the same distribution as those somewhere else. This is rare in benchmarks but not necessarily so in real world data. |
I fat fingered the close button, oops. |
Talking about perf, I'll add that in my testing just median of 3 for small sizes < threshold e.g. 64 is not ideal for sorting integers. Getting a higher quality median pays of with a total runtime improvement of a couple percent. I recently simplified pivot selection for ipnsort Voultapher/sort-research-rs@4c5a892 but I'm still not happy with it. |
Yes, as generated by the |
Have had this half-finished for a while; I'm writing up what I can even though there's more to investigate. CC @scandum, @Voultapher as I believe ipn's picked up glidesort's candidate approach.
I looked into how the choice of pivot candidates affects median accuracy using a simulation on random piecewise-linear inputs. These seem like they capture one type of order that might be expected in an input, and I don't have any other promising ideas for testing that.
I found that the accuracy depends not only on the way candidates are grouped into sets of 3 (what I initially wanted to test) but also the positions of those candidates (unexpected, but obvious in retrospect). So I measured both for an even distribution, and Glidesort's recursive selection, using midpoints of the nine intervals that come out of the recursion. I copied the arithmetic for this by hand; it looks right but it's possible I made a mistake here. I've also tested various positions not shown here. Error is measured as cost, assuming sorting each of the two partitions takes time proportional to n log(n). For context, the baseline cost is 8965784, so that a relatively high cost difference of 30000 adds 0.33% to total sorting cost due to that step (it'll compound at every partition). The tables below show cost for the best and worst groupings after testing every possibility, as well as some others:
The arrangement 011201220 that I've used and recommended previously does badly, often worse than 000111222.
Beyond any particular choice of grouping it doesn't seem like Glidesort's positions do well: the "true of 9" row is the true median of those and is worse than most pseudomedians with even spacing. The middle 4 makes Glidesort's positions skewed and prevents perfect performance on 1-piece (totally linear) inputs, but changing it to 3.5 isn't much of an improvement (in fact the true median is worse for 2-piece inputs). So I think it's mainly the clustering that weakens it, although I can't say exactly why the effect is so strong. Something that seems a little better is to use 3 as the midpoint for one or two intervals and 4 for the others.
I also did some theoretical analysis on the median-of-median-of-... idea. I find that 3^k candidates processed with recursive medians have about as much power as a true median of (π/2)*2.25^k pivots. For example the pseudomedian of 243 for a 32768-element list is worth a true median of 91. The probability distributions for the median value found this way with random candidates are equal at the exact midpoint, and the shapes seem similar (if anything the pseudomedian is wider). I can write the math up if you're interested.
Even spacing (0.5/9, ..., 8.5/9):
Glidesort arrangement (approx 0.008, 0.07, 0.117, 0.508, 0.57, 0.617, 0.883, 0.945, 0.992):
Source for this, run with CBQN. I can translate to some other language on request. The inputs are created with
•rand
which changes between runs, but the results above don't change significantly.The text was updated successfully, but these errors were encountered: