motivation

I started looking at Julia's Quicksort after finding issue #939. The issue concerns the performance of sort!(). For primitive datatypes, Julia invokes Quicksort, so the general purpose here is to increase the performance of the standard library implementation of Quicksort.

current implementation issues

While looking at Julia's current implementation of Quicksort, several issues became apparent:

Each pivot element is not guaranteed to be placed in its proper position in the array after each pass
the algorithm may have to look at more elements than are necessary
The algorithm fails to complete without the insertion sort optimization. The j scan runs out of bounds when scanning small arrays
this isn't necessarily a huge issue since insertion sort is invoked for these small arrays

As a quick example, let's look at Julia's pure Quicksort on array [4, 10, 11, 24, 9], without the insertion sort optimization hi-lo <= SMALL_THRESHOLD && return isort(v, lo, hi), and without the @inbounds macro.

The value of the pivot index on the first pass is 11, determined by p = (lo+hi)>>>1. However after the first pass, the array becomes [4, 10, 9, 24, 11]. If we let the algorithm try to run to completion, the following error is raised ERROR: BoundsError() on the j index crawl while isless(pivot, v[j]); j -= 1; end.

potential improvements

Improvements are proposed with four permutations of Quicksort, outlined in qsorts.jl.

The canonical Quicksort algorithm that places the pivot in its natural position after each pass. The pivot is chosen as the first element in each subarray
Canonical, and performs a random shuffle (Fisher-Yates) of the array before the initial sort
Canonical, but uses a median of three sampled values (indexes lo, (hi+lo)>>>1, hi) as the pivot
Canonical with both a median of three pivot, and the random shuffle

Some low-hanging fruit optimizations were made to the canonical example, specifically the @inbounds macro which speeds up array access (~2x boost in performance), and the lack of bounds checking on the j index scan.

benchmarking

Naive benchmarking shows a speed and memory footprint improvement across the board over Julia's current Quicksort when all algorithms lack the @inbounds macro. For each sample, we create an array of random integers, copy the array for each implementation, and time each implementation. The results are then normalized to the standard libary's sort time. This method on 10^4 samples of 10^5-element random integer arrays produces

All algorithms without ```@inbounds``` macro

	Mean Ratio	Median Ratio
Canonical	0.9178107	0.9103574
Median-of-3 Pivot	0.9475761	0.9358107
Random Shuffle	0.9215353	0.9102481
Combo	0.9467139	0.9331511

Comparisons when including the ```@inbounds``` macro

When all algorithms have @inbounds, there is not an apparent speed improvement over the current algorithm for large random integer arrays. The following are results for the median of three pivot quicksort benchmarked against the standard library implementation, with 10^4 simulations, when both have the @inbounds macro.

Array Size	Mean Ratio	Median Ratio
10^3	0.999	0.998
10^4	1.011	1.016
10^5	1.011	1.018

Any comments would be greatly appreciated. It would also be great to add different array permutations (sorted, semi-sorted) to the comparisons.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.gitignore		.gitignore
README.md		README.md
benchmark.jl		benchmark.jl
candidates.jl		candidates.jl
qsorts.jl		qsorts.jl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

motivation

current implementation issues

potential improvements

benchmarking

All algorithms without ```@inbounds``` macro

Comparisons when including the ```@inbounds``` macro

About

Releases

Packages

Languages

illerucis/julia_qsortbenchmarks

Folders and files

Latest commit

History

Repository files navigation

motivation

current implementation issues

potential improvements

benchmarking

All algorithms without ```@inbounds``` macro

Comparisons when including the ```@inbounds``` macro

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages