Adds timsort and fix issue #3265 #3616

14427 · 2012-09-28T02:33:10Z

No description provided.

brson · 2012-09-30T22:12:28Z

Besides being unsure about the unsafe code, this looks good to me.

It would be nice to refactor std::sort a bit more in the future. le methods can be replaced by Ord. Each of the sorts should probably get their own module. There should probably be some kind of sorting trait. But that all can be done later. Also, we probably don't need this many sort functions.

brson · 2012-09-30T22:44:00Z

Did you compare the performance of this sort to any of the existing ones?

14427 · 2012-09-30T22:54:33Z

It looks like while unique pointers work fine, the sort seems to break the refcount of managed pointers. MergeState's destructor calls set_len, so even if the task fails, tmp gets cleaned up without double-freeing or running the destructors of anything left in it. The performance is better than merge_sort in all the cases I tested, and everywhere that quick_sort is faster, quick_sort3 beats it.
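A minimal sketch of the destructor trick described above, written in present-day Rust rather than the 2012 syntax of the PR. `TmpGuard` and `stash` are hypothetical names standing in for MergeState and its temporary buffer; the only point illustrated is that setting the length to zero in the destructor frees the buffer without running the destructors of the bitwise copies left in it.

```rust
use std::ptr;

/// Hypothetical stand-in for the PR's MergeState: `tmp` holds bitwise copies
/// of elements that are still owned by the slice being sorted.
pub struct TmpGuard<T> {
    tmp: Vec<T>,
}

impl<T> TmpGuard<T> {
    pub fn with_capacity(n: usize) -> Self {
        TmpGuard { tmp: Vec::with_capacity(n) }
    }

    /// Bitwise-copy `src` into `tmp` without transferring ownership.
    pub fn stash(&mut self, src: &[T]) {
        assert!(src.len() <= self.tmp.capacity());
        unsafe {
            ptr::copy_nonoverlapping(src.as_ptr(), self.tmp.as_mut_ptr(), src.len());
            self.tmp.set_len(src.len());
        }
    }
}

impl<T> Drop for TmpGuard<T> {
    fn drop(&mut self) {
        // Forget the contents: the buffer itself is still freed when the Vec
        // is dropped, but no element destructor runs, so nothing is freed twice.
        unsafe { self.tmp.set_len(0) };
    }
}
```

Because the reset happens in `Drop`, it also runs while unwinding, which is the case the comment above is concerned with.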

14427 · 2012-09-30T23:10:16Z

Here are the perf numbers:
https://gist.github.com/3808689

The benchmark is a translation of python's sortperf.py sort benchmark

14427 · 2012-09-30T23:11:26Z

Here's python's source: http://code.google.com/p/unladen-swallow/source/browse/trunk/Lib/test/sortperf.py
The quicksort numbers don't go on as long, because they run out of stack.

14427 · 2012-09-30T23:29:36Z

I also checked that dtors run, so other than leaking managed pointers some of the time, everything works.

brson · 2012-10-01T00:53:44Z

Those are very promising numbers!

14427 · 2012-10-01T19:11:50Z

I also double-checked the managed pointer thing, and it turns out that it works; I just wrote my refcount test wrong. It always expected a count of 1, but several tests duplicated elements, which increased the refcounts.
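To illustrate the kind of check that avoids this mistake, here is a small sketch in present-day Rust, using `Rc` as a stand-in for 2012-era managed pointers. The helper name `refcounts_match_occurrences` is made up for this example, and it assumes no handles to the elements exist outside the vector.

```rust
use std::rc::Rc;

/// Returns true if every element's strong count equals the number of times
/// that same allocation appears in `v`. Assumes nothing outside `v` holds a
/// handle to any element.
pub fn refcounts_match_occurrences<T>(v: &[Rc<T>]) -> bool {
    v.iter().all(|x| {
        let occurrences = v.iter().filter(|y| Rc::ptr_eq(x, y)).count();
        Rc::strong_count(x) == occurrences
    })
}
```

A test would build the vector, drop any outside handles, sort, and then assert this predicate. Expecting a count of exactly 1 only holds when no element is duplicated.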

brson · 2012-10-01T20:07:32Z

This is going to need more tests. It's a complicated algorithm and all the included tests are on scalar data with small vectors. It looks to me like all the test cases cause timsort to immediately bail to the fast path, so there is no coverage for the difficult parts.

Needs tests with both managed and owned boxes, and with cases that cover the different strategies used by timsort. I know it's a bit more work, but it's important.
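A rough sketch of what such coverage might look like, in present-day Rust. Everything here is an assumption for illustration: the input shapes are guesses at what pushes a run-based sort past its fast path, `Box` and `Rc` stand in for owned and managed boxes, and the standard library's `sort` stands in for the sort under test.

```rust
use std::rc::Rc;

/// Tiny deterministic pseudo-random generator (an LCG), so the sketch needs no crates.
pub struct Lcg(pub u64);

impl Lcg {
    pub fn next_u32(&mut self) -> u32 {
        self.0 = self
            .0
            .wrapping_mul(6364136223846793005)
            .wrapping_add(1442695040888963407);
        (self.0 >> 33) as u32
    }
}

/// Input shapes meant to reach different code paths of a run-based sort.
pub fn patterns(n: usize) -> Vec<(&'static str, Vec<u32>)> {
    let mut rng = Lcg(42);
    let ascending: Vec<u32> = (0..n as u32).collect();
    let descending: Vec<u32> = ascending.iter().rev().copied().collect();
    let random: Vec<u32> = (0..n).map(|_| rng.next_u32()).collect();
    let few_unique: Vec<u32> = (0..n).map(|_| rng.next_u32() % 4).collect();
    // Long sorted prefix followed by a short random tail.
    let mut random_tail = ascending.clone();
    for x in random_tail.iter_mut().skip(n - n / 10) {
        *x = rng.next_u32();
    }
    vec![
        ("ascending", ascending),
        ("descending", descending),
        ("random", random),
        ("few_unique", few_unique),
        ("random_tail", random_tail),
    ]
}

/// Sorts the same data wrapped in owned and reference-counted boxes and
/// compares both results against the sorted plain values.
pub fn check_with_wrappers(input: &[u32]) -> bool {
    let mut expected = input.to_vec();
    expected.sort();
    let mut boxed: Vec<Box<u32>> = input.iter().map(|&x| Box::new(x)).collect();
    boxed.sort();
    let mut counted: Vec<Rc<u32>> = input.iter().map(|&x| Rc::new(x)).collect();
    counted.sort();
    boxed.iter().map(|b| **b).eq(expected.iter().copied())
        && counted.iter().map(|r| **r).eq(expected.iter().copied())
}
```

Running `check_with_wrappers` over `patterns(n)` for an `n` in the thousands keeps the inputs well above small-vector sizes.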

FWIW, I've poked at writing more tests a few times, but don't have enough time right now to keep at it.

14427 · 2012-10-04T03:09:35Z

I'm hitting some ICEs with the test suite. Just updated my branch to see if it fixes things.

14427 · 2012-10-04T16:56:02Z

Here's the backtrace from the ICE: https://gist.github.com/3834914
I'll see if I can narrow down what causes it and file an issue.
I'll also submit the rest of the testsuite commented out.

graydon · 2012-10-04T21:56:14Z

Tricky. On the one hand, I'm excited to see such improved performance; on the other it seems like "unsafe required to do sorting" is a bad sign, and I'm pretty hesitant to merge something ostensibly high-level like a sorting algorithm that has unsafe code all through it.

I'm curious exactly how much the performance difference just has to do with hitting native memcpy. If it's "most", we should probably take a step back and figure out how to make vec-to-vec copies in general speed-competitive.

14427 · 2012-10-04T22:52:35Z

The unsafe code isn't for performance; it's there so the code doesn't do any copies anywhere, which lets it work on non-copyable data structures. Most of the performance difference comes from doing fewer compares (about 40 times fewer than quick_sort3 on random data). When I added a TLS get and set call to the compare function to collect the compare counts, the performance of tim_sort was actually about the same as quick_sort3 on random data.
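For readers who want to reproduce this kind of measurement, here is a minimal sketch of counting compares through thread-local storage, in present-day Rust. `COMPARES`, `counting_cmp`, and `count_compares` are hypothetical names, and the standard library's `sort_by` stands in for the sorts being compared; this is not the instrumentation used for the numbers above.

```rust
use std::cell::Cell;
use std::cmp::Ordering;

thread_local! {
    // Per-thread comparison counter.
    static COMPARES: Cell<u64> = Cell::new(0);
}

/// Compares two values and bumps the thread-local counter.
pub fn counting_cmp<T: Ord>(a: &T, b: &T) -> Ordering {
    COMPARES.with(|c| c.set(c.get() + 1));
    a.cmp(b)
}

/// Sorts `v` in place and returns how many comparisons the sort performed.
pub fn count_compares<T: Ord>(v: &mut [T]) -> u64 {
    COMPARES.with(|c| c.set(0));
    v.sort_by(|a, b| counting_cmp(a, b));
    COMPARES.with(|c| c.get())
}
```

The thread-local get and set on every compare adds overhead of its own, so timings taken with the counter in place are not comparable to timings taken without it.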

14427 · 2012-10-20T20:14:28Z

Broken pending #3821

brson · 2012-10-24T18:30:17Z

This builds and tests for me now. On IRC we discussed reworking this to not use unsafe code, at the expense of adding a Copy bound, and I think that would make everyone more comfortable.
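To make the trade-off concrete, here is a sketch of a merge sort that stays entirely in safe code by requiring `Copy`, written in present-day Rust. It is a plain top-down merge sort for illustration, not timsort and not the code in this PR; `merge_sort_copy` is a made-up name.

```rust
/// Safe top-down merge sort. `T: Copy` lets elements be duplicated into a
/// scratch buffer, so no unsafe code is needed.
pub fn merge_sort_copy<T: Copy + Ord>(v: &mut [T]) {
    let n = v.len();
    if n < 2 {
        return;
    }
    let mid = n / 2;
    merge_sort_copy(&mut v[..mid]);
    merge_sort_copy(&mut v[mid..]);

    // Copy the left half out so its slots in `v` can be overwritten.
    let left: Vec<T> = v[..mid].to_vec();
    let (mut i, mut j, mut k) = (0, mid, 0);
    while i < left.len() && j < n {
        // `<=` takes from the left half on ties, preserving original order.
        if left[i] <= v[j] {
            v[k] = left[i];
            i += 1;
        } else {
            v[k] = v[j];
            j += 1;
        }
        k += 1;
    }
    // Leftovers from `left` go at the end; leftovers from the right half
    // are already in their final positions.
    while i < left.len() {
        v[k] = left[i];
        i += 1;
        k += 1;
    }
}
```

The cost of the bound is visible at the call site: a vector of `Box<u32>` is rejected by the compiler because `Box` is not `Copy`, which is the class of input the unsafe version was written to support.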

nikomatsakis · 2012-10-25T03:07:48Z

+1 to removing unsafe code for the time being, at least.

14427 · 2012-10-25T05:32:42Z

The current code is generating a lot of warnings in the tests about instantiating non-implicitly copyable types, and I'm not sure how to fix them. Other than that, the Copy code is done.

brson · 2012-10-25T21:19:38Z

Merged. Thanks!

graydon · 2012-11-27T23:09:11Z

This patch doesn't have enough contact information to determine the author (and the github account associated is pretty sparse). Can we get a name and email address?

14427 added 7 commits September 25, 2012 17:53

Add timsort to std/sort

cef7763

Fix timsort to use updated vec::reserve

f98f00f

Put function argument last in sort function. Fixes rust-lang#3265.

868d101

Remove trailing whitespace

f34c4f4

Switch order of merge_sort arguments in some benchmarks

4f9f1c5

Export timsort

f7be2d9

Add a simple testsuite for timsort

4d30d7f

Make local variables and methods use underscores not camel case

579c7e3

14427 added 3 commits October 3, 2012 09:51

Add cleanup code so the the array remains in a valid state if a comparison fails

0ec5c9a

Backup changes before pull from incoming

44f8a44

Merge remote-tracking branch 'original/incoming' into incoming

efcd238

Conflicts: src/libstd/json.rs src/libstd/sort.rs

14427 added 2 commits October 4, 2012 11:24

Fix my merge

455591d

Get tim_sort working, add test for double-freeing elements in tmp

7bd48b9

14427 added 4 commits October 4, 2012 21:35

Finish up tests, uncomment when ICE is fixed.

74246d4

Remove debug info

eee86d4

Add a test to check that badly written Ord impl do not cause double frees

d4a5483

Merge remote-tracking branch 'original/incoming' into incoming

0e3bec0

14427 added 7 commits October 22, 2012 18:33

Fix up tests, export tim_sort

9aec7a3

Merge remote-tracking branch 'original/incoming' into incoming

cc0f2c6

Uncomment tests and fix binarysort segmentation fault

71c311c

Use explicit self

1380776

Fix long line

781e446

Fix typo

254a86e

Re-add bad Ord impl test

e0a9d41

14427 added 6 commits October 24, 2012 19:15

Add copy bound to sort

fb61f91

Fix tests for Copy bound

19a59cb

Remove some code that MergeState used to prevent double frees

046460c

Remove and comment out more MergeState code

8e6d209

Remove commented out code

98c8a40

Move binarysort out of MergeState

f2216ec

Remove some unused MergeState code, add a Fixme and remove a workaround involving pure code not being considered pure

d4432a7

brson merged commit d4432a7 into rust-lang:incoming Oct 25, 2012

bors pushed a commit to rust-lang-ci/rust that referenced this pull request May 15, 2021

Use trait to abstract emit modes (rust-lang#3616)

dbac28b

RalfJung pushed a commit to RalfJung/rust that referenced this pull request May 19, 2024

Auto merge of rust-lang#3616 - RalfJung:android, r=RalfJung

0e41a80

make basic things work on Android Fixes rust-lang/miri#3608
