Fixing 1836, adding parallel::sort #1850

biddisco · 2015-11-04T19:46:49Z

Only parallel::sort is included at the moment. I will work on stable_sort and sort by key (indirect sort) as time permits. Not expecting a merge right away, but let me know what is wrong with the code/style etc and if I've missed anything crucial.

Original implementation is available at git@github.com:fjtapia/sort_parallel.git HPX algorithm uses dataflow to avoid the use of a stack + spinlock and executes all steps using async calls which are run on HPX threads. No need to specify how many threads the algorithm is run on as it will simply spawn tasks and the thread handling will be taken care of by the HPX runtime.

… sort implementation Note : Cherry picked Doc updates from parallel_sort branch

…oured Lots of cleanups and compilation fixes for windows.

Internal struct holding comparator was going out of scope when async returns, causing segfaults inside the sort function. Remove struct and copy the comparator, the minor optimization of using only one comparator object would only be significant if the user supplies a comparator with complex internal state, which is unlikely.

hkaiser · 2015-11-04T19:54:32Z

hpx/parallel/algorithms/sort.hpp

+        {
+            typedef typename
+                std::remove_reference<decltype(*(std::declval<RandomIt>()))>::type type;
+        };


Looks like, this could be replaced by std::iterator_traits<RandomIt>::value_type.

- adding projection parameter - adding more docs - renaming variables to be lower-case only - adding concept checks - fixing (suppressing) various warnings about possible loss of precision - switching to boost::random for older compilers - breaking long lines

biddisco · 2015-11-06T08:26:03Z

I'm not sure why the std::random was changed to boost::random (which caused a link error in the latest build). We already require c++11 which provides <random>.

hkaiser · 2015-11-06T12:07:38Z

I'm not sure why the std::random was changed to boost::random (which caused a link error in the latest build). We already require c++11 which provides .

Uhh ohh. My recollection was that not all of our platforms support random_device. I apologize if I broke something. I'll see what I can do to rectify the problem asap.

- add command line option for seed

biddisco · 2015-11-06T21:50:28Z

The improvments look good to me

biddisco · 2015-11-07T10:29:45Z

I added some tests using std::string, but I am concerned that it can take >30secs for all the combinations to run as there are now quite a few and we need fairly long vectors to stress the test. Feel free to remove some combinations if the test is now too slow.

hkaiser · 2015-11-07T13:56:05Z

Would it make sense to isolate the std::string tests into a separate test-executable?

biddisco · 2015-11-07T14:34:39Z

Depends on how strongly people feel about shortness of tests, there are about 60 combinations of test params in the current test (policy, type, comparison) and that's quite thorough, so some could be removed without much chance of letting bugs in. Do we really expect it to pass for sort<int> but fail for sort<double> - I doubt it, so I'll thin the tests down a bit.
Do we have a target test run time? (ie less than N seconds preferred - on an average workstation)

hkaiser · 2015-11-07T14:48:50Z

We don't have a runtime limitation for single tests. The only known limitation is that Intel compilers tend to bail out on large tests as those instantiate a lot of different templates.

hkaiser · 2015-11-07T18:17:34Z

hpx/parallel/algorithms/sort.hpp

+            //----------------------------------------------------------------
+
+            //------------------- check if sorted ----------------------------
+            if (detail::is_sorted_sequential(first, last, comp))


After more thinking - I believe the is_sorted test should be done for the leaves only (before line 101 above?). Currently the algorithm re-scans the input data over and over again. This can be a big hit for nearly sorted data, mainly.

The current code assumes that testing the data sequence for being already sorted adds less overheads than spawning the two sub-tasks for the two partitions. This might be a valid assumption for kernel threads, it might not be a correct assessment for HPX threads, however.

biddisco · 2015-11-07T21:44:09Z

Actually, there's no need to call is_sorted at all. If the data is sorted then std::sort at the leaf will do the right thing, if not, it will still do the right thing. I removed it and the tests still pass. The speed difference is not significant because most of the time the data is not sorted and the is_sorted exits almost immediately. I'll remove it.

biddisco · 2015-11-08T00:22:48Z

I retract my previous comment. For small types (up to 128 bytes) removing is_sorted either makes no differnce, or helps slightly. For larger types and strings removing is_sorted has a significant detrimental impact on performance. I will investigate further.

- this also fixes make_exceptional_future() for arbitrary exception types

Fixing 1836, adding parallel::sort

biddisco and others added 12 commits October 31, 2015 23:50

Cleanup code, remove unused vars, conform to hpx style

6ee18bc

Started to create infrastructure for integrating Francisco's parallel…

2d47f23

… sort implementation Note : Cherry picked Doc updates from parallel_sort branch

Get sort working with correct dispatch according to execution policies

7cb7c98

Better handling of async, par, par(task) etc policy dispatch

72aae05

Add unit test for hpx::parallel::sort

fbba83c

parallel:sort test fixes and improvements

bfe4987

use algorithm_result<ExPolicy>::get to ensure execution policy is hon…

94de3b4

…oured Lots of cleanups and compilation fixes for windows.

Fix missing #include

07319eb

Cleanup parallel::sort test, remove some unused code

affaea4

Add more types to parallel::sort test

8633228

hkaiser added type: enhancement affecting: CSCS category: algorithms labels Nov 4, 2015

hkaiser reviewed Nov 4, 2015
View reviewed changes

biddisco force-pushed the fixing_1836 branch 3 times, most recently from c2cb807 to e20e482 Compare November 5, 2015 08:19

Fix compile warnings and Boost.Inspect defects

a3cd1f9

biddisco force-pushed the fixing_1836 branch 2 times, most recently from 7615519 to 6efb0d5 Compare November 5, 2015 10:32

Print timing for parallel::sort tests

6d769ce

biddisco force-pushed the fixing_1836 branch from 6efb0d5 to 6d769ce Compare November 5, 2015 13:55

hkaiser added this to the 0.9.11 milestone Nov 5, 2015

hkaiser force-pushed the fixing_1836 branch from d9f7df8 to 15168f1 Compare November 5, 2015 19:21

hkaiser mentioned this pull request Nov 5, 2015

Implement N4409 on top of HPX #1141

Closed

47 tasks

hkaiser added 3 commits November 6, 2015 09:23

Replace use of random_device with srand/rand

dcf099f

- add command line option for seed

Switching parallel::sort to use executors instead of plain async

fcedf5e

Adding proper exception handling to parallel::sort

34bfca4

biddisco mentioned this pull request Nov 6, 2015

hpx::parallel does not have a sort implementation #1836

Closed

6 tasks

Add std::string test for parallel::sort

8b66cca

biddisco force-pushed the fixing_1836 branch from c240af0 to 8b66cca Compare November 7, 2015 13:49

hkaiser reviewed Nov 7, 2015
View reviewed changes

Improved exception handling for parallel::sort

d49f83b

hkaiser force-pushed the fixing_1836 branch from 9f4f918 to d49f83b Compare November 8, 2015 16:47

Adding exception tests

d602708

- this also fixes make_exceptional_future() for arbitrary exception types

hkaiser force-pushed the fixing_1836 branch from 3c91cec to d602708 Compare November 9, 2015 22:11

hkaiser added a commit that referenced this pull request Nov 10, 2015

Merge pull request #1850 from STEllAR-GROUP/fixing_1836

81f8020

Fixing 1836, adding parallel::sort

hkaiser merged commit 81f8020 into release Nov 10, 2015

hkaiser deleted the fixing_1836 branch November 10, 2015 00:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing 1836, adding parallel::sort #1850

Fixing 1836, adding parallel::sort #1850

biddisco commented Nov 4, 2015

hkaiser Nov 4, 2015

biddisco commented Nov 6, 2015

hkaiser commented Nov 6, 2015

biddisco commented Nov 6, 2015

biddisco commented Nov 7, 2015

hkaiser commented Nov 7, 2015

biddisco commented Nov 7, 2015

hkaiser commented Nov 7, 2015

hkaiser Nov 7, 2015

biddisco commented Nov 7, 2015

biddisco commented Nov 8, 2015

Fixing 1836, adding parallel::sort #1850

Fixing 1836, adding parallel::sort #1850

Conversation

biddisco commented Nov 4, 2015

hkaiser Nov 4, 2015

Choose a reason for hiding this comment

biddisco commented Nov 6, 2015

hkaiser commented Nov 6, 2015

biddisco commented Nov 6, 2015

biddisco commented Nov 7, 2015

hkaiser commented Nov 7, 2015

biddisco commented Nov 7, 2015

hkaiser commented Nov 7, 2015

hkaiser Nov 7, 2015

Choose a reason for hiding this comment

biddisco commented Nov 7, 2015

biddisco commented Nov 8, 2015