
Lots of non-deterministic failures in univariate_statistics_test.cpp #585

Closed

jzmaddock opened this issue Mar 28, 2021 · 12 comments

@jzmaddock
Collaborator

These seem to be related to the gini calculation. I changed all the BOOST_TEST calls to BOOST_TEST_LT so we could see how close the failures were; here's a typical run with msvc-14.2:

D:\data\boost\boost\libs\math\test\univariate_statistics_test.cpp(694): test 'abs(gini - expected) < Real(0.03)' ('0.0487317' < '0.03') failed in function 'void __cdecl test_gini_coefficient<float,const class std::execution::parallel_policy&>(const class std::execution::parallel_policy &)'
D:\data\boost\boost\libs\math\test\univariate_statistics_test.cpp(694): test 'abs(gini - expected) < Real(0.03)' ('0.0593992' < '0.03') failed in function 'void __cdecl test_gini_coefficient<long double,const class std::execution::parallel_policy&>(const class std::execution::parallel_policy &)'
D:\data\boost\boost\libs\math\test\univariate_statistics_test.cpp(659): test 'abs(gini - expected) < tol' ('0.666667' < '5.34553e-50') failed in function 'void __cdecl test_gini_coefficient<class boost::multiprecision::number<class boost::multiprecision::backends::cpp_bin_float<50,10,void,int,0,0>,0>,const class std::execution::parallel_policy&>(const class std::execution::parallel_policy &)'
D:\data\boost\boost\libs\math\test\univariate_statistics_test.cpp(682): test 'abs(gini) < tol' ('0.222222' < '5.34553e-50') failed in function 'void __cdecl test_gini_coefficient<class boost::multiprecision::number<class boost::multiprecision::backends::cpp_bin_float<50,10,void,int,0,0>,0>,const class std::execution::parallel_policy&>(const class std::execution::parallel_policy &)'
D:\data\boost\boost\libs\math\test\univariate_statistics_test.cpp(694): test 'abs(gini - expected) < Real(0.03)' ('0.144181' < '0.03') failed in function 'void __cdecl test_gini_coefficient<class boost::multiprecision::number<class boost::multiprecision::backends::cpp_bin_float<50,10,void,int,0,0>,0>,const class std::execution::parallel_policy&>(const class std::execution::parallel_policy &)'
D:\data\boost\boost\libs\math\test\univariate_statistics_test.cpp(629): test 'abs(gini - 1) < tol' ('1' < '5.34553e-50') failed in function 'void __cdecl test_sample_gini_coefficient<class boost::multiprecision::number<class boost::multiprecision::backends::cpp_bin_float<50,10,void,int,0,0>,0>,const class std::execution::parallel_policy&>(const class std::execution::parallel_policy &)'
6 errors detected.

However, the output is different every time! Also note how large some of the errors are: something grievous is going on here, but I don't see it at present.

@mborland
Member

I remember seeing similar egregious failures with GCC 9, which used TBB instead of its own implementation of the parallelism TS. TBB fails tsan horribly, so I wonder if something similar is going on with MSVC?

@jzmaddock
Collaborator Author

I don't know, but I went back through the history and the issue is still present at 1eb3c71, which is the original #434 "Implement Policies in Statistics" commit.

mborland added a commit to mborland/math that referenced this issue Mar 28, 2021
@mborland
Member

Can you please give PR #586 a try?

@jzmaddock
Collaborator Author

Looking at:

template<typename ReturnType, typename ExecutionPolicy, typename RandomAccessIterator>
ReturnType gini_coefficient_parallel_impl(ExecutionPolicy&& exec, RandomAccessIterator first, RandomAccessIterator last)
{
    using Real = typename std::iterator_traits<RandomAccessIterator>::value_type;
    
    ReturnType i = 1;
    ReturnType num = 0;
    ReturnType denom = 0;
    
    std::for_each(exec, first, last, [&i, &num, &denom](const Real& val)
    {
        num = num + val * i;
        denom = denom + val;
        i = i + 1;
    });

    if(denom == 0)
    {
        return ReturnType(0);
    }

    return ((2*num)/denom - i)/(i-1);
}
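
For reference, after the loop i == n + 1, so the return statement computes the textbook sample Gini coefficient for sorted values $x_{(1)} \le \dots \le x_{(n)}$:

$$G = \frac{2\sum_{i=1}^{n} i\,x_{(i)}}{n\sum_{i=1}^{n} x_{(i)}} - \frac{n+1}{n}$$

a formula that is only valid when the values are in ascending order.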

I have 2 comments:

  1. How is there not a race condition here accessing num, denom, and i? (See the minimal repro after this list.)
  2. Does the order of the elements matter for the result? I ask because the parallel for_each can call the functor in any order, so i * val will be different for each run.
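
For illustration, here is a minimal standalone program (not from the library, just the same shared-accumulator pattern): the total it prints typically varies from run to run, exactly like the test failures above.

#include <algorithm>
#include <execution>
#include <iostream>
#include <vector>

int main()
{
    std::vector<int> v(1000000, 1);
    long sum = 0;
    // Data race: unsynchronized read-modify-write of `sum` from many threads.
    std::for_each(std::execution::par, v.begin(), v.end(),
                  [&sum](int x) { sum += x; });
    std::cout << sum << '\n'; // frequently prints less than 1000000
}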

@jzmaddock
Collaborator Author

OK, looking at this some more I think we have several issues:

  • Yes, there really is a race condition on modifying the shared variables.
  • The input data must be sorted, but parallel std::for_each can call the functor in any order, which effectively "unsorts" it.
  • I'm not convinced that gcc/libstdc++ is actually running the tests in parallel: there's some complicated logic going on, but on mingw at least the test code ends up in a sequential/serial for_each, which is why the tests pass there. More investigation is needed.

I fear the only correct way to implement this as a parallel algorithm is to divide the input range into N segments and create a future for each one, with each future returning a pair of partial sums that can then be accumulated at the end. Any other thoughts?
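
Something along these lines, purely as an untested sketch (the function name and the chunking strategy are mine, made up for illustration):

#include <algorithm>
#include <cstddef>
#include <future>
#include <iterator>
#include <thread>
#include <utility>
#include <vector>

template <typename ReturnType, typename RandomAccessIterator>
ReturnType gini_coefficient_segmented_impl(RandomAccessIterator first, RandomAccessIterator last)
{
    const std::size_t n = std::distance(first, last);
    const std::size_t threads = std::max<std::size_t>(1, std::thread::hardware_concurrency());
    const std::size_t segment = std::max<std::size_t>(1, n / threads);

    std::vector<std::future<std::pair<ReturnType, ReturnType>>> futures;
    for (std::size_t offset = 0; offset < n; offset += segment)
    {
        const std::size_t count = std::min(segment, n - offset);
        // Each task carries its absolute starting index, so the sorted order
        // of the input is respected no matter how the tasks are scheduled.
        futures.push_back(std::async(std::launch::async,
            [seg = first + offset, count, offset]()
            {
                ReturnType partial_num = 0;
                ReturnType partial_denom = 0;
                ReturnType i = ReturnType(offset + 1);
                for (std::size_t k = 0; k < count; ++k)
                {
                    partial_num += seg[k] * i;
                    partial_denom += seg[k];
                    i += 1;
                }
                return std::make_pair(partial_num, partial_denom);
            }));
    }

    // Accumulate the partial sums after each future completes, so there is
    // no shared mutable state between tasks.
    ReturnType num = 0;
    ReturnType denom = 0;
    for (auto& f : futures)
    {
        const auto partials = f.get();
        num += partials.first;
        denom += partials.second;
    }

    if (denom == 0)
    {
        return ReturnType(0);
    }

    const ReturnType i = ReturnType(n + 1); // the value i reaches in the serial version
    return ((2 * num) / denom - i) / (i - 1);
}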

We should probably review the other parallel algorithms for similar issues too.

@jzmaddock
Collaborator Author

A quick eyeball over the code found nothing else obvious. @mborland, has everything (regression tests and performance tests) been run through clang's thread sanitizer, just to be on the safe side?

@NAThompson
Collaborator

Do we have a threadsanitizer build in CI?

@jzmaddock
Collaborator Author

> Do we have a threadsanitizer build in CI?

No. I know I added some sanitizer runs to the multiprecision CI tests, but we have nothing here yet, I think?

In general they're quite hard to run as CI jobs: either the VM rejects them or there are false positives. But yes, we should try to set something up.

@mborland
Member

> I fear the only correct way to implement this as a parallel algorithm is to divide the input range into N segments and create a future for each one, with each future returning a pair of partial sums that can then be accumulated at the end. Any other thoughts?
>
> We should probably review the other parallel algorithms for similar issues too.

The bulk of the other algorithms use exactly what you are describing. They would serve as an effective starting point.

> A quick eyeball over the code found nothing else obvious. @mborland, has everything (regression tests and performance tests) been run through clang's thread sanitizer, just to be on the safe side?

In the original PR I ran everything through GCC's asan and tsan. Have not tried the Clang equivalent.

> Do we have a threadsanitizer build in CI?
>
> No. I know I added some sanitizer runs to the multiprecision CI tests, but we have nothing here yet, I think?

There is currently nothing in the CI to run asan/tsan/ubsan. Perhaps we could add a CircleCI config to run only these? The GHA runs already take many times longer now that most of Boost has moved to it, so I would recommend against adding more to GHA. GCC 11 and Clang 12 should be mainstream pretty soon here too...

@jzmaddock
Collaborator Author

Tentative CircleCI run added here: #592

This also adds doc building and an inspection report run.

@jzmaddock
Collaborator Author

Haha, well, CircleCI might not have been the best choice: apparently I have -24K free credits left!

jzmaddock added a commit that referenced this issue Mar 30, 2021
mborland added a commit to mborland/math that referenced this issue Mar 31, 2021
jzmaddock added a commit that referenced this issue Apr 6, 2021
@mborland
Member

mborland commented May 3, 2021

@jzmaddock This issue can be closed.
