Fix io_service_fixture_with_threads: create given threads count #75

grishavanika · 2018-03-25T09:30:24Z

Looks like a small typo: only one thread was ever created, regardless of the requested thread count.
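To illustrate the kind of fix this PR makes, here is a minimal sketch of a fixture that starts as many worker threads as requested. It is not the cppcoro code: `worker_fixture`, `run()`, `thread_count()` and `started_count()` are hypothetical names, and a simple spin loop stands in for `m_ioService.process_events()`.

```cpp
#include <atomic>
#include <cstddef>
#include <cstdint>
#include <thread>
#include <vector>

// Hypothetical stand-in for a threaded test fixture; the names here are
// illustrative and are not taken from cppcoro.
class worker_fixture
{
public:
    explicit worker_fixture(std::uint32_t threadCount)
    {
        m_threads.reserve(threadCount);
        try
        {
            // The loop is the point: a single emplace_back here would start
            // exactly one thread regardless of threadCount.
            for (std::uint32_t i = 0; i < threadCount; ++i)
            {
                m_threads.emplace_back([this] { run(); });
            }
        }
        catch (...)
        {
            stop();  // join whatever did start before rethrowing
            throw;
        }
    }

    ~worker_fixture() { stop(); }

    worker_fixture(const worker_fixture&) = delete;
    worker_fixture& operator=(const worker_fixture&) = delete;

    std::size_t thread_count() const { return m_threads.size(); }
    std::uint32_t started_count() const { return m_started.load(); }

    // Signals the workers to exit and joins them; safe to call repeatedly.
    void stop()
    {
        m_stop.store(true);
        for (auto& t : m_threads)
        {
            if (t.joinable()) t.join();
        }
    }

private:
    // Placeholder for the real event loop: count the start, then idle
    // until asked to stop.
    void run()
    {
        m_started.fetch_add(1);
        while (!m_stop.load())
        {
            std::this_thread::yield();
        }
    }

    std::atomic<bool> m_stop{false};
    std::atomic<std::uint32_t> m_started{0};
    std::vector<std::thread> m_threads;
};
```

Constructing `worker_fixture fixture(3);` should leave three joinable threads in the vector; with the loop removed, the same call would create only one, which is the behaviour described above.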

UPD: Looking into the failed optimized x86 build shows that the "multi-threaded" case with 3 threads fails with an exception:

RangeChecks instrumentation code detected an out of range array access.

and the call stack is always the same:

>	run.exe!cppcoro::task<int> <lambda>(void)$_ResumeCoro$2::operator()() Line 74	C++
 	[Inline Frame] run.exe!std::experimental::coroutine_handle<void>::resume() Line 110	C++
 	[Inline Frame] run.exe!cppcoro::single_consumer_async_auto_reset_event::set() Line 41	C++
 	run.exe!cppcoro::task<int> <lambda>(int)$_ResumeCoro$2::operator()() Line 82	C++
 	[Inline Frame] run.exe!std::experimental::coroutine_handle<void>::resume() Line 110	C++
 	run.exe!cppcoro::io_service::try_process_one_event(bool waitForEvent) Line 599	C++
 	run.exe!cppcoro::io_service::process_events() Line 356	C++
 	[Inline Frame] run.exe!io_service_fixture::{ctor}::__l5::<lambda_4cb3593f6bc008fa86c60dd0ebff6953>::operator()() Line 30	C++
 	[Inline Frame] run.exe!std::_Invoker_functor::_Call(io_service_fixture::{ctor}::__l5::<lambda_4cb3593f6bc008fa86c60dd0ebff6953> &&) Line 232	C++
 	[Inline Frame] run.exe!std::invoke(io_service_fixture::{ctor}::__l5::<lambda_4cb3593f6bc008fa86c60dd0ebff6953> &&) Line 232	C++
 	[Inline Frame] run.exe!std::_LaunchPad<std::unique_ptr<std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> >,std::default_delete<std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> > > > >::_Execute(std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> > &) Line 240	C++
 	[Inline Frame] run.exe!std::_LaunchPad<std::unique_ptr<std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> >,std::default_delete<std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> > > > >::_Run(std::_LaunchPad<std::unique_ptr<std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> >,std::default_delete<std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> > > > > *) Line 247	C++
 	run.exe!std::_LaunchPad<std::unique_ptr<std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> >,std::default_delete<std::tuple<<lambda_4cb3593f6bc008fa86c60dd0ebff6953> > > > >::_Go() Line 232	C++
 	run.exe!std::_Pad::_Call_func(void * _Data) Line 211	C++
 	ucrtbase.dll!7761ab7d()	Unknown
 	[Frames below may be incorrect and/or missing, no symbols loaded for ucrtbase.dll]	
 	kernel32.dll!758d8654()	Unknown
 	ntdll.dll!77864a77()	Unknown
 	ntdll.dll!77864a47()	Unknown

This test case fails with any number of threads greater than one.
With some experimenting I found a "fix", but I have no idea why it works.
Here it is: link.

Unfortunately, this strange fix works fine for my local build, but causes tests to fail on AppVeyor in some other place (?) with a different exception (?)

lewissbaker · 2018-03-26T12:08:46Z

test/single_consumer_async_auto_reset_event_tests.cpp

@@ -57,6 +57,18 @@ TEST_CASE("single waiter")

 TEST_CASE_FIXTURE(io_service_fixture_with_threads<3>, "multi-threaded")
 {
+#if (CPPCORO_CPU_X86)
+#if (1)
+	const int iterations = 10'000;


Do you know how/why this change works around the issue with x86 optimised builds?

I can only assume that, because it's now a variable captured by reference, the code below becomes slightly more complicated and harder to optimise.

I have seen several different tests all fail under x86 optimised over various recent versions of msvc.
This mostly just seems like bad codegen for coroutines under x86, although I haven't looked into some of the more recent failures to identify the root cause yet. I've been largely just ignoring x86 optimised failures lately and am hoping that the codegen bugs will be fixed in the 15.7 update.

No, I have no idea. I haven't had time to look at the assembler output yet.

Also, the bug disappears if the iteration count is lower (let's say 6'000). In that case an integer literal can be used.

> recent versions of msvc

I'm using 15.6.4 and this fix helps me, but, as I said, the AppVeyor build still fails. Since it shows no linker errors (which I get with 15.6.4), AppVeyor must have an older MSVC version (15.6.3 ?). So (if this is a codegen bug) something changed in the latest version.

Also, because a smaller number of iterations helps, I thought this was something related to the thread's stack size (?), but increasing the stack size from the default 1 MB to 10 MB does not help.

I can try to change the iteration count to 2'000 and build again on AppVeyor if you think the changes should be accepted.

Thank you for looking into the change :)

I have confirmed that this change also makes the test pass on MSVC 15.7 (preview 2).

It is very weird that changing the iteration count affects the failure. Perhaps reducing the iteration count reduces the contention on shared data structures and thus avoids some race conditions?

The AppVeyor tests are still running with an older version of msvc 15.6.2 but are failing on a different test (writing a file). This test only seems to fail on the AppVeyor CI machines. I haven't been able to reproduce it on a dev machine. My current working theory for that failing test is also bad x86 codegen but I don't have any evidence other than "other things are broken under x86 optimised due to compiler bugs so it's likely this is too".

I'll put through a change to disable the failing file test under x86 optimised for now.

@lewissbaker, sorry for the late response. Just to be sure: can I help you with this somehow?

Would you be able to amend the PR to remove the change to test/single_consumer_async_auto_reset_event_tests.cpp ?

I'll put the PR through with just the fix to the io_service_fixture constructor.

I've put through a change on master that ignores test failures on x86 for now and will leave it as a known issue. There's no point in working around the compiler bug in the test - users of the library are just as likely to run into the bug in their own code.

Changes reverted (not sure that was done right; I have a few unneeded commits in the forked repository, but you finally have 1 file in this pull request)

lewissbaker · 2018-03-26T12:09:33Z

test/io_service_fixture.hpp

@@ -25,7 +25,10 @@ struct io_service_fixture
 		m_ioThreads.reserve(threadCount);
 		try
 		{
-			m_ioThreads.emplace_back([this] { m_ioService.process_events(); });


grishavanika added 2 commits March 25, 2018 12:34

Fix io_service_fixture_with_threads: create given threads count

c5fd167

Work-around for Optimized x86 build that avoids `out of range array access` exception

8467a21

lewissbaker reviewed Mar 26, 2018

grishavanika and others added 4 commits April 5, 2018 07:16

Revert work-around for Optimized x86 build

a82c968

Merge branch 'master' of https://github.com/grishavanika/cppcoro

88dfb36

Revert "Work-around for Optimized x86 build that avoids `out of range array access` exception" This reverts commit 8467a21.

658f32e

Merge branch 'master' into master

6542e03

lewissbaker merged commit 14ad4bd into lewissbaker:master Apr 5, 2018

blapid pushed a commit to blapid/cppcoro that referenced this pull request Jul 24, 2018

Fix io_service_fixture_with_threads: create given threads count (lewissbaker#75)

62e8697
