Test: System.Net.Sockets.Tests.SendReceiveEap.SendToRecvFrom_Datagram_UDP(loopbackAddress: ::1) failed in CI #1712

KristinXie1 · 2017-03-10T02:14:29Z

Failed test: System.Net.Sockets.Tests.SendReceiveEap.SendToRecvFrom_Datagram_UDP(loopbackAddress: ::1)
Configuration: OuterLoop_CentOS7.1_debug (build#123)

Message:

System.Net.Sockets.Tests.SendReceiveEap.SendToRecvFrom_Datagram_UDP(loopbackAddress: ::1) [FAIL]
        Assert.True() Failure
        Expected: True
        Actual:   False

Stack Trace:

/mnt/resource/j/workspace/dotnet_corefx/master/outerloop_centos7.1_debug/src/System.Net.Sockets/tests/FunctionalTests/SendReceive.cs(148,0): at System.Net.Sockets.Tests.SendReceive.<SendToRecvFrom_Datagram_UDP>d__17.MoveNext()
           --- End of stack trace from previous location where exception was thrown ---
              at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
              at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
           --- End of stack trace from previous location where exception was thrown ---
              at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
              at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
           --- End of stack trace from previous location where exception was thrown ---
              at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
              at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)

Detail: https://ci.dot.net/job/dotnet_corefx/job/master/job/outerloop_centos7.1_debug/123/consoleText

KristinXie1 · 2017-03-13T01:48:05Z

This issue also reproduces on OuterLoop_CentOS7.1_release (build#123). Detail: https://ci.dot.net/job/dotnet_corefx/job/master/job/outerloop_centos7.1_release/123/testReport/System.Net.Sockets.Tests/SendReceiveEap/SendToRecvFrom_Datagram_UDP_loopbackAddress____1_/

karelz · 2017-03-14T16:50:52Z

cc @Priya91 @ianhays @steveharter

KristinXie1 · 2017-03-20T06:40:27Z

Failed again here: https://ci.dot.net/job/dotnet_corefx/job/master/job/outerloop_centos7.1_debug/134/testReport/System.Net.Sockets.Tests/SendReceiveEap/SendToRecvFrom_Datagram_UDP_loopbackAddress____1_/

steveharter · 2017-03-22T19:23:16Z

@stephentoub this test was added about three weeks ago. Any thoughts? We should probably disable it.

  int sent = await SendToAsync(right, new ArraySegment<byte>(sendBuffer), leftEndpoint);
>>Assert.True(await receiverAck.WaitAsync(AckTimeout));
  senderAck.Release();

Also, the two Assert.True calls here have no userMessage argument; one should be added to each.

stephentoub · 2017-03-22T19:29:04Z

> this test was added about three weeks ago

The test itself has actually been in the repo since 2015:
dotnet/corefx@09932a4#diff-5d77ee23eb2ea596e218f9c1ef09d793R22

What changed a few weeks ago was allowing the test to work with the various Async APIs on Socket, e.g. allowing it to work with the EAP, Task, and APM methods (and the sync ones), rather than just the APM ones. It's possible an issue was introduced as part of that conversion.

KristinXie1 · 2017-03-23T06:06:06Z

Failed again on build 20170322.02: https://mc.dot.net/#/product/netcore/master/source/official~2Fcorefx~2Fmaster~2F/type/test~2Ffunctional~2Fcli~2F/build/20170322.02/workItem/System.Net.Sockets.Tests/analysis/xunit/System.Net.Sockets.Tests.SendReceiveApm~2FSendToRecvFrom_Datagram_UDP(loopbackAddress:%20::1)

KristinXie1 · 2017-03-24T03:05:49Z

Failed again on build 20170324.01

KristinXie1 · 2017-03-28T01:38:08Z

Failed again here: https://ci.dot.net/job/dotnet_corefx/job/master/job/outerloop_centos7.1_release/138/testReport/System.Net.Sockets.Tests/SendReceiveEap/SendToRecvFrom_Datagram_UDP_loopbackAddress____1_/

stephentoub · 2017-03-28T21:54:42Z

> What changed a few weeks ago was allowing the test to work with the various Async APIs on Socket

@steveharter, actually, FYI, it looks like the test has failed not just on EAP but also on APM, which was the one that previously existed.

stephentoub · 2017-03-28T22:01:01Z

@steveharter, this looks like the same problem as was fixed for some other tests in https://github.com/dotnet/corefx/issues/5185. The test is sending 10 packets over UDP and expecting all 10 to be received, which isn't guaranteed. There's even a TODO in the test stating that it needs to be hardened against such loss:
https://github.com/dotnet/corefx/blob/ed823ad9470f2ecf412d5089fe36cd6958fc5834/src/System.Net.Sockets/tests/FunctionalTests/SendReceive.cs#L71
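One way to harden a test like this against datagram loss is to number each datagram and retransmit it until it is observed on the receiving side. The following is only a minimal sketch of that idea, not the change that was made to the test: `LossTolerantUdp`, `Deliver`, `TryReceive`, and all parameter names are hypothetical, and it uses the synchronous `Socket` calls with `ReceiveTimeout` for brevity rather than the async variants the real test covers.

```csharp
using System;
using System.Net;
using System.Net.Sockets;

public static class LossTolerantUdp
{
    // Hypothetical helper: sends `datagramCount` numbered datagrams over loopback,
    // retransmitting each one until it is received or `maxAttempts` is exhausted.
    // Returns how many distinct datagrams arrived.
    public static int Deliver(IPAddress loopback, int datagramCount, int maxAttempts, int receiveTimeoutMs)
    {
        using var left = new Socket(loopback.AddressFamily, SocketType.Dgram, ProtocolType.Udp);
        using var right = new Socket(loopback.AddressFamily, SocketType.Dgram, ProtocolType.Udp);
        left.Bind(new IPEndPoint(loopback, 0));   // port 0: let the OS pick a free port
        right.Bind(new IPEndPoint(loopback, 0));
        left.ReceiveTimeout = receiveTimeoutMs;   // applies to the synchronous ReceiveFrom below
        EndPoint leftEndPoint = left.LocalEndPoint!;

        var buffer = new byte[sizeof(int)];
        int delivered = 0;
        for (int seq = 0; seq < datagramCount; seq++)
        {
            byte[] payload = BitConverter.GetBytes(seq);
            for (int attempt = 0; attempt < maxAttempts; attempt++)
            {
                right.SendTo(payload, leftEndPoint);
                if (TryReceive(left, buffer, out int received) && received == seq)
                {
                    delivered++;
                    break;
                }
                // Timed out or read a stale duplicate: loop around and retransmit.
            }
        }
        return delivered;
    }

    // Returns false on timeout or on a datagram of unexpected size.
    private static bool TryReceive(Socket socket, byte[] buffer, out int value)
    {
        value = -1;
        // ReceiveFrom needs an endpoint of the socket's address family to fill in.
        EndPoint remote = new IPEndPoint(
            socket.AddressFamily == AddressFamily.InterNetworkV6 ? IPAddress.IPv6Any : IPAddress.Any, 0);
        try
        {
            int bytes = socket.ReceiveFrom(buffer, ref remote);
            if (bytes != sizeof(int)) return false;
            value = BitConverter.ToInt32(buffer, 0);
            return true;
        }
        catch (SocketException e) when (e.SocketErrorCode == SocketError.TimedOut)
        {
            return false;
        }
    }
}
```

A test built on this would assert that `Deliver(IPAddress.IPv6Loopback, 10, 5, 1000)` returns 10, so a single dropped datagram costs one retransmission instead of failing the run.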

steveharter · 2017-03-28T22:24:10Z

> > What changed a few weeks ago was allowing the test to work with the various Async APIs on Socket

> @steveharter, actually, FYI, it looks like the test has failed not just on EAP but also on APM, which was the one that previously existed.

FWIW according to jdash, the tests started failing on 3/9/2017. The refactoring work was on 2/26/2017 - dotnet/corefx@ca392ca#diff-5d77ee23eb2ea596e218f9c1ef09d793

Test Case: System.Net.Sockets.Tests.SendReceiveEap.SendToRecvFrom_Datagram_UDP(loopbackAddress: ::1)
Failed Jenkins Jobs

| Build Number | Machine Name | Date |
|---|---|---|
| dotnet_corefx/master/outerloop_centos7.1_debug 123 | centos71-20170216-outer1fbe70 | 03/09 07:52 AM |
| dotnet_corefx/master/outerloop_centos7.1_debug 125 | centos71-20170216-outer125790 | 03/11 07:52 AM |
| dotnet_corefx/master/outerloop_centos7.1_release 123 | centos71-20170216-outer6ffb00 | 03/12 03:32 PM |

steveharter · 2017-03-28T22:28:32Z

I believe many of the timeouts or supposed UDP packet losses are caused by interference from other tests that happen to listen on another test's port: tests that do a ReceiveFrom on a port they did not bind to (the IPv4\IPv6 dual-mode tests). The 'bad' test receives the data, causing the 'good' test to time out or miss some data. This is mostly a Linux issue because port assignment there is random (whereas on Windows it is incremental).

So if the refactoring isn't to blame, perhaps another test that was added or modified on, or a few days before, 3/9/2017 is.
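Under the interference hypothesis above, a test would avoid receiving on a port it never bound by binding every socket explicitly to port 0 and then reading the OS-assigned port back from `LocalEndPoint`. This is a small sketch of that pattern only; `EphemeralPorts` and `BindPair` are hypothetical names, and it does not demonstrate or rule out the interference itself.

```csharp
using System.Net;
using System.Net.Sockets;

public static class EphemeralPorts
{
    // Hypothetical helper: binds two UDP sockets to OS-assigned ports on the same
    // loopback address and returns the ports that were actually assigned.
    public static (int Left, int Right) BindPair(IPAddress loopback)
    {
        using var left = new Socket(loopback.AddressFamily, SocketType.Dgram, ProtocolType.Udp);
        using var right = new Socket(loopback.AddressFamily, SocketType.Dgram, ProtocolType.Udp);

        // Port 0 asks the OS for a free port; the test never guesses a port number.
        left.Bind(new IPEndPoint(loopback, 0));
        right.Bind(new IPEndPoint(loopback, 0));

        // After Bind, LocalEndPoint reports the port the OS picked.
        int leftPort = ((IPEndPoint)left.LocalEndPoint!).Port;
        int rightPort = ((IPEndPoint)right.LocalEndPoint!).Port;
        return (leftPort, rightPort);
    }
}
```

Because both sockets are open at the same time on the same address, the two ports differ, and each side sends only to the endpoint the other side reported.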

KristinXie1 · 2017-04-06T08:29:52Z

Failed again here: https://ci.dot.net/job/dotnet_corefx/job/master/job/outerloop_netcoreapp_centos7.1_release/7/testReport/System.Net.Sockets.Tests/SendReceiveEap/SendToRecvFrom_Datagram_UDP_loopbackAddress____1_/

KristinXie1 · 2017-04-07T07:36:07Z

This issue also reproduces on the portable core tests: https://mc.dot.net/#/product/netcore/master/source/official~2Fcorefx~2Fmaster~2F/type/test~2Ffunctional~2Fportable~2Fcli~2F/build/20170407.01/workItem/System.Net.Sockets.Tests/analysis/xunit/System.Net.Sockets.Tests.SendReceiveApm~2FSendToRecvFrom_Datagram_UDP(loopbackAddress:%20::1)

steveharter · 2017-04-15T23:21:38Z

Stress testing Windows 10 resulted in this test starting but not finishing (so not considered a failure in CI reports), unlike Linux which does fail in CI. For the Windows 10 repro, a background exception was reported that may have originated from that test, or from the other test that was currently running: System.Net.Sockets.Tests.DualModeConnectToIPAddressArray.DualModeConnect_IPAddressListToHost_Throws

System.Net.Sockets.Tests.SendReceiveApm.SendToRecvFrom_Datagram_UDP(loopbackAddress: 127.0.0.1) [STARTING]
...
System.Net.Sockets.SocketException: An existing connection was forcibly closed by the remote host

For Linux, the interference may come from the DualMode tests, which mix and match IPv4 and IPv6 addresses and expect failures. These may randomly hit port collisions on Linux, so they should be disabled.

karelz · 2017-04-16T00:22:10Z

Overall it looks like a 1/week failure rate ... borderline for addressing it in 2.0. Given that this is most likely just a bad test and we're tight on workforce for 2.0, we will keep it in Future. I marked it as 'wishlist' as it should be on top of our Future backlog.

wfurt · 2017-12-07T20:10:13Z

I could not find any recent failure. Please reopen if this fails again.

krwq · 2018-04-04T19:36:14Z

No failures because test case is disabled:

System.Net.Sockets/tests/FunctionalTests/SendReceive.cs:41

antonfirsov · 2020-01-14T14:28:04Z

This is a major blocker for implementing #938, since we need a robust way to test UDP, if we want to cover those changes.

antonfirsov · 2020-11-06T15:40:12Z

This failure is likely not about port stealing or any other direct interference with other tests.

The test tends to fail when CPU load is high. I can reproduce the failure by running the Theory's cases on a 2 VCPU Linux system in parallel with one single other test case that calculates PI.

…44591)

Some `SendReceive` socket tests may be prone to timing issues on CI. This seems to be the root cause of #1712. We need a reliable way to run such tests to unblock the work on new UDP socket APIs in #33418.

This PR defines a new `SendReceiveNonParallel` test group, moving `SendToRecvFrom_Datagram_UDP` into that group.

Since this is already a significant reorganization, it seemed reasonable to also:

- Harmonize naming: all SendReceive test classes are now named either `SendReceive_[SubVariant]` or `SendReceiveNonParallel_[SubVariant]`
- Split `SendReceive.cs` into multiple files:
  - `SendReceive.cs` for the parallel variants
  - `SendReceiveNonParallel.cs` for the new, non-parallel variants
- Rename the non-generic class `SendReceive` to `SendReceiveMisc` (to avoid name collision and confusion with `SendReceive<T>`) and move it to `SendReceiveMisc.cs`
- Move `SendReceiveListener` and `SendReceiveUdpClient` to separate files, rename `SendReceiveListener` to `SendReceiveTcpClient`
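A non-parallel test group of this kind can be expressed in xUnit with a collection definition that disables parallelization. This is a sketch under assumptions, not the code from the PR: it assumes xUnit v2 is referenced, the names `NoParallelTests` and `SendReceiveNonParallel_Sketch` are hypothetical, and the test body is a placeholder. It also shows the `userMessage` overload of `Assert.True` that was requested earlier in the thread.

```csharp
using Xunit;

// Hypothetical collection name. DisableParallelization asks xUnit not to run
// this collection in parallel with any other test collection.
[CollectionDefinition(nameof(NoParallelTests), DisableParallelization = true)]
public class NoParallelTests { }

// Every test class tagged with the collection inherits the non-parallel behavior.
[Collection(nameof(NoParallelTests))]
public class SendReceiveNonParallel_Sketch
{
    [Fact]
    public void TimingSensitiveTest()
    {
        // Placeholder for a timing-sensitive wait such as
        // `await receiverAck.WaitAsync(AckTimeout)` in the real test.
        bool acked = true;

        // The userMessage argument makes a timeout failure self-explanatory in CI logs.
        Assert.True(acked, "Receiver did not acknowledge the datagram within the timeout.");
    }
}
```

Isolating the test this way does not make UDP delivery reliable; it only removes CPU contention from other test collections as a source of missed timeouts.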

steveharter assigned stephentoub Mar 22, 2017

stephentoub removed their assignment Apr 12, 2017

wfurt closed this as completed Dec 7, 2017

krwq reopened this Apr 4, 2018

wfurt self-assigned this Mar 7, 2019

wfurt removed their assignment Aug 6, 2019

antonfirsov transferred this issue from dotnet/corefx Jan 14, 2020

Dotnet-GitSync-Bot added the area-System.Net.Sockets and untriaged labels Jan 14, 2020

antonfirsov added the disabled-test and test-bug labels Jan 14, 2020

antonfirsov removed the untriaged label Jan 14, 2020

This was referenced Jan 21, 2020

enable SendToRecvFrom_Datagram_UDP again dotnet/corefx#35846

Merged

disable SendToRecvFrom_Datagram_UDP test again dotnet/corefx#35920

Merged

antonfirsov self-assigned this Jan 27, 2020

antonfirsov mentioned this issue Jan 27, 2020

Re-enable dual-mode socket tests that were disabled on Linux\Mac #1481

Closed

antonfirsov mentioned this issue Feb 6, 2020

Fix SendToRecvFrom_Datagram_UDP, organize SendReceive tests #31878

Closed

karelz added this to the 5.0 milestone Feb 20, 2020

karelz added the test-run-core label Feb 22, 2020

karelz unassigned antonfirsov May 19, 2020

karelz modified the milestones: 5.0, Future Jun 5, 2020

antonfirsov mentioned this issue Nov 3, 2020

Sockets: Reimplement remaining Task-based async methods using SocketAsyncEventArgs #41502

Closed

antonfirsov modified the milestones: Future, 6.0.0 Nov 4, 2020

antonfirsov self-assigned this Nov 4, 2020

antonfirsov mentioned this issue Nov 5, 2020

Chat with CI about SendToRecvFrom_Datagram_UDP #44308

Closed

antonfirsov mentioned this issue Nov 12, 2020

Organize SendReceive tests and isolate non-parallel test collection #44591

Merged

antonfirsov closed this as completed in #44591 Nov 16, 2020

ghost locked as resolved and limited conversation to collaborators Dec 25, 2020
