Plumb CancellationToken through Socket.Receive/SendAsync #36516

stephentoub · 2019-04-01T02:16:20Z

In .NET Core 2.1 we added overloads of Send/ReceiveAsync, and we proactively added CancellationToken arguments to them, but those tokens were only checked at the beginning of the call; if a cancellation request came in after that check, the operation would not be interrupted.

This PR plumbs the token through so that a cancellation request at any point in the operation will cancel that operation. On Windows we register to use CancelIoEx to request cancellation of the specific overlapped operation on the specific socket. On Unix we use the existing cancellation infrastructure already in place to support the existing custom queueing scheme.

Some caveats:

- On Windows, canceling a TCP receive will end up canceling all TCP receives pending on that socket, even when we request cancellation of a specific overlapped operation; this is just how cancellation works at the OS level, and there's little we can do about it. It also shouldn't matter much, as multiple pending receives on the same socket are rare.
- If multiple concurrent receives or multiple concurrent sends are issued on the same socket, only the first will actually be cancelable. This is because this implementation only plumbs the token through the SocketAsyncEventArgs-based code paths, not the APM-based code paths, and currently when using the Task-based APIs, we use the SocketAsyncEventArgs under the covers for only one receive and one send at a time; other receives made while that SAEA receive is in progress or other sends made while that SAEA send is in progress will fall back to using the APM code paths. This could be addressed in the future in various ways, including a) just using the SAEA code paths for all operations and deleting the APM fallback, or b) plumbing cancellation through APM as well. However, for now, this approach addresses the primary use case and should be sufficient.
- This only affects code paths to which the CancellationToken passed to Send/ReceiveAsync could reach. If in the future we add additional overloads taking CancellationToken, we will likely need to plumb it to more places.
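
For illustration, here is a minimal sketch of what the change means for a caller, assuming a runtime that includes this PR (.NET Core 3.0 or later). It is not taken from the PR's tests. The class and method names (`ReceiveCancellationSketch`, `ReceiveIsCanceledAsync`) are hypothetical; only the `Socket` and `CancellationTokenSource` calls are real APIs. A loopback connection is set up, the peer never sends, and the token fires while the receive is already pending.

```csharp
using System;
using System.Net;
using System.Net.Sockets;
using System.Threading;
using System.Threading.Tasks;

public static class ReceiveCancellationSketch
{
    // Returns true if a pending ReceiveAsync was interrupted by the token.
    public static async Task<bool> ReceiveIsCanceledAsync(TimeSpan cancelAfter)
    {
        using var listener = new Socket(AddressFamily.InterNetwork, SocketType.Stream, ProtocolType.Tcp);
        listener.Bind(new IPEndPoint(IPAddress.Loopback, 0));
        listener.Listen(1);
        EndPoint endPoint = listener.LocalEndPoint ?? throw new InvalidOperationException("Listener is not bound.");

        using var client = new Socket(AddressFamily.InterNetwork, SocketType.Stream, ProtocolType.Tcp);
        await client.ConnectAsync(endPoint);
        using Socket server = await listener.AcceptAsync();

        // The token is requested only after the receive has been issued,
        // which is the case the initial up-front check could not handle.
        using var cts = new CancellationTokenSource(cancelAfter);
        var buffer = new byte[16];
        try
        {
            // The client never sends, so this receive stays pending until canceled.
            await server.ReceiveAsync(buffer.AsMemory(), SocketFlags.None, cts.Token);
            return false;
        }
        catch (OperationCanceledException)
        {
            return true;
        }
    }
}
```

With the earlier behavior described above, where the token was only checked at the start of the call, the same receive would be expected to stay pending until data arrived or the socket was closed.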

Fixes https://github.com/dotnet/corefx/issues/24430
cc: @geoffkizer, @davidsh, @wfurt, @tmds

wfurt · 2019-04-01T04:04:41Z

https://mc.dot.net/#/user/dotnet-bot/pr~2Fdotnet~2Fcorefx~2Frefs~2Fpull~2F36516~2Fmerge/test~2Ffunctional~2Fcli~2F/20190331.21/workItem/System.Net.Http.Functional.Tests

GetAsync_CancelDuringResponseBodyReceived_Unbuffered_TaskCanceledQuickly test is failing.
It seems like this may be related to this change.

stephentoub · 2019-04-01T12:35:24Z

> GetAsync_CancelDuringResponseBodyReceived_Unbuffered_TaskCanceledQuickly test is failing.
> It seems like this may be related to this change.

Definitely is. I'll take a look.


stephentoub · 2019-04-01T14:34:02Z

@wfurt, fixed. I needed a try/catch to handle a race condition between the SafeHandle being disposed and trying to use it for cancellation. This showed up in the HTTP tests because we forcefully close the connection as part of cancellation (we could look at revisiting that after this goes in).
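
The shape of that fix can be sketched roughly as follows. This is an illustration of the pattern only, not the code in the PR: `FakeHandle`, `CancelPendingIo`, and `CancellationRaceSketch` are hypothetical stand-ins, and catching `ObjectDisposedException` specifically is an assumption about which exception the race produces.

```csharp
using System;
using System.Threading;

// Hypothetical stand-in for a handle that may be disposed concurrently.
public sealed class FakeHandle : IDisposable
{
    private int _disposed;
    private int _cancelRequests;

    public int CancelRequests => Volatile.Read(ref _cancelRequests);

    public void Dispose() => Interlocked.Exchange(ref _disposed, 1);

    // Models "use the handle to request cancellation"; throws once disposed.
    public void CancelPendingIo()
    {
        if (Volatile.Read(ref _disposed) != 0)
        {
            throw new ObjectDisposedException(nameof(FakeHandle));
        }
        Interlocked.Increment(ref _cancelRequests);
    }
}

public static class CancellationRaceSketch
{
    // Cancellation callback body: losing the race to Dispose is not an error,
    // because a disposed handle has no pending operation left to cancel.
    public static bool TryCancelPendingIo(FakeHandle handle)
    {
        try
        {
            handle.CancelPendingIo();
            return true;
        }
        catch (ObjectDisposedException)
        {
            return false;
        }
    }

    // Disposes the handle first, then cancels; returns true if Cancel() did not throw.
    public static bool CancelAfterDispose()
    {
        using var cts = new CancellationTokenSource();
        var handle = new FakeHandle();
        using CancellationTokenRegistration registration =
            cts.Token.Register(state => TryCancelPendingIo((FakeHandle)state!), handle);

        handle.Dispose(); // e.g. the connection is forcefully closed
        cts.Cancel();     // the callback runs here and swallows the disposal race
        return handle.CancelRequests == 0;
    }
}
```

Without the try/catch, the exception would propagate out of the cancellation callback and surface from `Cancel()`.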

stephentoub · 2019-04-02T19:19:54Z

/azp run corefx-outerloop-windows

stephentoub · 2019-04-02T19:20:05Z

/azp run corefx-outerloop-linux

azure-pipelines · 2019-04-02T19:20:08Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2019-04-02T19:20:20Z

Azure Pipelines successfully started running 1 pipeline(s).

stephentoub · 2019-04-02T19:20:20Z

/azp run corefx-outerloop-osx

azure-pipelines · 2019-04-02T19:20:33Z

Azure Pipelines successfully started running 1 pipeline(s).

stephentoub · 2019-04-03T13:15:16Z

@sebastienros, before merging this, I'd like to make sure this doesn't regress Kestrel benchmarks. What's the best way for me to do that these days?

wfurt

LGTM.
And yes, benchmark would be good.

tmds · 2019-04-04T06:53:39Z

@stephentoub the benchmarking still requires some human intervention. For benchmarks @sebastienros is doing with me, I provide him a release build of System.Net.Sockets.dll (baseline version and modified version).

stephentoub · 2019-04-07T20:50:44Z

There doesn't appear to be any meaningful impact on plaintext, positive or negative, which is good (the goal here was to support additional functionality without negatively impacting perf).

The numbers fluctuate a bit from run to run, but here's an example on Windows:

| Description |       RPS | CPU (%) | Memory (MB) | Avg. Latency (ms) | Startup (ms) | First Request (ms) | Latency (ms) | Ratio |
| ----------- | --------- | ------- | ----------- | ----------------- | ------------ | ------------------ | ------------ | ----- |
|    baseline | 2,307,911 |      94 |         158 |              3.94 |          357 |              84.03 |          0.5 |  1.00 |
|   mychanges | 2,330,932 |      90 |         159 |              3.53 |          357 |              84.65 |         0.41 |  1.01 |

and on Linux:

| Description |       RPS | CPU (%) | Memory (MB) | Avg. Latency (ms) | Startup (ms) | First Request (ms) | Latency (ms) | Ratio |
| ----------- | --------- | ------- | ----------- | ----------------- | ------------ | ------------------ | ------------ | ----- |
|    baseline | 2,012,913 |      98 |         176 |              1.32 |          259 |             116.24 |         0.49 |  1.00 |
|   mychanges | 2,008,372 |      98 |         176 |              1.32 |          264 |             118.15 |         0.62 |  1.00 |

tmds · 2019-04-08T11:30:34Z

src/System.Net.Sockets/src/System/Net/Sockets/SocketAsyncContext.Unix.cs

+                                // call TryCancel, so we do this after the op is fully enqueued.
+                                if (cancellationToken.CanBeCanceled)
+                                {
+                                    operation.CancellationRegistration = cancellationToken.UnsafeRegister(s => ((TOperation)s).TryCancel(), operation);


@stephentoub a canceled operation will remain in the queue until it is removed at the next event or the queue gets stopped.

stephentoub · 2019-04-08

Yes. Is that a problem?

tmds · 2019-04-08

Not functionally. It will be alive for longer and maybe keeping some other things alive.

stephentoub · 2019-04-08

> It will be alive for longer and maybe keeping some other things alive.

Yes, but then again so are the Socket and the SocketAsyncEventArgs. In comparison this shouldn't keep alive much. And if you're canceling the operation, there's a really high likelihood you're also either tearing everything down or about to do something else that will revisit the queue.
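
The behavior discussed in this thread can be modeled with a small sketch. It is a simplified illustration, not the actual `SocketAsyncContext` code: `PendingOperation`, `OperationQueueSketch`, and their members are hypothetical, and the real queue has more states. It shows the token being registered only after the operation is enqueued, and a canceled operation staying in the queue until the next event removes it.

```csharp
using System;
using System.Collections.Generic;
using System.Threading;

public sealed class PendingOperation
{
    private int _state; // 0 = waiting, 1 = canceled, 2 = completed

    public CancellationTokenRegistration CancellationRegistration;

    public bool IsCanceled => Volatile.Read(ref _state) == 1;

    // Only one of TryCancel/TryComplete can win the transition out of "waiting".
    public bool TryCancel() => Interlocked.CompareExchange(ref _state, 1, 0) == 0;
    public bool TryComplete() => Interlocked.CompareExchange(ref _state, 2, 0) == 0;
}

public sealed class OperationQueueSketch
{
    private readonly object _lock = new object();
    private readonly Queue<PendingOperation> _queue = new Queue<PendingOperation>();

    public int Count { get { lock (_lock) { return _queue.Count; } } }

    public PendingOperation Enqueue(CancellationToken cancellationToken)
    {
        var operation = new PendingOperation();
        lock (_lock)
        {
            _queue.Enqueue(operation);
        }

        // Register after the operation is fully enqueued: if the token is already
        // canceled, the callback runs synchronously inside UnsafeRegister.
        if (cancellationToken.CanBeCanceled)
        {
            operation.CancellationRegistration =
                cancellationToken.UnsafeRegister(s => ((PendingOperation)s!).TryCancel(), operation);
        }
        return operation;
    }

    // Models "the next event": canceled entries are dropped here, not at cancel time.
    // Returns true if a still-waiting operation was completed.
    public bool TryProcessNextEvent()
    {
        lock (_lock)
        {
            while (_queue.Count > 0)
            {
                PendingOperation operation = _queue.Dequeue();
                operation.CancellationRegistration.Dispose();
                if (operation.TryComplete())
                {
                    return true;
                }
            }
            return false;
        }
    }
}
```

In this model, canceling marks the operation as canceled immediately, but the object remains reachable from the queue until `TryProcessNextEvent` runs, which is the lifetime cost raised in the review.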

Commit migrated from dotnet/corefx@2190a0f

davidsh added the area-System.Net.Sockets label Apr 1, 2019

davidsh added this to the 3.0 milestone Apr 1, 2019

davidsh approved these changes Apr 1, 2019


stephentoub force-pushed the socketcancellation branch from f1e3e3e to d78c103 Compare April 1, 2019 14:32

wfurt approved these changes Apr 4, 2019


JanEggers mentioned this pull request Apr 4, 2019

Stream ReadAsync stuck bug dotnet/MQTTnet#584

Closed

stephentoub merged commit 2190a0f into dotnet:master Apr 7, 2019

stephentoub deleted the socketcancellation branch April 7, 2019 20:51

tmds reviewed Apr 8, 2019


tmds mentioned this pull request Dec 16, 2019

Add CancellationToken overloads to Socket.ConnectAsync and Socket.AcceptAsync dotnet/runtime#921

Closed

roji mentioned this pull request Sep 24, 2020

Read with timeout is incompatible with SslStream npgsql/npgsql#1501

Closed

flakey-bit mentioned this pull request Jun 10, 2021

Response unmarshalling code can potentially block forever aws/aws-sdk-net#1870

Closed
