Add MinBatchSize option #10091

AndriySvyryd · 2017-10-16T23:24:23Z

Part of #9270

roji · 2018-07-21T20:54:47Z

I'm synchronizing some old EF Core changes to Npgsql, and came across this issue. I can see that MinBatchSize defaults to a hardcoded 4 in CommandBatchPreparer, any particular reasoning for this? I saw some benchmarks in #9270, but those are SQL Server-specific and CommandBatchPreparer is used by other providers such as Npgsql.

Concretely I don't have reason to believe it's not a good idea to start batching from 2 statements (and no time to actually benchmark). Any chance you move the constant 4 to some provider-specific place where it would be easier to change? Otherwise I have to set it in NpgsqlOptionsExtension which isn't very nice.

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

AndriySvyryd · 2018-07-22T20:33:31Z

@roji Why isn't it nice to do in NpgsqlOptionsExtension? Where would you want to do it?
For Sql Server 3 and 4 are very similar, so we could lower the default if benchmarks show 3 is better for other providers. I suspect that 3 would still be better than 2 for Npgsql

roji · 2018-07-22T20:43:19Z

My point was that I'd like to leave MinBatchSize null on NpgsqlOptionsExtension, to signify that the user hasn't specified it. If I set it to some value such as 2, we can no longer distinguish between a user-set value and a default value... Or maybe I misunderstood you suggestion...

Regarding 2 vs 3 vs 4, I should benchmark. However, on the wire, sending two messages as a batch or as two separate messages involved the exact same protocol messages, the only difference is in whether you wait for the first statement to complete before sending the second. So there really is no reason to think that batching would be bad for performance for any batch size...

I understand this works different for SQL Server, which is why I think it's better to manage the default separately in each provider.

AndriySvyryd · 2018-07-22T21:12:10Z

I just can't think of a scenario where it would matter that the value was set by the user.

There is some overhead in EF when creating batches.

roji · 2018-07-23T04:50:25Z

I just can't think of a scenario where it would matter that the value was set by the user.

For example, when logging we could check whether the value was set by the user, and only only log if so. It's just a theoretical idea that the distinction is important, if we really don't care I can set it.

There is some overhead in EF when creating batches.

Are you sure the slowdown you're seeing in the benchmarks comes from EF Core rather than the ADO.NET driver itself? Because in my experience the network round-trip you add by not batching dominates most things happening inside your application (especially as latency to your database server increases...)

AndriySvyryd · 2018-07-23T07:03:41Z

For example, when logging we could check whether the value was set by the user, and only only log if so. It's just a theoretical idea that the distinction is important, if we really don't care I can set it.

@divega, @ajcvickers ?

Because in my experience the network round-trip you add by not batching dominates most things happening inside your application

Yes, that's why this has to be configurable. For the default I think we should consider the co-located scenario (<10ms latency)

roji · 2018-07-23T07:42:50Z

Because in my experience the network round-trip you add by not batching dominates most things happening inside your application

Yes, that's why this has to be configurable. For the default I think we should consider the co-located scenario (<10ms latency)

I agree, but the point is that in PostgreSQL I don't think there's any real latency low enough where not batching would make sense, even against localhost. I know it's bad to talk about perf in theory, but I have no real environment here to benchmark. Is there any chance you guys can run a quick benchmark with varying batch sizes against PostgreSQL? If not I'll at least run a localhost benchmark.

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

dnfclas added the cla-already-signed label Oct 16, 2017

ajcvickers approved these changes Oct 17, 2017

View reviewed changes

Add MinBatchSize option

a22e406

Part of #9270

AndriySvyryd force-pushed the Issue9209 branch from bc8e814 to a22e406 Compare October 17, 2017 18:01

AndriySvyryd merged commit a22e406 into dev Oct 17, 2017

AndriySvyryd deleted the Issue9209 branch October 17, 2017 18:02

divega added the providers-beware label Jul 22, 2018

roji added a commit to roji/efcore.pg that referenced this pull request Jul 22, 2018

More test fixes

b21c574

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

AndriySvyryd mentioned this pull request Jul 31, 2018

Set a default max batch size for SQL Server #9270

Closed

roji added a commit to roji/efcore.pg that referenced this pull request Aug 3, 2018

More test fixes

f50d2b3

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

roji added a commit to roji/efcore.pg that referenced this pull request Aug 10, 2018

More test fixes

10e8bb8

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

roji added a commit to roji/efcore.pg that referenced this pull request Aug 10, 2018

More test fixes

9b1550d

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

roji added a commit to roji/efcore.pg that referenced this pull request Aug 12, 2018

More test fixes

200d89b

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

roji added a commit to roji/efcore.pg that referenced this pull request Aug 12, 2018

More test fixes

820878e

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

roji added a commit to roji/efcore.pg that referenced this pull request Aug 12, 2018

More test fixes

6436a03

* Switch to PoolableDbContext in various tests (dotnet/efcore#11311) * Test for MinBatchSize (dotnet/efcore#10091)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MinBatchSize option #10091

Add MinBatchSize option #10091

AndriySvyryd commented Oct 16, 2017

roji commented Jul 21, 2018

AndriySvyryd commented Jul 22, 2018

roji commented Jul 22, 2018

AndriySvyryd commented Jul 22, 2018

roji commented Jul 23, 2018

AndriySvyryd commented Jul 23, 2018

roji commented Jul 23, 2018 •

edited

Loading

Add MinBatchSize option #10091

Add MinBatchSize option #10091

Conversation

AndriySvyryd commented Oct 16, 2017

roji commented Jul 21, 2018

AndriySvyryd commented Jul 22, 2018

roji commented Jul 22, 2018

AndriySvyryd commented Jul 22, 2018

roji commented Jul 23, 2018

AndriySvyryd commented Jul 23, 2018

roji commented Jul 23, 2018 • edited Loading

roji commented Jul 23, 2018 •

edited

Loading