Batch deal activations #1310

alexytsu · 2023-06-08T14:01:11Z

Addressing #1278

A small tidying refactor and reduction of messaging overhead costs is seen here. Gas savings are not as significant as #1304 since the costliness of the state updates themselves have not been reduced per se.

Design caveats:

ActivateDeals is modified in place to become BatchActivateDeals (which implicitly assumes batches by sector)
In order to more easily preserve a parallel mapping between params and results, callers (miner actor) of BatchActivateDeals no longer check for sectors with zero deals. Instead, these are handled gracefully (though through a shortcut path) in the market actor.

TODOs

Improve parameter and result documentation
Optimise handling of sectors with zero deals
Use map to enforce parallel return structure

codecov-commenter · 2023-06-12T10:57:05Z

Codecov Report

Merging #1310 (2ff38ab) into master (3825cc1) will decrease coverage by 0.01%.
The diff coverage is 95.52%.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1310      +/-   ##
==========================================
- Coverage   90.75%   90.75%   -0.01%     
==========================================
  Files         137      137              
  Lines       27296    27348      +52     
==========================================
+ Hits        24772    24819      +47     
- Misses       2524     2529       +5

Impacted Files	Coverage Δ
actors/verifreg/src/types.rs	`100.00% <ø> (ø)`
actors/verifreg/src/lib.rs	`92.11% <93.33%> (-0.10%)`	⬇️
actors/market/src/lib.rs	`92.49% <93.47%> (+0.23%)`	⬆️
actors/miner/src/lib.rs	`82.99% <97.61%> (+0.08%)`	⬆️
actors/market/src/types.rs	`96.87% <100.00%> (+0.10%)`	⬆️
actors/miner/src/ext.rs	`62.50% <100.00%> (+1.20%)`	⬆️
runtime/src/util/batch_return.rs	`98.57% <100.00%> (+18.57%)`	⬆️
test_vm/src/expects.rs	`95.73% <100.00%> (+0.03%)`	⬆️

... and 5 files with indirect coverage changes

anorth

Thanks, this looks good.

I would like to explore reducing the verbosity of the intra-batch error handling and re-use of existing batch result patterns. Inside batch_activate_deals, the match blocks and return None are very noisy compared with propagating ?. How about producing a Vec<Result<ActivateDealsResult, ActorError>> instead of a Vec<Option<...>>, and then you can use .with_context(...)? inside the map closure? Then convert this vector into a BatchReturn and array of only the successful ActivateDealsResult. This will return more info to the caller about what went wrong (tho initially will probably be ignored). You mentioned that the None results were to ease parallel iteration, but I think adding a method to BatchReturn to iterate the exit codes alongside a vector of values corresponding only to the OKs will not be hard, and will be re-usable elsewhere. The caller can use this to get the same parallel results.

FYI @ZenGround0 for opinions.

actors/market/src/lib.rs

actors/market/src/types.rs

actors/miner/src/lib.rs

actors/market/src/lib.rs

ZenGround0 · 2023-06-14T20:58:18Z

actors/market/src/lib.rs

+                .sectors
+                .iter()
+                .map(|p| {
+                    if p.deal_ids.is_empty() {


These should be filtered out on the caller side

I think this method would actually work fine without this check - it's just a minor optimisation. This method could either:

check and fail if a provided sector has no deals

just work and activate no deals

I think the latter is more robust here, and this early return not worth the lines of code. Everything below should handle empty collections too (and most likely already does).

just work and activate no deals

The shortcut path in the miner actor previously simulated this behaviour. The market actor is robust against sectors empty of deals so passing them through the usual path is ok. Though the batch_return pattern works well elsewhere in the PR for handling real errors, this is a valid use case and handling it in the miner actor will add a bit of code complexity to filter then re-interleave the sectors that either succeeded via shortcut or succeeded via market activation.

ZenGround0 · 2023-06-14T21:47:24Z

actors/market/src/lib.rs

+                        }
+                    };
+
+                    let deal_spaces = match validate_and_return_deal_space(


There's potentially a major security issue here.
validate_and_return_deal_spaces needs access to all deals at once to make sure we're not double activating deals and over claiming QA power. But as I understand this change this is being called once per sector. A concrete attack on this is a batch of n deal activations all over the same deal ids giving the caller an amplification of n times their qa power.

It might be that verified registry now saves us here, will have to look more.

We should have a test of this case as part of this change.

Ok it looks like the duplicates will be fed to CLAIM_ALLOCATIONS which doesn't have the same bug and will just fail the whole thing ✅
That said we should still avoid double activations on market actor or convince ourselves that activation is idempotent and then remove all duplication checks.

It's get_proposals that used to check for duplicates, but does't achieve it any more. It's the verified registry's problem to ensure QA power isn't amplified – this actor shouldn't be trusted for that any more and I'm glad to hear it's not.

But we should still avoid double-activating deals. I think it would be pretty confusing for this method to return deal space info for deals that were not activated (because they were already active). So I propose we check and fail the whole thing here if a duplicate deal is detected between sectors. Accumulate deal IDs similar to get_proposals (there's some duplication of work but I don't think worth fixing).

I note that verify_deals_for_activation (called at pre-commit and, redundantly, replica update) won't catch this either, and might be worth fixing in a follow-up.

I've made changes so that if a duplicate deal is detected between sectors, the subsequent activations are skipped and marked as failed. This preserves the behaviour checked in replica_update_test::dealincluded_in_multiple_sectors_failure

ZenGround0 · 2023-06-14T22:32:01Z

How about producing a Vec<Result<ActivateDealsResult, ActorError>> instead of a Vec<Option<...>>, and then you can use .with_context(...)? inside the map closure?

Yes I think this will be a lot better

Then convert this vector into a BatchReturn and array of only the successful ActivateDealsResult

It would be ideal to use this as it was an attempt to standardize batch return data types for ease of use

You mentioned that the None results were to ease parallel iteration, but I think adding a method to BatchReturn to iterate the exit codes alongside a vector of values corresponding only to the OKs will not be hard, and will be re-usable elsewhere.

If using BatchReturn is being unnecessarily clunky then we can perhaps improve it here so that it is more likely to stick as a standard solution.

alexytsu · 2023-06-16T04:10:55Z

actors/market/src/types.rs

-    pub sector_expiry: ChainEpoch,
+#[serde(transparent)]
+pub struct BatchActivateDealsParams {
+    pub sectors: Vec<SectorDeals>,


Sector deals introduces sector_type: RegisteredSealProof which is unecessary in this PR but will be used in #1312 to optimise ProveReplicaUpdates

alexytsu · 2023-06-16T04:19:27Z

actors/market/src/lib.rs

+                .sectors
+                .iter()
+                .map(|p| {
+                    if p.deal_ids.is_empty() {


just work and activate no deals

The shortcut path in the miner actor previously simulated this behaviour. The market actor is robust against sectors empty of deals so passing them through the usual path is ok. Though the batch_return pattern works well elsewhere in the PR for handling real errors, this is a valid use case and handling it in the miner actor will add a bit of code complexity to filter then re-interleave the sectors that either succeeded via shortcut or succeeded via market activation.

actors/market/src/lib.rs

anorth

The change itself looks good, only minor comments.

This new batch functionality needs testing. Please add a few market actor unit tests for batch sizes >1, including the basics like:

multiple sectors succeeding
one deal causing one sector to fail
multiple sectors failing
condition that causes whole call to fail
a sector with no deals passing through alongside another with deals

actors/market/tests/activate_deal_failures.rs

actors/market/tests/market_actor_test.rs

actors/miner/src/lib.rs

runtime/src/util/batch_return.rs

actors/market/src/lib.rs

actors/market/tests/batch_activate_deals.rs

ZenGround0

The case of same deal across different sectors now LGTM. Can look in more detail tomorrow at the rest of the change.

actors/miner/src/lib.rs

alexytsu · 2023-06-22T06:25:49Z

Running a final benchmark on this gets gas down to: 213,060,403 from 239,773,152 or a ~11% reduction in gas cost.

Full traces

alexytsu added 4 commits June 8, 2023 16:12

wip: batch onboarding deals test works

95f55e2

fix activate deals failures tests

d9032fe

verified deal activation test

2d1e125

fix market tests

c859afe

alexytsu force-pushed the alexytsu/1278-batch-deal-activation branch from 9cae3bb to 3b6e862 Compare June 8, 2023 14:41

refactor to a map based pattern to ensure parallel return structure

12a4ae8

alexytsu force-pushed the alexytsu/1278-batch-deal-activation branch from 3b6e862 to 12a4ae8 Compare June 8, 2023 15:09

alexytsu added 2 commits June 9, 2023 01:55

fix deal failure test expectations

3d66878

adjust market tests for new failure expectations

97f07d0

alexytsu force-pushed the alexytsu/1278-batch-deal-activation branch from e4e5bab to 97f07d0 Compare June 8, 2023 16:12

alexytsu added 2 commits June 10, 2023 16:28

cron epoch test

ad36fb8

fix the tests

b063030

alexytsu requested a review from anorth June 12, 2023 23:01

alexytsu marked this pull request as ready for review June 12, 2023 23:01

anorth reviewed Jun 13, 2023

View reviewed changes

ZenGround0 requested changes Jun 14, 2023

View reviewed changes

alexytsu added 9 commits June 15, 2023 14:39

remain ActivateDealsResult to DealActivation

4a16333

commit deal states into state once for all sectors

724205f

use sectordeals for batchactivate

a1195c6

cleanup logic for marketactor::BatchActivateDealsResult shortcut

31c5e33

refactor Market::BatchActivateDeals to use BatchReturn

2fbfd11

revert verifreg to use BatchReturn

0796d28

better error context when deal activation fails

4c305bf

remove shortcut path, market actor already handles empty sectors

de178e9

don't activate sectors with duplicate deals

c5b5fc2

alexytsu commented Jun 16, 2023

View reviewed changes

alexytsu requested review from ZenGround0 and anorth June 16, 2023 06:11

anorth reviewed Jun 18, 2023

View reviewed changes

alexytsu added 3 commits June 19, 2023 17:11

use a batch activation helper

519a8a9

de duplicate harness deal activation paths

efa14cb

drop Copy requirement on BatchGen::success

65e185d

anorth mentioned this pull request Jun 19, 2023

Optimize deal activation by enabling sector activation batching #1278

Closed

alexytsu commented Jun 20, 2023

View reviewed changes

actors/market/tests/batch_activate_deals.rs Outdated Show resolved Hide resolved

ZenGround0 reviewed Jun 21, 2023

View reviewed changes

simple tests for batch_activate_deals

c2a5cdc

alexytsu force-pushed the alexytsu/1278-batch-deal-activation branch from 1541e83 to 2ff38ab Compare June 21, 2023 08:15

anorth approved these changes Jun 22, 2023

View reviewed changes

actors/miner/src/lib.rs Outdated Show resolved Hide resolved

fix tests

ddc6934

alexytsu force-pushed the alexytsu/1278-batch-deal-activation branch from 2ff38ab to ddc6934 Compare June 22, 2023 03:44

anorth requested a review from ZenGround0 June 22, 2023 19:52

ZenGround0 approved these changes Jun 26, 2023

View reviewed changes

alexytsu added this pull request to the merge queue Jun 26, 2023

Merged via the queue into master with commit f414380 Jun 26, 2023

alexytsu deleted the alexytsu/1278-batch-deal-activation branch June 26, 2023 05:23

anorth mentioned this pull request Jul 20, 2023

Claim allocations processes batches by sector #1337

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch deal activations #1310

Batch deal activations #1310

alexytsu commented Jun 8, 2023 •

edited

Loading

codecov-commenter commented Jun 12, 2023 •

edited

Loading

anorth left a comment

ZenGround0 Jun 14, 2023

anorth Jun 15, 2023

alexytsu Jun 16, 2023

ZenGround0 Jun 14, 2023

ZenGround0 Jun 14, 2023

anorth Jun 15, 2023

alexytsu Jun 16, 2023

ZenGround0 commented Jun 14, 2023

alexytsu Jun 16, 2023

alexytsu Jun 16, 2023

anorth left a comment

ZenGround0 left a comment

alexytsu commented Jun 22, 2023

Batch deal activations #1310

Batch deal activations #1310

Conversation

alexytsu commented Jun 8, 2023 • edited Loading

codecov-commenter commented Jun 12, 2023 • edited Loading

Codecov Report

anorth left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ZenGround0 commented Jun 14, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anorth left a comment

Choose a reason for hiding this comment

ZenGround0 left a comment

Choose a reason for hiding this comment

alexytsu commented Jun 22, 2023

alexytsu commented Jun 8, 2023 •

edited

Loading

codecov-commenter commented Jun 12, 2023 •

edited

Loading