PeerDAS: Multiple improvements #14467

Merged: 6 commits into peerDAS from peerdas-misc on Sep 23, 2024
Conversation

nalepae (Contributor) commented on Sep 20, 2024

Please read all the commit messages; they contain the needed information.

  • Improve logging to help with debugging
  • Use a cache to keep track of received/stored data columns

nalepae requested a review from a team as a code owner on September 20, 2024 at 12:36
nalepae requested reviews from kasey, terencechain, and james-prysm and removed the request for a team on September 20, 2024 at 12:36
`reconstructDataColumns`: Stop looking into the DB to know if we have some columns.

Before this commit:
Each time we receive a column, we look into the filesystem for all the columns we already store.
==> For 128 columns, that amounts to 1 + 2 + 3 + ... + 128 = 128(128+1)/2 = 8,256 file lookups.

Also, as soon as a column is saved to the filesystem, we assume that looking at the filesystem again right afterwards will show the column as available (strict consistency). This is not always the case.

==> Because of this lack of strict filesystem consistency, we sometimes reconstruct and reseed columns more than once.

After this commit:
We use a (strictly consistent) cache to determine whether we have received a column.
==> No more consistency issues, and less stress on the filesystem.
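
As an illustration, here is a minimal sketch of such a strictly consistent cache; the package, type, and method names are hypothetical, not Prysm's actual implementation:

```go
package columns

import "sync"

// columnTracker is an illustrative, strictly consistent in-memory record of
// which data columns have been received for each block root. A read that
// follows a write always observes it, unlike a check against the filesystem.
type columnTracker struct {
	mu   sync.RWMutex
	seen map[[32]byte]map[uint64]bool // block root -> column index -> received
}

func newColumnTracker() *columnTracker {
	return &columnTracker{seen: make(map[[32]byte]map[uint64]bool)}
}

// markReceived records that the column with the given index was received for root.
func (t *columnTracker) markReceived(root [32]byte, index uint64) {
	t.mu.Lock()
	defer t.mu.Unlock()
	if t.seen[root] == nil {
		t.seen[root] = make(map[uint64]bool)
	}
	t.seen[root][index] = true
}

// receivedCount returns how many distinct columns were received for root, so a
// caller can decide whether reconstruction is possible without any file lookups.
func (t *columnTracker) receivedCount(root [32]byte) int {
	t.mu.RLock()
	defer t.mu.RUnlock()
	return len(t.seen[root])
}
```

Because every update goes through the same lock-protected in-memory map, a column marked as received is immediately visible to the reconstruction path, which is exactly the property the filesystem check lacked.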
Before this commit, logged values assumed that all requested columns correspond to
the same block root, which is not always the case.

After this commit, we know which columns are requested for which root.
This is useful to debug "lost data columns" in devnet.
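
As a hedged illustration of the per-root logging (the helper, its parameter shape, and the log message are hypothetical; only the use of logrus reflects Prysm's logging library):

```go
package columns

import (
	"fmt"

	"github.com/sirupsen/logrus"
)

// logRequestedColumns groups the requested column indices by block root and
// emits one log line per root, so the log no longer implies that every
// requested index belongs to a single block root.
func logRequestedColumns(requested map[[32]byte][]uint64) {
	for root, indices := range requested {
		logrus.WithFields(logrus.Fields{
			"blockRoot": fmt.Sprintf("%#x", root),
			"columns":   indices,
		}).Debug("Data column sidecars requested by root")
	}
}
```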

nisdas (Member) left a comment:

Maybe I am missing context: why are we using the data columns cache here in sync? This seems to duplicate existing functionality.


return nil
}

func (s *Service) internalBroadcastDataColumn(
ctx context.Context,
slot primitives.Slot,

Review comment (Member):

Do we need the slot and slotStartTime if we can already get this information from the sidecar object?

nalepae (Contributor, PR author) replied:

Addressed here: 1dc8225

Additional question: we could also remove the root argument from internalBroadcastDataColumn, since the hash tree root can be obtained from the data column sidecar object with:

dataColumnSidecar.SignedBlockHeader.GetHeader().HashTreeRoot()

However, is the cost of recomputing HashTreeRoot worth removing the root argument from the function?

Review comment (Member):

I think passing the root is a better idea; hashing the header 128 times will waste CPU cycles unnecessarily.
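
To make the trade-off concrete, here is a hedged sketch in which the caller computes the block root once and reuses it for every column; the sidecar type and the broadcastColumn callback are simplified stand-ins, not the PR's actual signatures:

```go
package columns

import "context"

// sidecar is a simplified stand-in for a data column sidecar.
type sidecar struct {
	ColumnIndex uint64
}

// broadcastAll reuses a root computed once by the caller for every column,
// avoiding a per-column HashTreeRoot over the same signed block header
// (up to 128 redundant hashes per block).
func broadcastAll(
	ctx context.Context,
	root [32]byte,
	sidecars []*sidecar,
	broadcastColumn func(context.Context, [32]byte, *sidecar) error,
) error {
	for _, sc := range sidecars {
		if err := broadcastColumn(ctx, root, sc); err != nil {
			return err
		}
	}
	return nil
}
```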

dataColumnsWithholdCount := features.Get().DataColumnsWithholdCount

// Get the time corresponding to the start of the slot.
genesisTime := uint64(vs.TimeFetcher.GenesisTime().Unix())
slotStartTime, err := slots.ToTime(genesisTime, slot)
Review comment (Member):

As mentioned earlier, I think this can be done in the broadcast method itself; we do not need to provide it here.
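
A hedged sketch of that suggestion follows; the helper is illustrative, the import paths assume Prysm v5's module layout, and only slots.ToTime and primitives.Slot appear in the snippet above. The idea is that the broadcast method reads the slot from the sidecar's signed block header and derives the slot start time itself:

```go
package columns

import (
	"time"

	"github.com/prysmaticlabs/prysm/v5/consensus-types/primitives"
	"github.com/prysmaticlabs/prysm/v5/time/slots"
)

// slotStartTime derives the start time of a slot from the genesis time.
// In the broadcast method, the slot itself could be read from the sidecar's
// signed block header instead of being passed in by the caller.
func slotStartTime(genesis time.Time, slot primitives.Slot) (time.Time, error) {
	genesisTime := uint64(genesis.Unix())
	return slots.ToTime(genesisTime, slot)
}
```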

nalepae (Contributor, PR author) replied:

Addressed in the other comment.

nalepae (Contributor, PR author) commented on Sep 23, 2024:

> Maybe I am missing context: why are we using the data columns cache here in sync? This seems to duplicate existing functionality.

The context is explained here.

nalepae merged commit 5f896aa into peerDAS on Sep 23, 2024
14 of 15 checks passed
nalepae deleted the peerdas-misc branch on September 23, 2024 at 09:09
nalepae added a commit that referenced this pull request on Oct 7, 2024
* `scheduleReconstructedDataColumnsBroadcast`: Really minor refactor.

* `receivedDataColumnsFromRootLock` -> `dataColumnsFromRootLock`

* `reconstructDataColumns`: Stop looking into the DB to know if we have some columns.

Before this commit:
Each time we receive a column, we look into the filesystem for all the columns we already store.
==> For 128 columns, that amounts to 1 + 2 + 3 + ... + 128 = 128(128+1)/2 = 8,256 file lookups.

Also, as soon as a column is saved to the filesystem, we assume that looking at the filesystem again right afterwards will show the column as available (strict consistency). This is not always the case.

==> Because of this lack of strict filesystem consistency, we sometimes reconstruct and reseed columns more than once.

After this commit:
We use a (strictly consistent) cache to determine whether we have received a column.
==> No more consistency issues, and less stress on the filesystem.

* `dataColumnSidecarByRootRPCHandler`: Improve logging.

Before this commit, logged values assumed that all requested columns correspond to the same block root, which is not always the case.

After this commit, we know which columns are requested for which root.

* Add a log when broadcasting a data column.

This is useful to debug "lost data columns" in devnet.

* Address Nishant's comment
nalepae added eight further commits referencing this pull request (three more on Oct 7, one on Oct 8, three on Oct 23, and one on Oct 24, 2024), each carrying the same commit message as above.