Fetch data columns from multiple peers instead of just supernodes #14977
base: peerDAS
Conversation
…it can be used in both by range and by root requests
…rs instead of supernodes
I think we can remove all instances of AdmissibleCustodySamplingPeers.
…equests to multiple peers
Co-authored-by: Manu NALEPA <[email protected]>
beacon-chain/sync/data_columns.go
Outdated
for len(peers) > len(badPeers) && len(remainingColumns) > 0 {
	// Filter out bad peers from the admissible peers
	filteredDataColumnsByAdmissiblePeer := make(map[peer.ID]map[uint64]bool)
Instead of having dataColumnsByAdmissiblePeer, badPeers, and filteredDataColumnsByAdmissiblePeer, what about having only dataColumnsByAdmissiblePeer and goodPeers? At the end of the loop, we remove a peer from the goodPeers list if some columns from that peer are missing. That avoids this:
for p, cols := range dataColumnsByAdmissiblePeer {
if !badPeers[p] {
filteredDataColumnsByAdmissiblePeer[p] = cols
}
}
beacon-chain/sync/data_columns.go
Outdated
}

if len(remainingColumns) == 0 {
	break
Instead of breaking here (which is the happy path), I think it's better to put all the code after // Validate the received sidecars here. That way, exiting the for loop is the unhappy path, and there is no longer any need to test if len(remainingColumns) > 0.
That check was leftover from a previous iteration and isn't needed at all anymore. But I don't understand what you were suggesting about moving code around. I just removed it for now but let me know if there was something else you were looking for!
// RequestDataColumnSidecars sends a data column sidecars by root request to one
// or more peers that can provide the needed data columns.
func RequestDataColumnSidecars(
	ctx context.Context,
This function is not tested at all. I agree it's the kind of function that is hard to test. You can look at the tests that have been done for data columns by range to see how to create fake peers, etc.
Yep, started working on this already based on the blocks fetcher tests. Will be done today!
What type of PR is this?
Feature
What does this PR do? Why is it needed?
The initial implementation of data column sampling required all columns to be retrieved from the same peer (typically a supernode). This PR extracts the peer selection logic from the initial sync's block fetcher to be used when pending blocks are received as well. Requests for subsets of data columns are sent to each selected peer.
Which issue(s) does this PR fix?
Implements a portion of #14129
Other notes for review
Acknowledgements