
Pass messages from network crate to managers #147

Draft · wants to merge 4 commits into base: unstable
Conversation

@dknopik (Member) commented Feb 18, 2025

Passes messages to the application by converting them into messages understood by the QBFT and signing managers, and by directly calling their functions meant for receiving network messages. These functions queue the messages for consumption by the corresponding long-running tasks.
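For orientation, a minimal sketch of that pattern with hypothetical names and simplified types (not the crate's actual API): the network layer calls directly into the manager, which only queues the message; a long-running task drains the queue and does the actual work.

use std::sync::mpsc::{sync_channel, Receiver, SyncSender, TrySendError};

// Simplified stand-in for the decoded wire message.
struct NetworkMessage {
    committee_id: [u8; 32],
    payload: Vec<u8>,
}

// The manager owns the sending half of a bounded queue; its long-running
// task owns the receiving half and drains it.
struct QbftManagerSketch {
    tx: SyncSender<NetworkMessage>,
}

impl QbftManagerSketch {
    fn new(capacity: usize) -> (Self, Receiver<NetworkMessage>) {
        let (tx, rx) = sync_channel(capacity);
        (Self { tx }, rx)
    }

    // Called directly by the network layer; it only queues and never blocks.
    fn receive_network_message(&self, msg: NetworkMessage) {
        if let Err(TrySendError::Full(_)) = self.tx.try_send(msg) {
            eprintln!("qbft queue full, dropping message");
        }
    }
}

fn main() {
    let (manager, rx) = QbftManagerSketch::new(1024);

    // Long-running consumer, here just a thread for simplicity.
    let consumer = std::thread::spawn(move || {
        for msg in rx {
            println!("processing message for committee {:02x?}", &msg.committee_id[..4]);
            let _ = msg.payload;
        }
    });

    // Network side: convert the wire message and hand it to the manager.
    manager.receive_network_message(NetworkMessage {
        committee_id: [0u8; 32],
        payload: vec![1, 2, 3],
    });

    drop(manager); // dropping the sender ends the consumer loop
    consumer.join().unwrap();
}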

There is also some infrastructure created in this PR:

  • CommitteeId
  • Partial Signature Message

It includes the changes from #137, and therefore supersedes it.

@dknopik marked this pull request as ready for review February 19, 2025 08:19
@dknopik added the ready-for-review ("This PR is ready to be reviewed") and network labels Feb 19, 2025
let network = Network::try_new(
    &config.network,
    subnet_tracker,
    qbft_manager.clone(),

@diegomrsantos:

Do you think the network should be aware of the QBFT Manager? Could we communicate between them using a channel?

@dknopik (Member, Author):

Effectively, that's what the QBFT manager struct does: selecting the correct QBFT instance and sending a message to it.

@diegomrsantos:

Could this be achieved by decoupling the network and the manager and establishing communication between them using a single channel, through which all messages are routed within the manager?

@dknopik (Member, Author):

One more channel means one more channel that must do one of the following if it can't keep up:

  • grow unbounded
  • block
  • drop messages

Not sure if that is worth it.

@diegomrsantos:

Those are good points, but let's pause this for a while. I noticed we're using unbounded channels for both per-instance communication and network transmission. Since that could lead to memory issues in production, I was thinking that switching to bounded channels might be a good idea. What do you think?

@dknopik (Member, Author):

Agree on that.
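For reference, a minimal sketch of the difference, assuming tokio's mpsc channels are what the crate uses (an assumption about its channel type): a bounded channel caps memory but forces an explicit policy when the queue fills up.

use tokio::sync::mpsc;

#[tokio::main]
async fn main() {
    // Unbounded: sending never waits or fails due to capacity, but memory
    // grows without limit if the consumer falls behind.
    let (unbounded_tx, mut unbounded_rx) = mpsc::unbounded_channel::<Vec<u8>>();
    unbounded_tx.send(vec![1]).unwrap();

    // Bounded: capacity caps memory; when full, the sender must either
    // await (backpressure) or drop the message via try_send.
    let (bounded_tx, mut bounded_rx) = mpsc::channel::<Vec<u8>>(1024);
    if bounded_tx.try_send(vec![2]).is_err() {
        eprintln!("queue full, dropping message");
    }

    println!("{:?} {:?}", unbounded_rx.recv().await, bounded_rx.recv().await);
}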

@diegomrsantos:

Regarding the coupling:

  1. Direct method calls between Network → QbftManager create tight coupling
  2. Makes it difficult to test the network layer without QBFT logic

Central Channel-Based Solution:

Network → (channel) → QbftManager → (per-instance channels) → QBFT instances

Benefits:

  1. Enables testing the network in isolation by:
    • Mocking the output channel
    • Verifying sent messages without QBFT dependencies
  2. Allows testing the QBFT manager with:
    • Mocked network messages
  3. Clearer boundaries between layers
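A rough sketch of the routing this proposal implies, using hypothetical names and std channels for brevity (not the crate's actual types): the network holds only a sender, and the manager's task fans messages out to per-instance queues.

use std::collections::HashMap;
use std::sync::mpsc::{channel, Receiver, Sender};

type CommitteeId = [u8; 32];

struct NetMessage {
    committee_id: CommitteeId,
    payload: Vec<u8>,
}

// The network layer holds only this sender and knows nothing about QBFT.
fn network_side(tx: Sender<NetMessage>) {
    let _ = tx.send(NetMessage { committee_id: [0; 32], payload: vec![1, 2, 3] });
}

// The manager's long-running task routes each message to its instance queue.
fn manager_loop(rx: Receiver<NetMessage>, instances: HashMap<CommitteeId, Sender<Vec<u8>>>) {
    for msg in rx {
        if let Some(instance_tx) = instances.get(&msg.committee_id) {
            let _ = instance_tx.send(msg.payload);
        }
        // Messages for unknown committees are simply ignored.
    }
}

fn main() {
    let (net_tx, net_rx) = channel();
    let (instance_tx, instance_rx) = channel();
    let mut instances = HashMap::new();
    instances.insert([0u8; 32], instance_tx);

    let manager = std::thread::spawn(move || manager_loop(net_rx, instances));
    network_side(net_tx); // sender dropped when this returns, ending the loop
    manager.join().unwrap();

    println!("instance received: {:?}", instance_rx.recv().unwrap());
}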

@diegomrsantos:

I need to think deeper about it, but right now it seems to me the core challenge here isn't using a central channel, but what to do when a qbft instance's bounded channel is full.

@dknopik (Member, Author):

> I need to think deeper about it, but right now it seems to me the core challenge here isn't using a central channel, but what to do when a qbft instance's bounded channel is full.

We should discard the message. And that's fine. If it's happening due to temporary resource constraints, we can recover afterwards (and maybe even operate partially). If we block, we can also recover, but we block other incoming messages. If we grow unbounded, we crash.

The central channel makes it worse because we introduce another bottleneck.

@diegomrsantos:

> The central channel makes it worse because we introduce another bottleneck.

Could you elaborate more on how it's a bottleneck? Isn't it only the QBFT instances that process messages? If a specific message can't be delivered, it's dropped and the QBFT instance won't participate in this consensus round.

@@ -36,6 +37,14 @@ impl Cluster {
    pub fn get_f(&self) -> u64 {
        (self.cluster_members.len().saturating_sub(1) / 3) as u64
    }

    pub fn committee_id(&self) -> CommitteeId {

@diegomrsantos:

Would it be better to move this to the file where CommitteeId is defined? It'd receive the cluster members and return the Id.

@dknopik (Member, Author):

The one holding a Cluster should not need to fiddle with the fields and pass them somewhere; a utility function is clearer at the call site, I think. The actual logic (hashing the operator ids) is already contained in the committee.rs file.

@diegomrsantos commented Feb 19, 2025:

It's not important, but something like CommitteeId::from(&self.cluster_members); could be more natural. But I see, your motivation with this function was to make it even less verbose at the caller.

#[derive(Clone, Copy, Debug, Default, Eq, PartialEq, Hash, From, Deref)]
pub struct CommitteeId(pub [u8; COMMITTEE_ID_LEN]);

impl From<Vec<OperatorId>> for CommitteeId {

@diegomrsantos:

If we'd like to simplify the caller code, we could implement something like:

impl From<&IndexSet<OperatorId>> for CommitteeId {
    fn from(cluster_members: &IndexSet<OperatorId>) -> Self {
        let mut sorted: Vec<_> = cluster_members.iter().copied().collect();
        sorted.sort();
        let mut data: Vec<u8> = Vec::with_capacity(sorted.len() * 4);
        for id in sorted {
            data.extend_from_slice(&id.to_le_bytes());
        }
        keccak256(data).0.into()
    }
}

Then call let committee_id: CommitteeId = (&self.cluster_members).into();

@dknopik (Member, Author) commented Feb 19, 2025:

Thinking about bottlenecks makes me reconsider validation again. What do you think about moving all validation behind the QBFT manager queues to parallelize it, and only doing the most basic validation (e.g. is the message of interest to us?) before?
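A hedged sketch of that split, with hypothetical names and placeholder checks: only a cheap relevance test runs before queueing, while the expensive validation happens inside each instance's consumer and therefore runs in parallel across instances.

type CommitteeId = [u8; 32];

struct IncomingMessage {
    committee_id: CommitteeId,
    payload: Vec<u8>,
}

// Cheap pre-filter on the network path: only "is this message for us?".
fn is_of_interest(msg: &IncomingMessage, our_committees: &[CommitteeId]) -> bool {
    our_committees.contains(&msg.committee_id)
}

// Expensive checks (signatures, consensus rules) run later, inside each
// instance's long-running task; a placeholder check stands in here.
fn full_validation(msg: &IncomingMessage) -> bool {
    !msg.payload.is_empty()
}

fn main() {
    let our_committees = [[0u8; 32]];
    let msg = IncomingMessage { committee_id: [0u8; 32], payload: vec![1] };

    if is_of_interest(&msg, &our_committees) {
        // ...queue the message; the instance's consumer then runs:
        if full_validation(&msg) {
            println!("message accepted");
        }
    }
}

Under this split, the per-message cost on the network thread stays at a cheap lookup, while the heavy checks scale out with the number of instances.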

}
}

fn on_consensus_message_received(&mut self, message: SignedSSVMessage) {

@diegomrsantos:

Could it be moved to the qbft manager?

@diegomrsantos commented:

> Thinking about bottlenecks makes me reconsider validation again. What do you think about moving all validation behind the QBFT manager queues to parallelize it, and only doing the most basic validation (e.g. is the message of interest to us?) before?

Seems a great idea!

@dknopik marked this pull request as draft February 20, 2025 12:59
@dknopik removed the ready-for-review label Feb 21, 2025