feat: publish refinery instance ID during pubsub peer comms #1417

MikeGoldsmith · 2024-11-08T13:19:22Z

Which problem is this PR solving?

When publishing peers via redis pubsub, we need a way to identify whether a node can receive keep and drop decision messages. This is important when performing migrations that update an older deployment that does not support keep/drop decisions to one that does.

Also, it would also be nice to not have to rely purely on a peer address as a means of testing whether the instance if the same for determining whether we need to re-send trace decisions. For example, it's possible the same IP address is used even though the refinery instance has restarted.

This PR updates refinery to generate a new instance ID on start up and then publish the refinery's instance ID as part of it's peer register/unregister commands. As the message format is changing, any messages that does not conform to the new format will not be added as valid peers.

Short description of the changes

Generate and inject an instance ID during process start up in main.go
Update redis pubsub to take the instance ID via injection
Update redis pubsub peer commands to also specify the instance ID
Update marshal and unmarshall funcs to handle and require instance ID
Update peers to be a set of peerRecord's that has both ID and address
Update pubsub tests

kentquirk

The peer protocol was designed to be small because every pubsub message gets multiplied by N. Hence the 1-character command plus the address, which in most cases is going to be less than 20 characters. This adds 38 characters to it, which roughly triples the amount of data sent. I'm wondering if we could use, say, a 32-bit hash of the UUID instead so it only adds 8 characters?

The other problem, though, is that the new ID is only used as a flag to differentiate from the previous version of this protocol. It is not used to define uniqueness for the set of peers -- only the address does that. So once this rolls out, if a peer falls over and comes back up with the same address but a different ID, this does not do what it claims because the ID is not saved in the peer list.

I think to do what we want, we need to create a simple POD type like this:

type peerRecord struct {
  ID string
  Address string
}

And then the peers member becomes *generics.SetWithTTL[peerRecord].

kentquirk · 2024-11-08T23:54:51Z

I apologize in advance -- I started playing with this on my own machine, and decided to try writing MapWithTTL, and I liked that better, so I ended up creating #1420 because it was going to be hard to do as a modification to this PR.

If you dislike it we can use this one instead.

feat: publish refinery instance ID during pubsub peer comms

4d34ffa

MikeGoldsmith self-assigned this Nov 8, 2024

MikeGoldsmith requested a review from a team as a code owner November 8, 2024 13:19

kentquirk requested changes Nov 8, 2024

View reviewed changes

MikeGoldsmith and others added 4 commits November 8, 2024 17:15

make pubsub peers use a struct to hold ID and address

6faa9b0

sort the peer list when generating the hash

7d4d19d

Merge branch 'main' into mike/instance-id

80f1ffc

Merge branch 'main' into mike/instance-id

e6aa7c6

kentquirk mentioned this pull request Nov 8, 2024

feat: publish instanceID during peer comms #1420

Merged

MikeGoldsmith closed this in #1420 Nov 12, 2024

MikeGoldsmith closed this in 53f9dca Nov 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: publish refinery instance ID during pubsub peer comms #1417

feat: publish refinery instance ID during pubsub peer comms #1417

MikeGoldsmith commented Nov 8, 2024 •

edited

Loading

kentquirk left a comment •

edited

Loading

kentquirk commented Nov 8, 2024

feat: publish refinery instance ID during pubsub peer comms #1417

feat: publish refinery instance ID during pubsub peer comms #1417

Conversation

MikeGoldsmith commented Nov 8, 2024 • edited Loading

Which problem is this PR solving?

Short description of the changes

kentquirk left a comment • edited Loading

Choose a reason for hiding this comment

kentquirk commented Nov 8, 2024

MikeGoldsmith commented Nov 8, 2024 •

edited

Loading

kentquirk left a comment •

edited

Loading