chore: provide a way to manage RTT component networks when hosts "disappear" (on top of #15) #16

doudou · 2025-02-18T18:21:18Z

On top of #15

Whenever a remote host "disappears", a lot of operations related to dataflow are becoming blocking as well (have to wait until timeout), because these operations will call the remote side for disconnection.

This makes systems greatly unstable and misbehaving for a while, until all these calls clear. And kills the possibility for a system management layer to do the cleanup knowing what is happening, and gives situations where some half-channels will be left dangling (for instance, a task will get an OldData on a port because its part of the connection is still there).

This PR adds a new API without touching the current behaviour. The API allows to manage "half channels", that is the part of the channel that is within the process, without touching the remote side.

… RTT itself

…connection The issue with having a connendpoint without having the connection registered is that it crashes on disconnect, since the endpoint calls the port and then the port cannot find the connection

…update the policy Policy updating is needed to exfiltrate some information in the OOB transport case (namely, a name that explains what the other side should do to connect, as for instance the MQ name for the MQ transport). Turns out that only the output half is doing so, and the other take the policy as input. Ideally, we would also have cleaned up what information is or is not being passed to the other calls (the connect calls, for instance, really don't need much policy information), but that would be for another PR.

The current RTT behaviour is to have destructors explicitly disconnect channels. It's all well and good, but at destruction time things are ... unorderly. Allow to assume that a system manager will handle the cleanup when possible.

pierrewillenbrockdfki

Looks good to me. I was questioning the need for the remote_side_lock, but it turns out that the remote_side variable itself needs to be protected against concurrent access(independent of the reference counter and the referenced object).

I'll keep this in mind when reworking the cpp rock-display connection handling.

@maltewi this might be interesting for cnd/execution?

doudou · 2025-02-25T16:00:46Z

I'll keep this in mind when reworking the cpp rock-display connection handling.

A rock-display-like tool won't necessarily benefit from this. The current connection handling will continue working fine (the signalling flag is a lot more critical). A syskit-like tool, on the other hand, can definitely benefit from this in term of robustness in distributed systems. On local systems, really not that much. But the migration is quite a bit of work.

I'm still testing, there are some crashes.

I'd be happy to discuss it with (both of) you over a call if you'd like.

doudou added 6 commits February 14, 2025 10:33

fix: resolve warning about polymorphic exception

6259bf9

fix: expand the CORBA connection API to avoid all remote calls within…

e9df864

… RTT itself

chore: refactor the build*Half to pair endpoint creation with adding …

a9b4112

…connection The issue with having a connendpoint without having the connection registered is that it crashes on disconnect, since the endpoint calls the port and then the port cannot find the connection

feat: implement CRemoteChannelElement::disconnectHalf

5931584

doudou requested review from pierrewillenbrockdfki and jhonasiv February 18, 2025 18:21

doudou mentioned this pull request Feb 18, 2025

feat: add support for the explicit connection API (on top of #164) rock-core/tools-orocosrb#165

Open

1 task

doudou changed the title ~~chore: provide a way to manage RTT component networks when hosts "disappear"~~ chore: provide a way to manage RTT component networks when hosts "disappear" (on top of #15) Feb 18, 2025

pierrewillenbrockdfki approved these changes Feb 25, 2025

View reviewed changes

jhonasiv approved these changes Feb 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: provide a way to manage RTT component networks when hosts "disappear" (on top of #15) #16

chore: provide a way to manage RTT component networks when hosts "disappear" (on top of #15) #16

doudou commented Feb 18, 2025 •

edited

Loading

pierrewillenbrockdfki left a comment

doudou commented Feb 25, 2025

chore: provide a way to manage RTT component networks when hosts "disappear" (on top of #15) #16

Are you sure you want to change the base?

chore: provide a way to manage RTT component networks when hosts "disappear" (on top of #15) #16

Conversation

doudou commented Feb 18, 2025 • edited Loading

pierrewillenbrockdfki left a comment

Choose a reason for hiding this comment

doudou commented Feb 25, 2025

doudou commented Feb 18, 2025 •

edited

Loading