Pre restart probe fixup #25243

bashtanov · 2025-03-04T14:50:34Z

Following post-merge comment from @dotnwat on #24928

Backports Required

Release Notes

none

Pre-restart probe is a lengthy operation with scheduling points, report cache refresh may change the data it iterates over. Use lw_shared_ptr to hold 2 copies if pre-restart probe calculation and refresh work concurrently.

vbotbuildovich · 2025-03-04T19:11:08Z

CI test results

test results on build#62600

test_id	test_kind	job_url	test_status	passed
rptest.tests.datalake.cluster_restore_test.DatalakeClusterRestoreTest.test_slow_tiered_storage_dlq.cloud_storage_type=CloudStorageType.S3.catalog_type=CatalogType.REST_HADOOP	ducktape	https://buildkite.com/redpanda/redpanda/builds/62600#01956202-54e8-4a1a-ade7-f33bb276410f	FLAKY	1/3
rptest.tests.multi_restarts_with_archival_test.MultiRestartTest.test_recovery_after_multiple_restarts.cloud_storage_type=CloudStorageType.ABS	ducktape	https://buildkite.com/redpanda/redpanda/builds/62600#01956202-54e9-4b77-91af-87de5d095068	FLAKY	1/2
rptest.tests.timequery_test.TimeQueryTest.test_timequery_below_start_offset.spillover=False	ducktape	https://buildkite.com/redpanda/redpanda/builds/62600#019561ff-9685-4519-b998-4188089f5822	FLAKY	1/2
storage_e2e_single_thread_rpunit.storage_e2e_single_thread_rpunit	unit	https://buildkite.com/redpanda/redpanda/builds/62600#019561a4-0e7a-43e1-969e-7fbf1fbfa055	FLAKY	1/2

bharathv

lgtm, there are a ton loops in health_monitor_backend, would be great to get another pair of 👀 incase I missed something.

bharathv · 2025-03-05T18:36:36Z

src/v/cluster/health_monitor_backend.cc

    absl::erase_if(_status, not_in_members_table);

+    _reports = new_reports;


nit: _reports = std::move(new_reports);

dotnwat · 2025-03-06T02:32:15Z

src/v/cluster/health_monitor_backend.cc

    // local report
-    auto local_report_it = std::as_const(_reports).find(_self);
-    if (local_report_it == _reports.cend()) {
+    auto local_report_it = reports->find(_self);


It's not clear to me how using a shared pointer solves the concurrency problem. It looks like the we're still iterating over this structure with (presumably) shared access where iterators e.g. are held across scheduling points?

(my understanding)

all the loops with scheduling points operate on the (shared_ptr) copy of the report. The only modification to _report is in collect_cluster_health() which makes a new shared_ptr (report) and eventually moves it into _report while the original loop iteration can safely operate on a copy until it is alive.

Yes, we only change data stored under _reports in collect_cluster_health where we construct a new object and replace the shared pointer to it. At this point the old object ceases to be referred by _reports. If it is being iterated over in walk_local_and_remote_reports it is kept alive by reports shared pointer (or multiple copies thereof if there are multiple iterations in flight), otherwise it gets destructed immediately. When all iterations are over it eventually gets destructed.

so if we are making a copy for local concurrent-free iteration, what's the point of the shared pointer?

We never make a copy of the data explicitly, we only copy a shared pointer thus creating one more handle to the same dataset.

Also reports collection is never modified, we should change _reports type to lw_shard_ptr<const report_cache_t> (I will do this). The collection is only replaced as a whole. When it is replaced, this function still holds a shared pointer to the old version thus keeping it from being destroyed.

Does it shed any light?

I think it's a bit clearer now, thanks. If ya'll think it's safe then I'm good. It's pretty hard to analyze this stuff as a reviewer without seeing the big picture.

I actually tried to make it const and found a place where it is mutated! Thanks for the heads up.

bashtanov added 3 commits March 4, 2025 14:47

cluster/health_monitor/backend: remove unneeded inline kwds and include

f1df826

cluster/health_monitor/backend: pass functions as values

13cb6d2

bashtanov requested review from dotnwat, bharathv, ztlpn and mmaslankaprv March 4, 2025 14:50

github-actions bot added the area/redpanda label Mar 4, 2025

bashtanov mentioned this pull request Mar 4, 2025

Pre-restart probe tests #25236

Open

7 tasks

bharathv approved these changes Mar 5, 2025

View reviewed changes

dotnwat reviewed Mar 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pre restart probe fixup #25243

Pre restart probe fixup #25243

bashtanov commented Mar 4, 2025

vbotbuildovich commented Mar 4, 2025

bharathv left a comment

bharathv Mar 5, 2025

dotnwat Mar 6, 2025

bharathv Mar 6, 2025

bashtanov Mar 6, 2025

dotnwat Mar 7, 2025

bashtanov Mar 8, 2025

dotnwat Mar 8, 2025

bashtanov Mar 8, 2025

		absl::erase_if(_status, not_in_members_table);

		_reports = new_reports;

Pre restart probe fixup #25243

Are you sure you want to change the base?

Pre restart probe fixup #25243

Conversation

bashtanov commented Mar 4, 2025

Backports Required

Release Notes

vbotbuildovich commented Mar 4, 2025

CI test results

bharathv left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment