Track in/out pages in exchange #120867

dnhatn · 2025-01-26T06:50:12Z

This is a spin-off of the "retry node requests on shard-level failures" work.

Currently, a driver can execute against multiple shards simultaneously. If the execution fails and no pages have been added to the sink, we can retry the failed shards on another node. Similarly, if the entire data node request fails and no pages have been fetched or added to the exchange source, we can retry the entire request. This change adds callbacks to RemoteSink and ExchangeSink so that callers can track pages flowing in and out of the exchange.
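A minimal sketch of the retry rule described above, not the code in this PR. `TrackingSink` and `RetryDecision` are hypothetical stand-ins and do not mirror the real `ExchangeSink` or `RemoteSink` signatures; the sketch only assumes that the sink runs a `Runnable` each time a page is added, and that a failure is retryable only while that callback has never fired.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.Consumer;

// Hypothetical stand-in for an exchange sink; NOT the Elasticsearch ExchangeSink API.
final class TrackingSink {
    private final List<String> pages = new ArrayList<>();
    private final Runnable onPageAdded;

    TrackingSink(Runnable onPageAdded) {
        this.onPageAdded = onPageAdded;
    }

    void addPage(String page) {
        pages.add(page);
        onPageAdded.run(); // tell the caller that a page has been added to the sink
    }
}

// Hypothetical helper illustrating the "retry only if no pages were added" rule.
final class RetryDecision {
    /** Runs the work once; returns true only if it failed before any page was added. */
    static boolean failedAndRetryable(Consumer<TrackingSink> work) {
        AtomicBoolean pagesAdded = new AtomicBoolean(false);
        TrackingSink sink = new TrackingSink(() -> pagesAdded.set(true));
        try {
            work.accept(sink);
            return false; // success: nothing to retry
        } catch (RuntimeException e) {
            // Assumption: once a page has been added, a retry is no longer considered safe.
            return pagesAdded.get() == false;
        }
    }
}
```

In this sketch a failure before the first `addPage` call reports `true`, while a failure after at least one page reports `false`.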

elasticsearchmachine · 2025-01-26T15:57:52Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

idegtiarenko · 2025-01-27T08:11:20Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/plugin/ClusterComputeHandler.java

-                        exchangeSource,
-                        exchangeSink
+                        exchangeSource::createExchangeSource,
+                        () -> exchangeSink.createExchangeSink(() -> {})


It seems all prod implementations are passing a no-op Runnable. Could you please point me to the actual usage? Or is it going to be added in a later PR?

dnhatn · 2025-01-27

Its usage will be added later, in the "retry node requests on shard-level failures" work.

quux00 · 2025-01-27T15:55:12Z

For the use case cited, this looks fine as far as I can tell, though I'm not exactly sure how you'll use this in the next PR. But are there other use cases for this callback? If so, should we use something other than Runnable, in favor of something that returns metadata about the block? For example, would it be useful to know things like: 1) which node or cluster the block came from; 2) whether this is the last block and no other blocks will be coming. Those could be useful for metadata accounting, especially around CCS work or maybe the incremental results work that is planned for later this year.

dnhatn · 2025-01-27T16:25:21Z

@quux00 For my use case, the callback updates a shared atomic boolean, while others might need a page count. Therefore, I chose to pass a Runnable to allow callers to manage their metadata externally.
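To illustrate the point, here is a rough sketch of how one `Runnable` hook can back both kinds of bookkeeping. `PageCallbacks` and `emitPages` are hypothetical names invented for this example and are not part of the exchange code; `emitPages` only simulates a sink that runs the callback once per page.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical illustration; none of these names exist in the Elasticsearch code base.
final class PageCallbacks {
    // Simulates a sink that invokes the callback once for every page it receives.
    static void emitPages(int pages, Runnable onPage) {
        for (int i = 0; i < pages; i++) {
            onPage.run();
        }
    }

    // Caller A: only needs to know whether any page was seen.
    static boolean sawAnyPage(int pages) {
        AtomicBoolean seen = new AtomicBoolean(false);
        emitPages(pages, () -> seen.set(true));
        return seen.get();
    }

    // Caller B: keeps a page count using the same Runnable-shaped hook.
    static long countPages(int pages) {
        AtomicLong count = new AtomicLong();
        emitPages(pages, count::incrementAndGet);
        return count.get();
    }
}
```

The state lives entirely with the caller, so the hook itself stays minimal. The trade-off, as raised above, is that a `Runnable` carries no information about the page, such as its origin node or whether it is the last one.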

dnhatn · 2025-01-27T17:28:33Z

Thanks everyone!

elasticsearchmachine · 2025-01-27T17:30:21Z

💚 Backport successful

Status	Branch	Result
✅	8.x

Track in/out pages in exchange

f52b768

dnhatn added the auto-backport Automatically create backport pull requests when merged label Jan 26, 2025

elasticsearchmachine added the v9.0.0 label Jan 26, 2025

dnhatn added v8.18.0 :Analytics/ES|QL AKA ESQL >non-issue labels Jan 26, 2025

dnhatn requested review from smalyshev, quux00 and nik9000 January 26, 2025 15:56

dnhatn marked this pull request as ready for review January 26, 2025 15:57

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Jan 26, 2025

idegtiarenko reviewed Jan 27, 2025

dnhatn requested a review from idegtiarenko January 27, 2025 16:43

quux00 approved these changes Jan 27, 2025

nik9000 approved these changes Jan 27, 2025

dnhatn merged commit c971460 into elastic:main Jan 27, 2025
16 checks passed

dnhatn deleted the exchange-tracking-pages branch January 27, 2025 17:29

dnhatn mentioned this pull request Jan 27, 2025

[8.x] Track in/out pages in exchange (#120867) #120943

Merged
