[tuner] Run baseline benchmarks first with reusable helper function and regression detection #789

bangtianliu · 2025-01-08T19:36:25Z

This PR is relevant to address the task in: #783.

Run the baseline benchmark at the start to quickly assess whether the tuner is making progress.
Run the baseline benchmark both at the beginning and the end of tuning to detect potential performance regressions during the process of tuning (e.g., device overheating).

This PR adds a reusable helper function that takes a list of candidate indices and returns benchmarking results, allowing the same function to be used for baseline and candidate benchmark results.

Furthermore, this PR adds detection and logging for several conditions:

Uniqueness of baseline results: Verify the uniqueness of baseline results and logging a warning if duplicates are detected.
Consistency of device IDs across baseline runs: Ensure that the device IDs from two baseline runs match, logging a warning if any discrepancies are found.
Performance regression detection: Identify performance regressions between two baseline runs and logging a warning if regressions are encountered.
Also, unit tests are added to show the corner cases.

Max191

Overall looks good. Just a few nits about the logging and code reuse. Could you also add a more informative PR description (essentially explain the reasoning for benchmarking baselines twice like in the linked issue).

tuner/tuner/libtuner.py

Max191

Looks good, thanks! Just one nit.

tuner/tuner/libtuner.py

kuhar · 2025-01-09T15:54:42Z

Ideally, these corner cases should be covered by unit tests.

bangtianliu · 2025-01-09T20:43:54Z

WIP: need to rebase on top of #799, and continue to work.

kuhar

Can you edit the PR description and explain what error conditions you considered and how they are handled?

tuner/tuner/candidate_gen.py

tuner/tuner/libtuner.py

tuner/tuner/libtuner_test.py

Signed-off-by: Bangtian Liu <[email protected]>

bangtianliu requested review from kuhar and Max191 January 8, 2025 19:36

bangtianliu force-pushed the benchmark_baseline branch 2 times, most recently from bd88a9c to 46d6a87 Compare January 8, 2025 19:40

Max191 reviewed Jan 8, 2025

View reviewed changes

tuner/tuner/libtuner.py Outdated Show resolved Hide resolved

tuner/tuner/libtuner.py Outdated Show resolved Hide resolved

tuner/tuner/libtuner.py Outdated Show resolved Hide resolved

bangtianliu force-pushed the benchmark_baseline branch from 46d6a87 to 222f783 Compare January 9, 2025 00:35

bangtianliu requested a review from Max191 January 9, 2025 00:37

kuhar requested changes Jan 9, 2025

View reviewed changes

bangtianliu force-pushed the benchmark_baseline branch from 222f783 to 4b2165c Compare January 9, 2025 02:47

bangtianliu requested a review from kuhar January 9, 2025 02:54

Max191 approved these changes Jan 9, 2025

View reviewed changes

tuner/tuner/libtuner.py Outdated Show resolved Hide resolved

kuhar reviewed Jan 9, 2025

View reviewed changes

tuner/tuner/libtuner.py Outdated Show resolved Hide resolved

tuner/tuner/libtuner.py Outdated Show resolved Hide resolved

tuner/tuner/libtuner.py Outdated Show resolved Hide resolved

tuner/tuner/libtuner.py Outdated Show resolved Hide resolved

bangtianliu force-pushed the benchmark_baseline branch from 9593df8 to f139ee1 Compare January 9, 2025 23:17

bangtianliu requested a review from kuhar January 9, 2025 23:17

bangtianliu force-pushed the benchmark_baseline branch 3 times, most recently from 30ab7e4 to 6c88791 Compare January 9, 2025 23:22

kuhar reviewed Jan 10, 2025

View reviewed changes

bangtianliu force-pushed the benchmark_baseline branch 4 times, most recently from f4be426 to 6966557 Compare January 10, 2025 04:49

bangtianliu added 4 commits January 9, 2025 22:51

add helper functions and tests

b4ff0a5

Signed-off-by: Bangtian Liu <[email protected]>

continue to add helper functions and tests, fix mypy notes

99b0936

Signed-off-by: Bangtian Liu <[email protected]>

fix mypy issue

f2d5658

Signed-off-by: Bangtian Liu <[email protected]>

address reviewer comments

4c52419

Signed-off-by: Bangtian Liu <[email protected]>

bangtianliu force-pushed the benchmark_baseline branch from 6966557 to 4c52419 Compare January 10, 2025 04:51

bangtianliu changed the title ~~[tuner] Run baseline benchmarks first and check regression~~ [tuner] Run baseline benchmarks first with reusable helper function and regression detection Jan 10, 2025

bangtianliu requested a review from kuhar January 10, 2025 15:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[tuner] Run baseline benchmarks first with reusable helper function and regression detection #789

[tuner] Run baseline benchmarks first with reusable helper function and regression detection #789

bangtianliu commented Jan 8, 2025 •

edited

Loading

Max191 left a comment

Max191 left a comment •

edited

Loading

kuhar commented Jan 9, 2025

bangtianliu commented Jan 9, 2025

kuhar left a comment

[tuner] Run baseline benchmarks first with reusable helper function and regression detection #789

Are you sure you want to change the base?

[tuner] Run baseline benchmarks first with reusable helper function and regression detection #789

Conversation

bangtianliu commented Jan 8, 2025 • edited Loading

Max191 left a comment

Choose a reason for hiding this comment

Max191 left a comment • edited Loading

Choose a reason for hiding this comment

kuhar commented Jan 9, 2025

bangtianliu commented Jan 9, 2025

kuhar left a comment

Choose a reason for hiding this comment

bangtianliu commented Jan 8, 2025 •

edited

Loading

Max191 left a comment •

edited

Loading