This epic lists future ideas for model benchmarking from the onsite meeting.
Benchmarks
Investigate the following benchmarks, which are easy for the model team to run. We expect to start by adapting big code with our own tests, and to use the others as needed if big code can't handle some of the specific tests below.
Tests
Aspects to test:
Phases
continuously add tests over time to match what we're adding and testing in the extension
run once on multiple laptops/OSes/GPUs (using what the team has) to set a performance baseline and confirm the spec cutoff, or decide which model (or models) we run where
automate the pipeline with a fixed set of hardware
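As a rough illustration of the baseline phase (run once per machine, record what hardware it ran on and how long it took), a minimal sketch might look like the following. Everything here is an assumption for discussion: `machine_info`, `run_baseline`, and `placeholder_task` are hypothetical names, the task is a stand-in for a real model request, and GPU detection is left out because it depends on tooling this issue does not specify.

```python
import json
import platform
import statistics
import time


def machine_info():
    """Collect basic facts about the machine the benchmark runs on."""
    return {
        "os": platform.system(),
        "os_version": platform.release(),
        "arch": platform.machine(),
        "python": platform.python_version(),
    }


def run_baseline(task, repeats=5):
    """Time `task` several times and return a baseline record."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        task()
        timings.append(time.perf_counter() - start)
    return {
        "machine": machine_info(),
        "repeats": repeats,
        "median_seconds": statistics.median(timings),
        "max_seconds": max(timings),
    }


def placeholder_task():
    # Hypothetical stand-in for one model completion request.
    sum(i * i for i in range(10_000))


if __name__ == "__main__":
    # Print one JSON record per machine so results can be compared later.
    print(json.dumps(run_baseline(placeholder_task), indent=2))
```

Each team member could run this once and paste the JSON output into the issue. The same record format could later feed the automated pipeline on fixed hardware.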