You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We have a list of benchmark tasks which we'd like to run to validate that changes are having a positive impact on the quality of output.
The benchmarks will test that the tasks completed at all and measure how many epochs they took to complete.
Benchmarks will take a while to run and could be pretty expensive so we should make sure we have controls for managing this and cutoffs for maximum epochs.
The text was updated successfully, but these errors were encountered:
We have a list of benchmark tasks which we'd like to run to validate that changes are having a positive impact on the quality of output.
The benchmarks will test that the tasks completed at all and measure how many epochs they took to complete.
Benchmarks will take a while to run and could be pretty expensive so we should make sure we have controls for managing this and cutoffs for maximum epochs.
The text was updated successfully, but these errors were encountered: