Tournaments
Inspired in part by Tom Murphy's Elo World, tournaments provide a way to compare algorithms and agents in terms of their Elo rating. Elo ratings are generated by playing a round-robin tournament in which every competitor plays each other competitor N times. After the game outcomes are collected, the results are shuffled and ratings are adjusted game by game using the Elo update rule.
All players' ratings are initialized to 1500. The tournament simulation is run `n_tournaments` times to provide a better approximation of strength, as the order in which games are played can have a large effect on a competitor's final rating.
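
A minimal sketch of the procedure described above may help make the order dependence concrete. It assumes the standard Elo expected-score formula, a K-factor of 16, and that the final ratings from each run are averaged; none of these three choices is stated on this page, and the names `expected_score` and `simulate_ratings` are hypothetical.

```python
import random

INITIAL_RATING = 1500.0  # stated in the text above
K_FACTOR = 16.0          # assumption: the K-factor is not given on this page


def expected_score(rating_a, rating_b):
    """Standard Elo expected score of player A against player B."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


def simulate_ratings(results, names, n_tournaments, seed=0):
    """Replay the same game results in n_tournaments random orders.

    results: list of (player_a, player_b, score_a) tuples, where score_a is
             1.0 for a win by A, 0.5 for a draw, and 0.0 for a loss.
    Returns the mean final rating per player (averaging is an assumption).
    """
    rng = random.Random(seed)
    totals = {name: 0.0 for name in names}
    for _ in range(n_tournaments):
        ratings = {name: INITIAL_RATING for name in names}
        order = list(results)
        rng.shuffle(order)  # a different game order for each simulated run
        for a, b, score_a in order:
            delta = K_FACTOR * (score_a - expected_score(ratings[a], ratings[b]))
            ratings[a] += delta  # updates are zero-sum between the two players
            ratings[b] -= delta
        for name in names:
            totals[name] += ratings[name]
    return {name: totals[name] / n_tournaments for name in names}


# Hypothetical outcomes: "a" beats "b" three times and loses once.
example_results = [("a", "b", 1.0)] * 3 + [("a", "b", 0.0)]
example_ratings = simulate_ratings(example_results, ["a", "b"], n_tournaments=10)
```

Because each update depends on the ratings at the moment the game is processed, replaying the same results in a different order gives slightly different final ratings, which is why several shuffled runs are combined.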
When run in a notebook, a matchup heatmap is also generated. It shows each competitor's win rate against each of its opponents, with competitors ordered by Elo rating.
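
The data behind such a heatmap could be assembled roughly as follows. This is a sketch, not the project's plotting code: the function name `win_rate_matrix` is hypothetical, draws are counted as half a win (an assumption), and the plotting step itself is left out.

```python
def win_rate_matrix(results, ratings):
    """Build a win-rate matrix with rows and columns ordered by rating.

    results: list of (player_a, player_b, score_a) tuples, score_a in {1.0, 0.5, 0.0}.
    ratings: dict mapping player name to Elo rating.
    Returns (order, matrix), where matrix[i][j] is the win rate of order[i]
    against order[j], or None if the two never played (e.g. the diagonal).
    """
    order = sorted(ratings, key=ratings.get, reverse=True)  # strongest first
    wins = {(a, b): 0.0 for a in order for b in order}
    games = {(a, b): 0 for a in order for b in order}
    for a, b, score_a in results:
        wins[(a, b)] += score_a
        wins[(b, a)] += 1.0 - score_a
        games[(a, b)] += 1
        games[(b, a)] += 1
    matrix = [
        [wins[(a, b)] / games[(a, b)] if games[(a, b)] else None for b in order]
        for a in order
    ]
    return order, matrix


# Hypothetical data: "x" wins three of four games against "y".
demo_results = [("x", "y", 1.0)] * 3 + [("y", "x", 1.0)]
demo_order, demo_matrix = win_rate_matrix(demo_results, {"x": 1510.0, "y": 1490.0})
```

The resulting nested list can be handed to any heatmap plotting routine, with `demo_order` supplying the axis labels.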
- `n_games`: number of games to play between each competitor in the tournament
- `n_tournaments`: number of times to simulate the tournament (as Elo scores can greatly vary depending on game simulation order)
- `tournament_name`: name assigned to results file
- `competitors`: list of competitor objects of the format:
  - `name`: unique identifier assigned to competitor
  - `algo_config`: evaluator configuration
  - `checkpoint`: [when applicable] path to checkpoint file to load the competitor's model from
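
As an illustration of how these parameters fit together, here is a sketch of a tournament configuration written as a Python dictionary, with a small check on its shape. The on-disk format used by the project is not shown on this page, so the dictionary layout, every value in it, the placeholder checkpoint paths, and the helper `validate_config` are all hypothetical. The sketch also assumes `n_games` applies to each pair of competitors.

```python
from itertools import combinations

# Hypothetical example values; only the key names come from the list above.
tournament_config = {
    "n_games": 10,
    "n_tournaments": 100,
    "tournament_name": "example_tournament",
    "competitors": [
        # algo_config contents are evaluator-specific and not described here.
        {"name": "agent_a", "algo_config": {}, "checkpoint": "path/to/agent_a_checkpoint"},
        {"name": "agent_b", "algo_config": {}, "checkpoint": "path/to/agent_b_checkpoint"},
        {"name": "baseline", "algo_config": {}},  # no checkpoint: not applicable
    ],
}


def validate_config(config):
    """Check required keys and unique names, then return the round-robin pairings."""
    for key in ("n_games", "n_tournaments", "tournament_name", "competitors"):
        if key not in config:
            raise ValueError(f"missing required key: {key}")
    names = [competitor["name"] for competitor in config["competitors"]]
    if len(set(names)) != len(names):
        raise ValueError("competitor names must be unique")
    return list(combinations(names, 2))  # every competitor meets every other


pairings = validate_config(tournament_config)
# Assumption: n_games is played per pairing, so this is the size of one tournament.
total_games = len(pairings) * tournament_config["n_games"]
```

With three competitors there are three pairings, so this hypothetical configuration would play 30 games before the Elo simulation is run.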