AlphaZero v0.5.1
Closed issues:
- API discussion (#4)
- self play takes more and more time (#41)
- Supervised learning (#48)
- MCTS Optimization for sparse actions (#49)
- Training on the cloud / multiple instances / clusters (#50)
- Any Tips for per-player tracking? (#51)
- Sanity Checks (#52)
- Speed issues? (#53)
Merged pull requests:
- Mancala - fixed set_state!() (#44) (@michelangelo21)
- Invert temperature in formula (documentation) (#45) (@johannes-fischer)