Skip to content

Commit

Permalink
Update redesign/README.md
Browse files Browse the repository at this point in the history
Add an update in the redesign folder.
  • Loading branch information
jonathan-laurent authored Dec 30, 2024
1 parent e587e33 commit 00fe41d
Showing 1 changed file with 5 additions and 1 deletion.
6 changes: 5 additions & 1 deletion redesign/README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,9 @@
# AlphaZero.jl Redesign

## Update

Andreas Spanopoulos made great [progress](https://github.com/AndrewSpano/AlphaZero.jl/tree/dev/redesign) on this redesign project during his 2023 GSOC. Fabrice Rosay ran promising experiments and suggested [improvements](https://github.com/jonathan-laurent/AlphaZero.jl/issues/217). Some work remains to integrate those contributions, add proper documentation and reach feature parity with the original AlphaZero.jl library.

## Redesign Objectives

- A codebase that is more accessible and easier to read
Expand Down Expand Up @@ -102,4 +106,4 @@ By executing code directly in the editor window, the whole stack trace gets high
- [Werner Duvaud's implementation](https://github.com/werner-duvaud/muzero-general)
- [Duvaud's Tictactoe Params](https://github.com/werner-duvaud/muzero-general/blob/master/games/tictactoe.py)

Note that in the MuZero pseudocode, they seem to be updating the network every 1000 batch updates (batches have size 2048). There are 1e6 updates in total so this makes 1000 iterations. The buffer is surprisingly small with 1e6 samples.
Note that in the MuZero pseudocode, they seem to be updating the network every 1000 batch updates (batches have size 2048). There are 1e6 updates in total so this makes 1000 iterations. The buffer is surprisingly small with 1e6 samples.

0 comments on commit 00fe41d

Please sign in to comment.