AlphaZero modifications
I had to make a couple of changes to AlphaZero to apply it to games with randomness and more than two players.
More than two players
Handling more than two players is the easy one. AlphaZero uses values in the range 0-1 to represent state values for ...
explorationbias.hashnode.dev3 min read