How to build your own AlphaZero

With this, AlphaZero was born — the general algorithm for getting good at something, quickly, without any prior knowledge of human expert strategy. - Applied Data Science

caption

see also

Ref

Learning Algorithm summary

  • Mentally play through possible future scenarios, giving priority to promising paths, whilst also considering how others are most likely to react to your actions and continuing to explore the unknown.

  • After reaching a state that is unfamiliar, evaluate how favourable you believe the position to be and cascade the score back through previous positions in the mental pathway that led to this point.

  • After you’ve finished thinking about future possibilities, take the action that you’ve explored the most.

  • At the end of the game, go back and evaluate where you misjudged the value of the future positions and update your understanding accordingly.

Code

see also AlphaGo Zero cheat sheet

Dependencies to add to docker images

pip3 install matplotlib
pip3 install pydot pydot-ng
pip3 install graphviz

apt-get update && apt-get install graphviz
Written on July 9, 2020, Last update on September 26, 2020
AI jupyter tensorflow