A simplified, highly flexible, commented and (hopefully) easy to understand implementation of self-play based reinforcement learning based on the AlphaGo Zero paper (Silver et al). It is designed to ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results