A simplified, highly flexible, commented and (hopefully) easy to understand implementation of self-play based reinforcement learning based on the AlphaGo Zero paper (Silver et al). It is designed to ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results