David J. Wu, Accelerating Self-Play Learning in Go
David Silver et al., Mastering the game of Go without human knowledge
Ivo Danihelka et al., Policy Improvement By Planning with Gumbel
Brian Lee et al., Minigo: A Case Study in Reproducing Reinforcement Learning Research
This list is not exhaustive.