Csaba Szepesvari

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!

  • Head of the Foundations Team at DeepMind
  • Professor of Computer Science at the University of Alberta
  • Canada CIFAR AI Chair
  • Fellow at the Alberta Machine Intelligence Institute 
  • Co-Author of the book Bandit Algorithms along with Tor Lattimore, and author of the book Algorithms for Reinforcement Learning
References


(c) 2019 Robin Ranjit Singh Chauhan