Csaba Szepesvari

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!

  • Head of the Foundations Team at DeepMind 
  • Professor of Computer Science at the University of Alberta 
  • Canada CIFAR AI Chair 
  • Fellow at the Alberta Machine Intelligence Institute  
  • Co-author, with Tor Lattimore, of the book Bandit Algorithms, and author of the book Algorithms for Reinforcement Learning




© 2021 Robin Ranjit Singh Chauhan