Csaba Szepesvari

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!

  • Head of the Foundations Team at DeepMind 
  • Professor of Computer Science at the University of Alberta 
  • Canada CIFAR AI Chair 
  • Fellow at the Alberta Machine Intelligence Institute  
  • Co-Author of the book Bandit Algorithms along with Tor Lattimore, and author of the book Algorithms for Reinforcement Learning 

TalkRL on Apple Podcasts     TalkRL on Google Podcasts     TalkRL on Spotify    

© 2021 Robin Ranjit Singh Chauhan