Csaba Szepesvari

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!
  • Head of the Foundations Team at DeepMind 
  • Professor of Computer Science at the University of Alberta 
  • Canada CIFAR AI Chair 
  • Fellow at the Alberta Machine Intelligence Institute  
  • Co-Author of the book Bandit Algorithms along with Tor Lattimore, and author of the book Algorithms for Reinforcement Learning 

Creators and Guests

Robin Ranjit Singh Chauhan
Robin Ranjit Singh Chauhan
🌱 Head of Eng @AgFunder 🧠 AI:Reinforcement Learning/ML/DL/NLP🎙️Host @TalkRLPodcast 💳 ex-@Microsoft ecomm PgmMgr 🤖 @UWaterloo CompEng 🇨🇦 🇮🇳
Csaba Szepesvari
Broadcast by