![Robin Ranjit Singh Chauhan](https://img.transistor.fm/qrlD8dfZtBRez_zCzQ6Ah8j57V13cwEjT7sHiXJGHDI/rs:fill:400:400:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9wZXJz/b24vOWI4Yjg4ZWYt/YmRmZi00ZWFkLTk4/YTgtMTRkZGFlZDFh/ZTJkLzE2Njc0MjY0/MDktaW1hZ2UuanBn.webp)
Robin Ranjit Singh Chauhan
๐ฑ Head of Eng @AgFunder ๐ง AI:Reinforcement Learning/ML/DL/NLP๐๏ธHost @TalkRLPodcast ๐ณ ex-@Microsoft ecomm PgmMgr ๐ค @UWaterloo CompEng ๐จ๐ฆ ๐ฎ๐ณ
Appears in 53 Episodes
Marc G. Bellemare
Marc G. Bellemare shares insight on his work including Deep Q-Networks, Distributional RL, Project Loon and RL in the Stratosphere, the origins of the Arcade Learning ...
![](https://img.transistor.fm/aKLwk2U5MB4qXvmhsHHDASxU1HSamwAQThk4lzcLlwY/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzUxMzk5OS8x/NjMzNTM2ODUxLWFy/dHdvcmsuanBn.webp)
Robert Osazuwa Ness
Dr. Robert Osazuwa Ness on Causal Inference, Probabilistic and Generative Models, Causality and RL, AltDeep School of AI, Pyro, and more!
![](https://img.transistor.fm/jOBR7t3iceYC_u4gs0PLbwNCjVK31IDljyHwZsbAXZs/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzQ4MzQwNi8x/NjMyODYxMjg1LWFy/dHdvcmsuanBn.webp)
Marlos C. Machado
Marlos C. Machado on Arcade Learning Environment Evaluation, Generalization and Exploration in RL, Eigenoptions, Autonomous navigation of stratospheric balloons with R...
![](https://img.transistor.fm/WViNVymikK0H-J1PCNpZBkhV7qAUnUYjk43SiiGpCiI/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzUxMzg5NC8x/NjMyNzc5NjU1LWFy/dHdvcmsuanBn.webp)
Nathan Lambert
Nathan Lambert on Model-based RL, Trajectory-based models, Quadrotor control, Hyperparameter Optimization for MBRL, RL vs PID control, and more!
![](https://img.transistor.fm/KwDW4PjtFfn3eufrR3zCYNAhCMrDuxegLmC6ISgCLVY/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzQ5MDMyMi8x/NjMyODY5NzM2LWFy/dHdvcmsuanBn.webp)
Kai Arulkumaran
Kai Arulkumaran on AlphaStar and Evolutionary Computation, Domain Randomisation, Upside-Down Reinforcement Learning, Araya, NNAISENSE, and more!
![](https://img.transistor.fm/a0oW8DOlggtYmBtJqkGrHypc7L1bQc-TpA-9qJRIGLg/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzQ5MDMzMC8x/NjMyNzk4NjEyLWFy/dHdvcmsuanBn.webp)
Michael Dennis
Michael Dennis on Human-Compatible AI, Game Theory, PAIRED, ARCTIC, EPIC, and lots more!
![](https://img.transistor.fm/lpXX5cCKpBK7JYHfok6BGiIyhXMX0XEqRoQT-q4HmoI/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzQ0MTk4My8x/NjMzOTYwMjYxLWFy/dHdvcmsuanBn.webp)
Roman Ring
Roman Ring discusses the Research Engineer role at DeepMind, StarCraft II, AlphaStar, his bachelor's thesis, JAX, Julia, IMPALA and more!
![](https://img.transistor.fm/ws7SMZcmyh5bSJnw4QAh77x--LQW9szj3vA_Y2wK9yg/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzQzMTU5MC8x/NjMyNzc4MTUxLWFy/dHdvcmsuanBn.webp)
Shimon Whiteson
Shimon Whiteson on his WhiRL lab, his work at Waymo UK, variBAD, QMIX, co-operative multi-agent RL, StarCraft Multi-Agent Challenge, advice to grad students, and much ...
![](https://img.transistor.fm/QxJJml6Me25QoGs-TkNjTaXY895Q7V5cwPMExxSSVO8/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9zaG93/LzIwNDcvMTcwNzk1/NDcxMS1hcnR3b3Jr/LmpwZw.webp)
Aravind Srinivas
Aravind Srinivas on his work including CPC v2, RAD, CURL, and SUNRISE, unsupervised learning, teaching a Berkeley course, and more!
![](https://img.transistor.fm/XVOy1jVLxXW16w7miGwA9UbXzbPM5V2hBN8O8dCIDcg/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzM1NDY3MC8x/NjMyNzk5MjI0LWFy/dHdvcmsuanBn.webp)
Taylor Killian
Taylor Killian on the latest in RL for Health, including Hidden Parameter MDPs, Mimic III and Sepsis, Counterfactually Guided Policy Transfer and lots more!
![](https://img.transistor.fm/9SqObnsaLjURpreI9MaQKxx7GZ3iTL89swKKvaEAy6o/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzMxODg2My8x/NjMyNzc4NjQ1LWFy/dHdvcmsuanBn.webp)
Nan Jiang
Nan Jiang takes us deep into Model-based vs Model-free RL, Sim vs Real, Evaluation & Overfitting, RL Theory vs Practice and much more!
![](https://img.transistor.fm/q1Wt8GVud3mZKWInnfspwkQGQbcDUf4nB_dIGWjC780/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzI4Nzc2MC8x/NjMyNzgwNjIxLWFy/dHdvcmsuanBn.webp)
Danijar Hafner
Danijar Hafner takes us on an odyssey through deep learning & neuroscience, PlaNet, Dreamer, world models, latent dynamics, curious agents, and more!
![](https://img.transistor.fm/nrKovm6Lv6dVu1qmRhXvpQtDCN9qWmJGSc4v_Z5WNuo/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzIxNzAxMS8x/NjMyNzcxNzU5LWFy/dHdvcmsuanBn.webp)
Csaba Szepesvari
Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!
![](https://img.transistor.fm/vVlgtYk0ihcys9PHUFM9Voaku5-RLyJYarsYU-3ZXrc/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzIxNjU5My8x/NjMyODYxNTkyLWFy/dHdvcmsuanBn.webp)
Ben Eysenbach
Ben Eysenbach schools us on human supervision, SORB, DIAYN, techniques for exploration, teaching RL, virtual conferences, and much more!
![](https://img.transistor.fm/S3z0dzeLbE6BfI184FQkNRp5EdMGjXR3qW7O3OZISbA/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzIxNzAxMC8x/NjMyNzg0NDQxLWFy/dHdvcmsuanBn.webp)
NeurIPS 2019 Deep RL Workshop
Hear directly from presenters at the NeurIPS 2019 Deep RL Workshop on their work!
![](https://img.transistor.fm/QxJJml6Me25QoGs-TkNjTaXY895Q7V5cwPMExxSSVO8/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9zaG93/LzIwNDcvMTcwNzk1/NDcxMS1hcnR3b3Jr/LmpwZw.webp)
Scott Fujimoto
Scott Fujimoto expounds on his TD3 and BCQ algorithms, DDPG, Benchmarking Batch RL, and more!
![](https://img.transistor.fm/Z9fIqIGd8nNowWjt1L1-AmXA_HYUXL_laHpgRxzd-i8/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzEyMjk3My8x/NjMzMTUyMDk0LWFy/dHdvcmsuanBn.webp)
Jessica Hamrick
Jessica Hamrick sheds light on Model-based RL, Structured agents, Mental simulation, Metacontrol, Construction environments, Blueberries, and more!
![](https://img.transistor.fm/sZXaEvA2-4XsK68D3_f5YT6gNwvXFC0CBuj_6BxkDHc/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzExNDU0Mi8x/NjMyODYxMTUzLWFy/dHdvcmsuanBn.webp)
Pablo Samuel Castro
Pablo Samuel Castro drops in and drops knowledge on distributional RL, bisimulation, the Dopamine RL Framework, TF-Agents, and much more!
![](https://img.transistor.fm/fyhYqG5UpfusvPdTqf_JPlGxIMEN9KIkJnNv5qPk2gM/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzExNDU0MS8x/NjMyODYxNDExLWFy/dHdvcmsuanBn.webp)
Kamyar Azizzadenesheli
Kamyar Azizzadenesheli brings us insight on Bayesian RL, Generative Adversarial Tree search, what goes into great RL papers, and much more!
![](https://img.transistor.fm/QxJJml6Me25QoGs-TkNjTaXY895Q7V5cwPMExxSSVO8/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9zaG93/LzIwNDcvMTcwNzk1/NDcxMS1hcnR3b3Jr/LmpwZw.webp)
Antonin Raffin and Ashley Hill
Antonin Raffin and Ashley Hill discuss Stable Baselines past, present and future, State Representation Learning, S-RL Toolbox, RL on real robots, big compute for RL an...
![](https://img.transistor.fm/oPqyf1IwXjdIhtmGN-DIElT15fHVEPDZXSIoJsXRRNY/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzEwMTY0MS8x/NjMzMDI3OTA3LWFy/dHdvcmsuanBn.webp)
Michael Littman
ACM Fellow Professor Michael L Littman enlightens us on Human feedback in RL, his Udacity courses, Theory of Mind, organizing the RLDM Conference, RL past and present,...
![](https://img.transistor.fm/tEe0Vbvphj2tAxcC7Qvb0aQuD85A2eCecSppl_dMrnk/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzg3MTMwLzE2/MzI3OTg2ODYtYXJ0/d29yay5qcGc.webp)
Natasha Jaques
Natasha Jaques talks about her PhD, her papers on Social Influence in Multi-Agent RL, ML & Climate Change, Sequential Social Dilemmas, internships at DeepMind and Goog...
![](https://img.transistor.fm/YaDbLglL6lWBU2Qsi8A2uGSh0Dtlt8PGontnMS4SFb4/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9lcGlz/b2RlLzg0NTM3LzE2/MzI4NjE3MTItYXJ0/d29yay5qcGc.webp)
About TalkRL Podcast: All Reinforcement Learning, All the Time
Introducing TalkRL Podcast! Also check out our website at talkRL.com
![](https://img.transistor.fm/QxJJml6Me25QoGs-TkNjTaXY895Q7V5cwPMExxSSVO8/rs:fill:800:800:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9zaG93/LzIwNDcvMTcwNzk1/NDcxMS1hcnR3b3Jr/LmpwZw.webp)