TalkRL: The Reinforcement Learning Podcast | Robin Ranjit Singh Chauhan

Jakob Foerster

Jakob Foerster on Multi-Agent learning, Cooperation vs Competition, Emergent Communication, Zero-shot coordination, Opponent Shaping, agents for Hanabi and Prisoner's ...

May 7, 2023 / 01:03:45/E43

Danijar Hafner 2

Danijar Hafner on the DreamerV3 agent and world models, the Director agent and heirarchical RL, realtime RL on robots with DayDreamer, and his framework for unsupervi...

April 12, 2023 / 45:15/E42

Jeff Clune

AI Generating Algos, Learning to play Minecraft with Video PreTraining (VPT), Go-Explore for hard exploration, POET and Open Endedness, AI-GAs and ChatGPT, AGI predict...

March 27, 2023 / 01:11:11/E41

Natasha Jaques 2

Hear about why OpenAI cites her work in RLHF and dialog models, approaches to rewards in RLHF, ChatGPT, Industry vs Academia, PsiPhi-Learning, AGI and more! Dr Natash...

March 13, 2023 / 46:02/E40

Jacob Beck and Risto Vuorio

Jacob Beck and Risto Vuorio on their recent Survey of Meta-Reinforcement Learning. Jacob and Risto are Ph.D. students at Whiteson Research Lab at University of Oxford...

March 7, 2023 / 01:07:05/E39

John Schulman

John Schulman, OpenAI cofounder and researcher, inventor of PPO/TRPO talks RL from human feedback, tuning GPT-3 to follow instructions (InstructGPT) and answer long-fo...

October 18, 2022 / 44:21/E38

Sven Mika

Sven Mika of Anyscale on RLlib present and future, Ray and Ray Summit 2022, applied RL in Games / Finance / RecSys, and more!

August 18, 2022 / 34:56/E37

Karol Hausman and Fei Xia

Karol Hausman and Fei Xia of Google Research on newly updated (PaLM-)SayCan, Inner Monologue, robot learning, combining robotics with language models, and more!

August 16, 2022 / 01:03:09/E36

Sai Krishna Gottipati

Sai Krishna Gottipati of AI Redefined on RL for synthesizable drug discovery, Multi-Teacher Self-Play, Cogment framework for realtime multi-actor RL, AI + Chess, and m...

July 31, 2022 / 01:08:11/E35

Aravind Srinivas 2

Aravind Srinivas, Research Scientist at OpenAI, returns to talk Decision Transformer, VideoGPT, choosing problems, and explore vs exploit in research careers

May 8, 2022 / 58:33/E34

Rohin Shah

DeepMind Research Scientist Dr. Rohin Shah on Value Alignment, Learning from Human feedback, Assistance paradigm, the BASALT MineRL competition, his Alignment Newslett...

April 11, 2022 / 01:37:04/E33

Robert Lange

Robert Lange on learning vs hard-coding, meta-RL, Lottery Tickets and Minimal Task Representations, Action Grammars and more!

December 20, 2021 / 01:10:57/E31

NeurIPS 2021 Political Economy of Reinforcement Learning Systems (PERLS) Workshop

Dr. Thomas Gilbert and Dr. Mark Nitzberg on the upcoming PERLS Workshop @ NeurIPS 2021

November 18, 2021 / 24:07/E30

Amy Zhang

Amy Zhang shares her work on Invariant Causal Prediction for Block MDPs, Multi-Task Reinforcement Learning with Context-based Representations, MBRL-Lib, shares insight...

September 27, 2021 / 01:09:35/E29

Xianyuan Zhan

Xianyuan Zhan on DeepThermal for controlling thermal power plants, the MORE algorithm for Model-based Offline RL, comparing AI in China and the US, and more!

August 30, 2021 / 41:30/E28

Eugene Vinitsky

Eugene Vinitsky of UC Berkeley on social norms and sanctions, traffic simulation, mixed-autonomy traffic, and more!

August 18, 2021 / 01:06:02/E27

Jess Whittlestone

Jess Whittlestone on societal implications of deep reinforcement Learning, AI policy, warning signs of transformative progress in AI, and more!

July 20, 2021 / 01:31:36/E26

Aleksandra Faust

Aleksandra Faust of Google Brain Research on AutoRL, meta-RL, learning to learn & learning to teach, curriculum learning, collaborations between senior and junior rese...

July 6, 2021 / 54:30/E25

Sam Ritter

Sam Ritter of DeepMind on Neuroscience and RL, Episodic Memory, Meta-RL, Synthetic Returns, the MERLIN agent, decoding brain activation, and more!

June 21, 2021 / 01:40:35/E24

Thomas Krendl Gilbert

Thomas Krendl Gilbert on the Political Economy of Reinforcement Learning Systems & Autonomous Vehicles, Sociotechnical Commitments, AI Development for the Public Inter...

May 17, 2021 / 01:12:14/E23

Marc G. Bellemare

Marc G. Bellemare shares insight on his work including Deep Q-Networks, Distributional RL, Project Loon and RL in the Stratosphere, the origins of the Arcade Learning ...

May 12, 2021 / 57:40/E22

Robert Osazuwa Ness

Dr. Robert Osazuwa Ness on Causal Inference, Probabilistic and Generative Models, Causality and RL, AltDeep School of AI, Pyro, and more!

May 8, 2021 / 01:18:43/E21

Marlos C. Machado

Marlos C. Machado on Arcade Learning Environment Evaluation, Generalization and Exploration in RL, Eigenoptions, Autonomous navigation of stratospheric balloons with R...

April 12, 2021 / 01:31:31/E20

Nathan Lambert

Nathan Lambert on Model-based RL, Trajectory-based models, Quadrotor control, Hyperparameter Optimization for MBRL, RL vs PID control, and more!

March 22, 2021 / 50:35/E19

Kai Arulkumaran

Kai Arulkumaran on AlphaStar and Evolutionary Computation, Domain Randomisation, Upside-Down Reinforcement Learning, Araya, NNAISENSE, and more!

March 15, 2021 / 46:26/E18

Michael Dennis

Michael Dennis on Human-Compatible AI, Game Theory, PAIRED, ARCTIC, EPIC, and lots more!

January 25, 2021 / 01:00:50/E17

Roman Ring

Roman Ring discusses the Research Engineer role at DeepMind, StarCraft II, AlphaStar, his bachelor's thesis, JAX, Julia, IMPALA and more!

January 11, 2021 / 42:23/E16

Shimon Whiteson

Shimon Whiteson on his WhiRL lab, his work at Waymo UK, variBAD, QMIX, co-operative multi-agent RL, StarCraft Multi-Agent Challenge, advice to grad students, and much ...

December 6, 2020 / 53:35/E15

Aravind Srinivas

Aravind Srinivas on his work including CPC v2, RAD, CURL, and SUNRISE, unsupervised learning, teaching a Berkeley course, and more!

September 20, 2020 / 01:25:27/E14

Taylor Killian

Taylor Killian on the latest in RL for Health, including Hidden Parameter MDPs, Mimic III and Sepsis, Counterfactually Guided Policy Transfer and lots more!

August 17, 2020 / 01:29:55/E13

Appears in 73 Episodes