All Episodes


Arash Ahmadian on Rethinking RLHF

Arash Ahmadian is a Researcher at Cohere and Cohere For AI focused on Preference Training of large language models. He’s also a researcher at the Vector Institute of ...

Glen Berseth on RL Conference

Glen Berseth is an assistant professor at the Université de Montréal, a core academic member of the Mila - Quebec AI Institute, a Canada CIFAR AI chair, and a member of l'Insti...

Ian Osband

Ian Osband is a Research scientist at OpenAI (ex DeepMind, Stanford) working on decision making under uncertainty.  We spoke about: - Information theory and RL - Explo...

Sharath Chandra Raparthy

Sharath Chandra Raparthy on In-Context Learning for Sequential Decision Tasks, GFlowNets, and more!  Sharath Chandra Raparthy is an AI Resident at FAIR at Meta, and di...

Pierluca D'Oro and Martin Klissarov

Pierluca D'Oro and Martin Klissarov on Motif and RLAIF, Noisy Neighborhoods and Return Landscapes, and more!  Pierluca D'Oro is a PhD student at Mila and visiting resear...

Martin Riedmiller

Martin Riedmiller of Google DeepMind on controlling nuclear fusion plasma in a tokamak with RL, the original Deep Q-Network, Neural Fitted Q-Iteration, Collect and Inf...

Max Schwarzer

Max Schwarzer is a PhD student at Mila, with Aaron Courville and Marc Bellemare, interested in RL scaling, representation learning for RL, and RL for science.  Max spe...

Julian Togelius

Julian Togelius is an Associate Professor of Computer Science and Engineering at NYU, and Cofounder and research director at modl.ai  Featured References  Choose Your ...

Jakob Foerster

Jakob Foerster on Multi-Agent learning, Cooperation vs Competition, Emergent Communication, Zero-shot coordination, Opponent Shaping, agents for Hanabi and Prisoner's ...

Danijar Hafner 2

Danijar Hafner on the DreamerV3 agent and world models, the Director agent and hierarchical RL, realtime RL on robots with DayDreamer, and his framework for unsupervi...

Jeff Clune

AI Generating Algos, Learning to play Minecraft with Video PreTraining (VPT), Go-Explore for hard exploration, POET and Open Endedness, AI-GAs and ChatGPT, AGI predict...

Natasha Jaques 2

Hear about why OpenAI cites her work in RLHF and dialog models, approaches to rewards in RLHF, ChatGPT, Industry vs Academia, PsiPhi-Learning, AGI and more!  Dr Natash...

Jacob Beck and Risto Vuorio

Jacob Beck and Risto Vuorio on their recent Survey of Meta-Reinforcement Learning.  Jacob and Risto are Ph.D. students at Whiteson Research Lab at the University of Oxford...

John Schulman

John Schulman, OpenAI cofounder and researcher, and inventor of PPO/TRPO, talks RL from human feedback, tuning GPT-3 to follow instructions (InstructGPT) and answer long-fo...

Sven Mika

Sven Mika of Anyscale on RLlib present and future, Ray and Ray Summit 2022, applied RL in Games / Finance / RecSys, and more!

Karol Hausman and Fei Xia

Karol Hausman and Fei Xia of Google Research on newly updated (PaLM-)SayCan, Inner Monologue, robot learning, combining robotics with language models, and more!

Sai Krishna Gottipati

Sai Krishna Gottipati of AI Redefined on RL for synthesizable drug discovery, Multi-Teacher Self-Play, Cogment framework for realtime multi-actor RL, AI + Chess, and m...

Aravind Srinivas 2

Aravind Srinivas, Research Scientist at OpenAI, returns to talk Decision Transformer, VideoGPT, choosing problems, and explore vs exploit in research careers

Rohin Shah

DeepMind Research Scientist Dr. Rohin Shah on Value Alignment, Learning from Human feedback, Assistance paradigm, the BASALT MineRL competition, his Alignment Newslett...

Jordan Terry

Jordan Terry on maintaining Gym and PettingZoo, hardware accelerated environments and the future of RL, environment models for multi-agent RL, and more!

Robert Lange

Robert Lange on learning vs hard-coding, meta-RL, Lottery Tickets and Minimal Task Representations, Action Grammars and more!

NeurIPS 2021 Political Economy of Reinforcement Learning Systems (PERLS) Workshop

Dr. Thomas Gilbert and Dr. Mark Nitzberg on the upcoming PERLS Workshop @ NeurIPS 2021

Amy Zhang

Amy Zhang shares her work on Invariant Causal Prediction for Block MDPs, Multi-Task Reinforcement Learning with Context-based Representations, MBRL-Lib, shares insight...

Xianyuan Zhan

Xianyuan Zhan on DeepThermal for controlling thermal power plants, the MORE algorithm for Model-based Offline RL, comparing AI in China and the US, and more!

Eugene Vinitsky

Eugene Vinitsky of UC Berkeley on social norms and sanctions, traffic simulation, mixed-autonomy traffic, and more!

Jess Whittlestone

Jess Whittlestone on societal implications of deep reinforcement learning, AI policy, warning signs of transformative progress in AI, and more!

Aleksandra Faust

Aleksandra Faust of Google Brain Research on AutoRL, meta-RL, learning to learn & learning to teach, curriculum learning, collaborations between senior and junior rese...

Sam Ritter

Sam Ritter of DeepMind on Neuroscience and RL, Episodic Memory, Meta-RL, Synthetic Returns, the MERLIN agent, decoding brain activation, and more!

Thomas Krendl Gilbert

Thomas Krendl Gilbert on the Political Economy of Reinforcement Learning Systems & Autonomous Vehicles, Sociotechnical Commitments, AI Development for the Public Inter...

Marc G. Bellemare

Marc G. Bellemare shares insight on his work including Deep Q-Networks, Distributional RL, Project Loon and RL in the Stratosphere, the origins of the Arcade Learning ...
