TalkRL: The Reinforcement Learning Podcast

By: Robin Ranjit Singh Chauhan
  • Summary

  • TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.
    © 2024 Robin Ranjit Singh Chauhan
Episodes
  • Vincent Moens on TorchRL
    Apr 8 2024

    Dr. Vincent Moens is an Applied Machine Learning Research Scientist at Meta, and an author of TorchRL and TensorDict in PyTorch.

    Featured References

    TorchRL: A data-driven decision-making library for PyTorch
    Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni De Fabritiis, Vincent Moens


    Additional References

    • TorchRL on GitHub
    • TensorDict Documentation


    40 min
  • Arash Ahmadian on Rethinking RLHF
    Mar 25 2024

    Arash Ahmadian is a researcher at Cohere and Cohere For AI focused on preference training of large language models. He’s also a researcher at the Vector Institute.

    Featured Reference

    Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

    Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker


    Additional References

    • Self-Rewarding Language Models, Yuan et al 2024
    • Reinforcement Learning: An Introduction, Sutton and Barto 1998
    • Learning from Delayed Rewards, Chris Watkins 1989
    • Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, Williams 1992
    34 min
  • Glen Berseth on RL Conference
    Mar 11 2024

    Glen Berseth is an assistant professor at the Université de Montréal, a core academic member of the Mila - Quebec AI Institute, a Canada CIFAR AI Chair, a member of l'Institut Courtois, and co-director of the Robotics and Embodied AI Lab (REAL).

    Featured Links

    Reinforcement Learning Conference

    Closing the Gap between TD Learning and Supervised Learning--A Generalisation Point of View
    Raj Ghugare, Matthieu Geist, Glen Berseth, Benjamin Eysenbach

    22 min
