    reinforcement learning

    Explore "reinforcement learning" with insightful episodes like "Study: Reinforcement Learning from AI Feedback Performs As Well As Human Feedback", "HIBT Lab! OpenAI: Sam Altman", "#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning" and "Oriol Vinyals: DeepMind AlphaStar, StarCraft, Language, and Sequences" from podcasts like "The AI Breakdown: Daily Artificial Intelligence News and Discussions", "How I Built This with Guy Raz" and "Lex Fridman Podcast", and more!

    Episodes (4)

    Study: Reinforcement Learning from AI Feedback Performs As Well As Human Feedback

    Today on The AI Breakdown, NLW looks at new research from Google that shows that reinforcement learning using artificial intelligence rather than human feedback could perform as well as RLHF. Before that on the Brief: the first AI pop singer gets a record deal, an AI-produced COVID drug moves to phase 1 trials, and more.

    Today's Sponsor: Supermanage - AI for 1-on-1's - https://supermanage.ai/breakdown

    ABOUT THE AI BREAKDOWN
    The AI Breakdown helps you understand the most important news and discussions in AI.
    Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe
    Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown
    Join the community: bit.ly/aibreakdown
    Learn more: http://breakdown.network/

    HIBT Lab! OpenAI: Sam Altman

    Artificial Intelligence was once the realm of science fiction. But over the last several years, advances in machine learning and deep neural networks have moved us closer to a reality where computers can learn and solve problems independently, the way a human does. From art and music to medicine and politics, the potential applications of AI are nearly endless, and the technology just keeps getting better.


    This week on How I Built This Lab, Guy talks with one of the leaders in the field of AI development, Sam Altman. Sam talks about his journey from Stanford dropout and teenage entrepreneur to president of the legendary startup incubator Y Combinator and co-founder of the nonprofit OpenAI. Plus, Sam shares his hopes and fears for the future of AI and how his company is working to ensure it ultimately benefits all of humanity.


    #86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

    David Silver leads the reinforcement learning research group at DeepMind. He was lead researcher on AlphaGo and AlphaZero, co-lead on AlphaStar and MuZero, and has done a lot of other important work in reinforcement learning.

    Support this podcast by signing up with these sponsors:
    - MasterClass: https://masterclass.com/lex
    - Cash App - use code "LexPodcast" and download:
    - Cash App (App Store): https://apple.co/2sPrUHe
    - Cash App (Google Play): https://bit.ly/2MlvP5w

    EPISODE LINKS:
    Reinforcement learning (book): https://amzn.to/2Jwp5zG

    This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

    Here's the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

    OUTLINE:
    00:00 - Introduction
    04:09 - First program
    11:11 - AlphaGo
    21:42 - Rule of the game of Go
    25:37 - Reinforcement learning: personal journey
    30:15 - What is reinforcement learning?
    43:51 - AlphaGo (continued)
    53:40 - Supervised learning and self play in AlphaGo
    1:06:12 - Lee Sedol retirement from Go play
    1:08:57 - Garry Kasparov
    1:14:10 - Alpha Zero and self play
    1:31:29 - Creativity in AlphaZero
    1:35:21 - AlphaZero applications
    1:37:59 - Reward functions
    1:40:51 - Meaning of life

    Oriol Vinyals: DeepMind AlphaStar, StarCraft, Language, and Sequences

    Oriol Vinyals is a senior research scientist at Google DeepMind. Before that he was at Google Brain and Berkeley. His research has been cited over 39,000 times. He is one of the most brilliant and impactful minds in the field of deep learning. He is behind some of the biggest papers and ideas in AI, including sequence to sequence learning, audio generation, image captioning, neural machine translation, and reinforcement learning. He is a co-lead (with David Silver) of the AlphaStar project, creating an agent that defeated a top professional at the game of StarCraft. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations.