#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning
from Lex Fridman Podcast
by Lex Fridman
Published: Fri Apr 03 2020
Show Notes
David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.
Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w
EPISODE LINKS:
Reinforcement learning (book):
https://amzn.to/2Jwp5zG
This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on ApplePodcasts, follow on Spotify, or support it on Patreon.
Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.
OUTLINE:
– Introduction
– First program
– AlphaGo
– Rule of the game of Go
– Reinforcement learning: personal journey
– What is reinforcement learning?
– AlphaGo (continued)
– Supervised learning and self play in AlphaGo
– Lee Sedol retirement from Go play
– Garry Kasparov
– Alpha Zero and self play
– Creativity in AlphaZero
– AlphaZero applications
– Reward functions
– Meaning of life