Category Archives: Reinforcement Learning

LM101-073: How to Build a Machine that Learns Checkers (remix)

Episode Summary: This is a remix of the original second episode of Learning Machines 101, which describes in a little more detail how the computer program that Arthur Samuel developed in 1959 learned to play checkers by itself without human intervention using a mixture of classical artificial intelligence search…

LM101-063: How to Transform a Supervised Learning Machine into a Policy Gradient Reinforcement Learning Machine

Episode Summary: This 63rd episode of Learning Machines 101 discusses how to build reinforcement learning machines which become smarter with experience but do not use this acquired knowledge to modify their actions and behaviors. This episode explains how to build reinforcement learning machines…

LM101-062: How to Transform a Supervised Learning Machine into a Value Function Reinforcement Learning Machine

Episode Summary: This 62nd episode of Learning Machines 101 discusses how to design reinforcement learning machines using your knowledge of how to build supervised learning machines! Specifically, we focus on Value Function Reinforcement Learning Machines which estimate the unobservable total penalty associated with…

LM101-061: What happened at the Reinforcement Learning Tutorial? (RERUN)

Episode Summary: This is the third of a short subsequence of podcasts providing a summary of events associated with Dr. Golden’s recent visit to the 2015 Neural Information Processing Systems Conference. This is one of the top conferences in the field of Machine Learning. This episode reviews and discusses…

LM101-049: How to Experiment with Lunar Lander Software

Episode Summary: In this episode we continue the discussion of learning when the actions of the learning machine can alter the characteristics of the learning machine’s statistical environment. We describe how to download free lunar lander software so you can experiment with an autopilot for a lunar lander module that…

LM101-048: How to Build a Lunar Lander Autopilot Learning Machine (Rerun)

Episode Summary: In this episode we consider the problem of learning when the actions of the learning machine can alter the characteristics of the learning machine’s statistical environment. We illustrate the solution to this problem by designing an autopilot for a lunar lander module that learns from…

LM101-044: What happened at the Deep Reinforcement Learning Tutorial at the 2015 Neural Information Processing Systems Conference?

Episode Summary: This is the third of a short subsequence of podcasts providing a summary of events associated with Dr. Golden’s recent visit to the 2015 Neural Information Processing Systems Conference. This is one of the top conferences in the field…

LM101-035: What is a Neural Network and What is a Hot Dog?

Episode Summary: In this episode, we address the important questions of “What is a neural network?” and “What is a hot dog?” by discussing human brains, neural networks that learn to play Atari video games, and rat brain neural networks. Show Notes: Hello everyone! Welcome to the thirty-fifth podcast in the podcast series Learning Machines 101. In this…

LM101-025: How to Build a Lunar Lander Autopilot Learning Machine (adaptive control)

Episode Summary: In this episode we consider the problem of learning when the actions of the learning machine can alter the characteristics of the learning machine’s statistical environment. We illustrate the solution to this problem by designing an autopilot for a lunar lander module that learns…

LM101-002: How to Build a Machine that Learns to Play Checkers

Episode Summary: In this episode, we explain how to build a machine that learns to play checkers. The solution to this problem involves several key ideas which are fundamental to building systems which are artificially intelligent. Show Notes: Hello everyone! Welcome to the second podcast in the podcast series Learning Machines 101. In this series of podcasts my…

LM101-001: Welcome to the Big Artificial Intelligence Magic Show!

Episode Summary: This episode discusses the similarities between designing an android that can command a starship and designing an android that can play a game of checkers. In addition, the mystery underlying both artificial intelligence and biological intelligence is discussed. Show Notes: Artificial Intelligence (AI) is a field of…