Tag Archives: gradient descent

LM101-085: Ch7: How to Guarantee Your Batch Learning Algorithm Converges

This 85th episode of Learning Machines 101 discusses formal convergence guarantees for a broad class of machine learning algorithms designed to minimize smooth non-convex objective functions using batch learning methods. In particular, it covers a broad class of unsupervised, supervised, and reinforcement machine learning algorithms that iteratively update their parameter vector by adding a perturbation computed from all of the training data. This process is repeated until a parameter vector is generated which exhibits improved predictive performance. The magnitude of the perturbation at each learning iteration is called the “stepsize” or “learning rate”, and the direction of the perturbation vector is called the “search direction”. Simple mathematical formulas are presented, based upon research from the late 1960s by Philip Wolfe and G. Zoutendijk, that ensure convergence of the generated sequence of parameter vectors. These formulas may be used as the basis for the design of artificially intelligent automatic learning rate selection algorithms. The material in this podcast is designed to provide an overview of Chapter 7 of my new book “Statistical Machine Learning” and is based upon material originally presented in Episode 68 of Learning Machines 101!
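To make the Wolfe conditions concrete: a stepsize alpha along a search direction p is acceptable when it produces sufficient decrease, f(x + alpha*p) <= f(x) + c1*alpha*grad(x)'p, and satisfies the curvature condition, grad(x + alpha*p)'p >= c2*grad(x)'p, with 0 < c1 < c2 < 1. Below is a minimal Python sketch of batch gradient descent with a bisection stepsize search that enforces both conditions; the test objective, the constants c1 and c2, and the bisection scheme are illustrative choices of mine, not formulas taken from the book or the episode.

```python
import numpy as np

def wolfe_stepsize(f, grad, x, p, c1=1e-4, c2=0.9, max_iters=50):
    """Bisection search for a stepsize alpha satisfying both Wolfe conditions."""
    fx, slope = f(x), grad(x) @ p   # slope: directional derivative (negative for descent)
    lo, hi, alpha = 0.0, np.inf, 1.0
    for _ in range(max_iters):
        if f(x + alpha * p) > fx + c1 * alpha * slope:
            hi = alpha                        # sufficient decrease fails: step too long
        elif grad(x + alpha * p) @ p < c2 * slope:
            lo = alpha                        # curvature condition fails: step too short
        else:
            return alpha                      # both Wolfe conditions hold
        alpha = (lo + hi) / 2.0 if np.isfinite(hi) else 2.0 * lo
    return alpha

def batch_gradient_descent(f, grad, x0, tol=1e-6, max_iters=500):
    """Steepest descent over the full batch with Wolfe-condition stepsizes."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iters):
        g = grad(x)
        if np.linalg.norm(g) < tol:           # near-stationary point reached
            break
        p = -g                                # steepest-descent search direction
        x = x + wolfe_stepsize(f, grad, x, p) * p
    return x

# Illustrative smooth non-convex objective with minima at (+/- sqrt(3/2), 0).
f = lambda x: x[0]**4 - 3.0 * x[0]**2 + x[1]**2
grad = lambda x: np.array([4.0 * x[0]**3 - 6.0 * x[0], 2.0 * x[1]])
print(batch_gradient_descent(f, grad, [2.0, 1.0]))  # converges to a stationary point
```

The Wolfe/Zoutendijk theory guarantees that, for smooth objectives bounded below, stepsizes chosen this way drive the gradient norm toward zero, which is exactly the kind of convergence guarantee the episode discusses.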

LM101-083: Ch5: How to Use Calculus to Design Learning Machines

Episode Summary: This particular podcast covers the material from Chapter 5 of my new book “Statistical Machine Learning: A Unified Framework,” which is now available! The book chapter shows, with many examples, how matrix calculus is useful for the analysis and design of both linear and nonlinear learning machines. Show Notes: Hello everyone! Welcome to the… Read More »
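As one illustration of the kind of derivation matrix calculus makes routine: for the linear regression empirical risk L(w) = (1/(2n))||Xw - y||^2, matrix calculus yields the closed-form gradient grad L(w) = (1/n) X'(Xw - y), which in turn defines a gradient descent learning rule. The Python sketch below is my own example, not taken from the book; it implements this gradient and verifies the derivation against a finite-difference estimate.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))                        # n = 100 examples, 3 features
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=100)

def loss(w):
    """Empirical risk L(w) = (1/(2n)) ||Xw - y||^2."""
    residual = X @ w - y
    return 0.5 * np.mean(residual ** 2)

def gradient(w):
    """Closed-form gradient from matrix calculus: (1/n) X'(Xw - y)."""
    return X.T @ (X @ w - y) / len(y)

# Sanity-check the derivation against a central finite-difference estimate.
w = rng.normal(size=3)
eps = 1e-6
fd = np.array([(loss(w + eps * e) - loss(w - eps * e)) / (2.0 * eps)
               for e in np.eye(3)])
print(np.allclose(gradient(w), fd, atol=1e-6))       # True: the formula checks out
```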

LM101-065: How to Design Gradient Descent Learning Machines (Rerun)

Episode Summary: In this episode we introduce the concept of gradient descent, which is the fundamental principle underlying learning in the majority of deep learning and neural network learning algorithms. Show Notes: Hello everyone! Welcome to the sixteenth podcast in the podcast series Learning Machines 101. In this series… Read More »

LM101-031: How to Analyze and Design Learning Rules using Gradient Descent Methods (RERUN)

Episode Summary: In this episode we introduce the concept of gradient descent, which is the fundamental principle underlying learning in the majority of machine learning algorithms. Show Notes: Hello everyone! Welcome to the sixteenth podcast in the podcast series Learning Machines 101. In this series… Read More »

LM101-016: How to Analyze and Design Learning Rules using Gradient Descent Methods

Episode Summary: In this episode we introduce the concept of gradient descent, which is the fundamental principle underlying learning in the majority of machine learning algorithms. Show Notes: Hello everyone! Welcome to the sixteenth podcast in the podcast series Learning Machines 101. In this series of podcasts my goal is to discuss important concepts of artificial intelligence and… Read More »
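To make the gradient descent idea in this episode (and its reruns above) concrete, here is a minimal Python sketch of the update rule itself: repeatedly nudge the parameter in the direction opposite the gradient. The one-dimensional objective and the learning rate below are illustrative choices, not values from the episode.

```python
# Objective f(w) = (w - 3)^2 with minimizer w* = 3, and its derivative.
f = lambda w: (w - 3.0) ** 2
df = lambda w: 2.0 * (w - 3.0)

w, eta = 0.0, 0.1                 # initial guess and learning rate (stepsize)
for step in range(50):
    w -= eta * df(w)              # the gradient descent update rule
print(w)                          # approaches the minimizer w* = 3
```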