Category Archives: Generalization Performance

LM101-077: How to Choose the Best Model using BIC

Episode Summary: In this episode, we explain the proper semantic interpretation of the Bayesian Information Criterion (BIC) and emphasize how this semantic interpretation is fundamentally different from AIC (Akaike Information Criterion) model selection methods. Briefly, BIC is used to estimate the probability of the training data given the probability model, while AIC is used to estimate out-of-sample prediction… Read More »

LM101-076: How To Choose the Best Model using AIC or GAIC

Episode Summary: In this episode, we explain the proper semantic interpretation of the Akaike Information Criterion (AIC) and the Generalized Akaike Information Criterion (GAIC) for the purpose of picking the best model for a given set of training data.  The precise semantic interpretation of these model selection criteria is provided, explicit assumptions are provided for the AIC and… Read More »

LM101-058: How to Identify Hallucinating Learning Machines using Specification Analysis

LM101-058: How to Identify Hallucinating Learning Machines using Specification Analysis Episode Summary: In this 58th episode of Learning Machines 101, I’ll be discussing an important new scientific breakthrough published just last week for the first time in the journal Econometrics  in the special issue on model misspecification titled “Generalized Information Matrix Tests for Detecting Model Misspecification”. The article… Read More »

LM101-055: How to Learn Statistical Regularities using MAP and Maximum Likelihood Estimation (Rerun)

LM101-055: How to Learn Statistical Regularities using MAP and ML Estimation Episode Summary: In this rerun of Episode 10, we discuss fundamental principles of learning in statistical environments including the design of learning machines that can use prior knowledge to facilitate and guide the learning of statistical regularities. Show Notes: Hello everyone! Welcome to the tenth podcast in… Read More »

LM101-040: How to Build a Search Engine, Automatically Grade Essays, and Identify Synonyms using Latent Semantic Analysis

LM101-040: How to Build a Search Engine, Automatically Grade Essays, and Identify Synonyms using Latent Semantic Analysis Episode Summary: In this episode we explain how to build a search engine, automatically grade essays, and identify synonyms using Latent Semantic Analysis. Show Notes: Hello everyone! Welcome to the fortieth podcast in the podcast series Learning Machines 101. In this… Read More »

LM101-033: How to Use Linear Machine Learning Software to Make Predictions (Linear Regression Software)[RERUN]

LM101-033: How to Use Linear Machine Learning Software to Make Predictions (Linear Regression Software)[RERUN]   Episode Summary: In this episode we describe how to download and use free linear machine learning software to make predictions for classifying flower species using a famous machine learning data set. This is a RERUN of Episode 13. Show Notes: Hello everyone! Welcome… Read More »

LM101-028: How to Evaluate the Ability to Generalize from Experience (Cross-Validation Methods)[RERUN]

LM101-028: How to Evaluate the Ability to Generalize from Experience (Cross-Validation Methods)[RERUN] Episode Summary: In this episode we discuss the problem of how to evaluate the ability of a learning machine to make generalizations and construct abstractions given the learning machine is provided a finite limited collection of experiences. Show Notes: Hello everyone! Welcome to a RERUN of… Read More »

LM101-027: How to Learn About Rare and Unseen Events (Smoothing Probabilistic Laws)[RERUN]

LM101-027: How to Learn About Rare and Unseen Events (Smoothing Probabilistic Laws)[RERUN] Episode Summary: In this podcast episode, we discuss the design of statistical learning machines which can make inferences about rare and unseen events using prior knowledge. Show Notes: Hello everyone! Welcome to a RERUN of the 11th podcast in the podcast series Learning Machines 101. In this… Read More »

LM101-013: How to Use Linear Machine Learning Software to Make Predictions (Linear Regression Software)

Episode Summary: In this episode we describe how to download and use free linear machine learning software to make predictions for classifying flower species using a famous machine learning data set. Show Notes: Hello everyone! Welcome to the thirteenth podcast in the podcast series Learning Machines 101. In this series of podcasts my goal is to discuss important… Read More »

LM101-012: How to Evaluate the Ability to Generalize from Experience (Cross-Validation Methods)

Episode Summary: In this episode we discuss the problem of how to evaluate the ability of a learning machine to make generalizations and construct abstractions given the learning machine is provided a finite limited collection of experiences. Show Notes: Hello everyone! Welcome to the twelfth podcast in the podcast series Learning Machines 101. In this series of podcasts… Read More »