In-depth introduction to machine learning in 15 hours of expert videos

R-bloggers 2014-09-23

(This article was first published on R - Data School, and kindly contributed to R-bloggers)
In January 2014, Stanford University professors Trevor Hastie and Rob Tibshirani (authors of the legendary An Introduction to Statistical Learning with Applications in R (ISLR). I found it to be an excellent course in statistical learning (also known as "machine learning"), largely due to the high quality of both the textbook and the video lectures. And as an R user, it was extremely helpful that they included R code to demonstrate most of the techniques described in the book. If you are new to machine learning (and even if you are not an R user), I highly recommend reading ISLR from cover-to-cover to gain both a theoretical and practical understanding of many important methods for regression and classification. It is available as a free PDF download from the authors' website. If you decide to attempt the exercises at the end of each chapter, there is a GitHub repository of solutions provided by students you can use to check your work. As a supplement to the textbook, you may also want to watch the excellent course lecture videos (linked below), in which Dr. Hastie and Dr. Tibshirani discuss much of the material. In case you want to browse the lecture content, I've also linked to the PDF slides used in the videos.

Chapter 1: Introduction (slides, playlist)

Chapter 2: Statistical Learning (slides, playlist)

Chapter 3: Linear Regression (slides, playlist)

Chapter 4: Classification (slides, playlist)

Chapter 5: Resampling Methods (slides, playlist)

Chapter 6: Linear Model Selection and Regularization (slides, playlist)

Chapter 7: Moving Beyond Linearity (slides, playlist)

Chapter 8: Tree-Based Methods (slides, playlist)

Chapter 9: Support Vector Machines (slides, playlist)

Chapter 10: Unsupervised Learning (slides, playlist)

Interviews (playlist)

To leave a comment for the author, please follow the link and comment on his blog: R - Data School. offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...