The Data Skeptic Podcast features interviews and discussion of topics related to data science, stati
The degree to which two variables change together can be calculated in the form of their covariance.
Today's guest is Cameron Davidson-Pilon. Cameron has a masters degree in quantitative finance from t
The central limit theorem is an important statistical result which states that typically, the mean o
Today's guest is Chris Hofstader (@gonz_blinko), an accessibility researcher and advocate, as well a
The multi-armed bandit problem is named with reference to slot machines (one armed bandits). Given t
Our episode this week begins with a correction. Back in episode 28 (Monkeys on Typewriters), Kyle ma
There are several factors that are important to selecting an appropriate sample size and dealing wit
There's an old adage which says you cannot fit a model which has more parameters than you have data.
There are many occasions in which one might want to know the distance or similarity between two thin
ContentMine is a project which provides the tools and workflow to convert scientific literature into
Today's mini-episode explains the distinction between structured and unstructured data, and debates
Yusan Lin shares her research on using data science to explore the fashion industry in this episode.
PageRank is the algorithm most famous for being one of the original innovations that made Google sta
In this episode, Benjamin Uminsky enlightens us about some of the ways the Los Angeles County Regist
This episode explores the k-nearest neighbors algorithm which is an unsupervised, non-parametric met
How do people think rationally about small probability events? What is the optimal statistical proce
This mini-episode is a high level explanation of the basic idea behind MapReduce, which is a fundam
The Credible Hulk joins me in this episode to discuss a recent blog post he wrote about glyphosate
More features are not always better! With an increasing number of features to consider, machine lea
This episode discusses video game analytics with guest Anders Drachen. The way in which people get