Syllabus
Schedule
Notes
Labs
Problem Sets
Lecture Notes
To Explain or to Predict?
A postscript
Apr 28, 2023
Logistic Regression
A model for binary classification
Apr 26, 2023
Case Study: Pricing Homes
Data wrangling, recoding, and transformations.
Apr 20, 2023
Overfitting
Overfitting with scissors and the utility of a train/test split.
Apr 19, 2023
Evaluating and Improving Predictions
\(R^2\)
, Adding Predictors, Transformations, and Polynomials
Apr 14, 2023
The Method of Least Squares
Fitting a regression model by minimizing RSS
Apr 12, 2023
Experiments
The principles of experimental design
Apr 7, 2023
Defining Causality
Conditional counterfactuals and two strategies for constructing causal claims
Apr 5, 2023
Wrong by Design
Type I errors, Type II errors, and statistical power
Mar 24, 2023
Hypothesis Tests II
Simulating the null by taking draws
Mar 22, 2023
Hypothesis Testing
Measuring the consistency between a model and data
Mar 17, 2023
Bootstrapping
Another Approach to Confidence Intervals
Mar 15, 2023
Confidence Intervals
Quantifying the sampling variability of a statistic.
Mar 8, 2023
From Samples to Populations
The bias and variance of moving from sample to population.
Mar 6, 2023
Expected value and variance of a random variable
Measuring the center and spread of a distribution
Mar 3, 2023
Random Variables
Discrete random variables, probability mass functions, cumulative distribution functions
Mar 1, 2023
How to Calculate Chances
Two important ideas: Conditional probabability and Independence
Feb 24, 2023
Introducing Probability
Definitions, examples, and axioms
Feb 22, 2023
Multiple Linear Regression
Summarizing linear relationships in high dimensions
Feb 18, 2023
Summarizing Numerical Associations
Correllation and the least squares line
Feb 15, 2023
Communicating with Graphics
Six ways to hone the message of a data visualization.
Feb 10, 2023
Conditioning
Filtering, groupwise operations, and data pipelines.
Feb 8, 2023
A Grammar of Graphics
A unified framework for constructing statistical graphics.
Feb 3, 2023
Summarizing Numerical Data
Seeing the forest for the trees.
Feb 1, 2023
Summarizing Categorical Data
From data frames to tables. From tables to bar charts.
Jan 27, 2023
A Tool for Computing with Data
An introduction to the R language for statistical computing
Jan 25, 2023
The Taxonomy of Data
Types of variables and the data frame
Jan 20, 2023
Understanding the World through Data
Jan 18, 2023
No matching items