Summarizing Categorical Data

STAT 20: Introduction to Probability and Statistics

Agenda

  • Concept Questions
  • Lab 1.2: Arbuthnot

Concept Questions

The table below displays data from a survey on a class of students.

What proportion of the class was in the marching band?

What proportion of those in the marching band were juniors?

What proportion were sophomores not in the marching band?

What were the dimensions of the raw data from which this table was constructed?

How would you characterize the association between these two variables?

Political affiliation and college degree status of 500 survey participants.

Which group is the largest?

What does this plot show?

Lab 1: Arbuthnot

Your first plot

A template for a line plot:


ggplot(DATAFRAME, aes(x = XVARIABLE, y = YVARIABLE)) +
  geom_line()


Where:

  • DATAFRAME is the name of your data frame
  • XVARIABLE is the name of the variable of that data frame that you want on the x-axis
  • YVARIABLE is the name of the variable of that data frame that you want on the y-axis
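
To see how the placeholders get filled in, here is a minimal sketch of the template applied to a small data frame. The data frame toy_births and its columns year and count are hypothetical names invented for this illustration, and the values are made up; they are not the lab data.

```r
library(ggplot2)

# Hypothetical toy data frame; names and values are made up for illustration
toy_births <- data.frame(
  year  = c(2001, 2002, 2003, 2004, 2005),
  count = c(120, 135, 128, 142, 150)
)

# DATAFRAME -> toy_births, XVARIABLE -> year, YVARIABLE -> count
p <- ggplot(toy_births, aes(x = year, y = count)) +
  geom_line()

# Printing the plot object draws the line plot
p
```

Note that the variable names inside aes() are written without quotes, and the + that joins the two layers goes at the end of the first line.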