Lab 1: Arbuthnot

Slides

Part 1: Understanding the Context of the Data

The first part of each of our labs will be a worksheet where you’ll think more about the context of the data, help formulate your questions, and set expectations for what the data will look like. Please record your answers on the worksheet below and upload it to Gradescope as Lab 1.1.

Part II: Computing on the Data

Quarto Lab Format

The second part of each lab is your chance to dive into a real data set to answer the questions that you posed in Part 1. To ensure your analysis is reproducible, please record your answers to these questions (both the text and code) in a .qmd file. Render your work to an HTML file regularly as a check that your code is running. When you are pleased with the result, print that HTML to a pdf (File > Print > Save as pdf in your browser), then upload to Gradescope as Lab 1.2. See Ed for more info about lab submission.

Lab Questions

The first several questions pertain to the arbuthnot data frame found in library(stat20data).

  1. What is the time frame covered by Arbuthnot’s data?

  2. Which year saw the greatest number of children christened?

  3. What is the proportion of girls christened in 1700?

  4. What is the trend over time in the total number of children christened? Please answer with a plot and written interpretation.

  5. What is the trend over time in the proportion of girls christened? Please answer with a plot and written interpretation.


The remaining questions pertain to the present data frame found in library(stat20data).

  1. What is the time frame covered by the present-day data?

  2. In terms of general magnitude (size), how do the counts in Arbuthnot’s data compare to the counts in the present-day data?

  3. What is the trend over time in the proportion of births that are girls? Please answer with a plot and written interpretation.

  4. Based on these two data sets, what claim are you prepared to make regarding John Arbuthnot’s original question? What reservations, if any, do you have about using this data to make the claim?