The purpose of this assignment is for you to implement and reflect upon what you have learned throughout the semester. In this project, you will need to demonstrate your thoughtful mastery of statist

Answer the following questions in a 3-6 page write up. 

  1. Data and Collection

    1. Introduce the data set you chose

      1. Why did you pick this data set?

      2. Do some research into what your data set describes? (e.g. what is Scottish Hill Racing?)

      3. What types of data are presented in your data set (e.g. hill height, weights, categories, etc.). How would you describe these kinds of data - continuous? discrete? categorical? What level of measurement?

  1. As a student of statistics, what questions do you have about how the data was collected? Think critically about the different ways data can be collected as well as the possible bias involved. Come up with a good data collection strategy as well as a flawed data collection strategy for this data set. Explain your reasoning/choices!

  1. Visualizing Data and Summary Statistics

    1. Choose at least two numerical aspects of your data (e.g. length, time, etc.).

      1. Create a histogram for each data set.

      2. Create a boxplot for each data set and give the five number summary. 

      3. Indicate the summary statistics for each data set including mean, median, and standard deviation.

  1. Discuss what the visualizations and summaries tell you, if anything, about the data (center, spread, distribution). If relevant, compare and contrast your data sets based on the visualizations and summaries. Use complete sentences and justify any observations by tying back to your visualizations and summaries.

  1. Confidence Intervals and Hypothesis Testing

    1. Construct and interpret a confidence interval for a mean and proportion. This will involve:

      1. Confidence Interval for Mean

        1. Choose a numerical aspect of your data and calculate the sample mean (using CODAP)

        2. Choose a confidence level and calculate the Error (this will require you to know the sample size and sample standard deviation, which you can find on CODAP)

        3. Construct the confidence interval

        4. Interpret for someone without any statistics background.

      2. Confidence Interval for Proportion

        1. Choose a numerical aspect of your data and calculate the sample proportion (note - not every numerical aspect lends itself directly to proportions; you need to interpret the data as as a part of a whole that has some property - e.g. the proportion of racers who finished in under 30 minutes)

        2. Choose a confidence level and calculate the Error (this will require you to know the sample size, which you can find on CODAP)

        3. Construct the confidence interval

        4. Interpret for someone without any statistics background.

  1. Construct and create two hypothesis tests - one involving a proportion or mean, and the test comparing two proportions, two means, or a contingency table*. This will involve the following for each: 

    1. Make a claim based on your data (e.g. the average time is less than…) and choose a significance level

    2. Set up the Null and Alternative Hypotheses

    3. Find the appropriate test statistic (you may need information like the sample size or sample standard deviation which can be found on CODAP)

    4. Draw your conclusion and interpret for Interpret for someone without any statistics background.

    5. Write descriptions for both what a Type I Error and a Type II Error looks like for the hypothesis test for mean or proportion

  * If you are going to conduct a test for independence and need to create 

      a contingency table - this help article may prove very useful!

Some notes/links on CODAP:

  • You can save your work in CODAP either on GoogleDrive or to your computer. When you want to open a CODAP file that was saved to your computer, simply drag it to this window.

  • CODAP Help Menu 

  • CODAP  - Getting Started Part 1 and Part 2 (interactive tutorials)

  • CODAP User Manual