statistic data assignment

Data Assignment 1 Instructions 

Use the CORE DATA dataset (MSExcel format) to answer the following questions. Refer to the corresponding data dictionary for information about the data. 

These questions may be answered using R, Excel, Statkey or any other software package. 

You need to submit your output (including any R code), along with the results (i.e. tables, figures). 

After completing the assignment, upload your answers as a PDF file. The file should be saved as LastnameFirstname_da1 (e.g. Josh Freeman should submit his PDF file as ("FreemanJosh_da1.pdf"). 

Research Question

  1. How many observational units (cases) are in this dataset? (1 point)

  2. How many variables are in this dataset? Identify which variables in the dataset are categorical and which are quantitative. (2 points)

  3. For BMI

  1. describe numerically the center and spread. (4 points)

b. create a graphical summary to describe these data. Make sure you give a title and label the axes of your graph. (4 points)

c. Write 2-3 sentences describing the data and the distribution shape. (6 points)

4. Provide a frequency table for smoking status. (2 points)

  1. Repeat the analysis from question 3 for each strata of smoking status. (4 points)

6. Do there appear to be any differences in BMI by smoking status? Write 2-3 sentences to summarize your findings. (2 points)