statistic data assignment
PUBHLTH 223 Data Assignment 1 Instructions Use the CORE DATA FOR 3 DATA ASSIGNMENTS dataset (MSExcel format) found in the Class Information section on Moodle to answer the following questions. Refer to the corresponding data dictionary for information about the data. These questions may be answered using R, Excel, Statkey or any other software package. You need to submit your output (including any R code), along with the results (i.e. tables, figures). After completing the assignment, upload your answers as a PDF file . The file should be saved as Last name First name_da1 (e.g. Josh Freeman should submit his PDF file as ("Freeman Josh _da1.pdf"). Refer to the syllabus for due date. Submissions will be accepted for a grace period of two add itional days but the grades will be reduced by 20%. Research Question 1. How many observational units (cases) are in this dataset? ( 1 point ) 2. How many variables are in this dataset? Identify which variables in the dataset are categorical and which are quantitative. ( 2 points ) 3. For BMI a. describe numerically the center and spread. ( 4 points ) b. create a graphical summary to describe the se data . Make sure you give a t itle and label the axes of your graph. ( 4 points ) c. Write 2 -3 sentences describing the data and the distribution shape . (6 points) 4. Provide a frequency table for smoking status. (2 points) 5. Repeat the analysis from question 3 for each strata of smoking status. ( 4 points ) 6. Do there appear to be any differences in BMI by smoking status? Write 2 -3 sentences to summarize your findings. ( 2 points)