stats week 7

Case Study Analysis 2

The Cholesterol.xls records cholesterol level data for individuals. Descriptions for the data follow:

  • Cholesterol: Cholesterol level (mg/dL)

  • Income: annual income in $

  • Age: age of individual

  • Jogging: number of hours an individual spends on jogging a day

  • Saturated fat: the amount of saturated fat an individual takes a day (g)


  1. Develop an estimated regression equation that can be used to predict Cholesterol level using age, jogging income, and saturated fat. Discuss your findings including interpretation of slope of each variable and significance, using at least 200 words. Use .

  2. Starting with the estimated regression equation developed in part (A), delete any independent variables that are not statistically significant and develop a new estimated regression equation that can be used to predict Cholesterol level. Use . Discuss your findings including interpretation of slope of each variable and significance, using at least 200 words. Use .

  3. Compare model (A) and (B) in terms of R^2 and which model fits the data better? Discuss this using at least 100 words

  4. In model B, what are the most important factors affecting Cholesterol level? What are the least important factors? Discuss this using at least 100 words