This week’s assignment requires that you demonstrate the relationship between two variables.  You will be required to use the least squares approach as well as use software (SPSS preferred) to perform

TIM7100

DATA FILE 6

ASSIGNMENT 7

  1. Baseball wisdom says if you can’t hit, you can’t win. But is the number of games won by a major-league baseball team in a season related to the team’s batting average? The table below shows the number of games won and the batting averages for 14 games in one American League season.

Team

Games Won, y

Team Batting Average, x

Cleveland

99

.293

New York

92

.288

Boston

85

.283

Toronto

74

.259

Texas

90

.284

Detroit

53

.256

Minnesota

78

.288

Baltimore

88

.274

California

70

.276

Milwaukee

80

.279

Seattle

85

.287

Kansas City

75

.267

Oakland

78

.265

Chicago

85

.281

    1. If you were to model the relationship between the mean (or expected) number of games won by a major-league team and the team’s batting average x, using a straight line, would you expect the slope of the line to be positive or negative? Explain.

    2. Construct a scattergram of the data. Does the pattern revealed by the scattergram agree with your answer to part a? Construct a simple linear regression of the data using SPSS. What is the equation of the least squares line?

    3. Graph the least squares line on your scattergram. Does your least squares line seem to fit the points on your scattergram?

    4. Does the mean or expected number of games won appear to be strongly related to a team’s batting average? Explain.

    5. Interpret the values of and in the words of the problem.

  1. To improve the quality of the output of any production process, it is necessary first to understand the capabilities of the process. In a particular manufacturing process, the useful life of a cutting tool is related to the speed at which the tool is operated. It’s necessary to understand this relationship in order to predict when the tool should be replaced and how many spare tools should be available. The data in the table below were derived from life tests for the two different brands of cutting tools currently used in the production process.

Cutting Speed

(meters per minute)

Useful Life (Hours)

Brand A

Brand B

30

4.5

6.0

30

3.5

6.5

30

5.2

5.0

40

5.2

6.0

40

4.0

4.5

40

2.5

5.0

50

4.4

4.5

50

2.8

4.0

50

1.0

3.7

60

4.0

3.8

60

2.0

3.0

60

1.1

2.4

70

1.1

1.5

70

0.5

2.0

70

3.0

1.0

    1. Construct a simple linear regression of the data using SPSS. What is the equation of the least squares line for Brand A and for Brand B?

    2. For a cutting speed of 70 meters per minute, find ± 2s for each least squares line.

    3. For which brand would you feel more confident in using the least squares line to predict useful life for a given cutting speed? Explain.


  1. The expenses involved in a manufacturing operation may be categorized as being for raw material, direct labor, and overhead. Overhead refers to all expenses other than those for raw materials and direct labor that are involved with running the factory. A manufacturer of 10-speed racing bicycles is interested in estimating the relationship between its monthly factory overhead and the total number of bicycles produced per month. The estimate will be used to help develop the manufacturing budget for next year. The data in the table below have been collected for the previous 12 months.

Month

Production Level (1,000s of units)

Overhead ($1,000s)

16.9

41.4

15.6

35.0

17.4

38.3

11.6

29.5

17.7

39.6

17.6

37.4

16.3

37.5

15.5

37.0

23.4

47.9

10

28.4

55.6

11

27.1

53.1

12

19.2

40.6

    1. Construct a simple linear regression of the data using SPSS. What is the equation of the least squares line?

    2. Does the straight-line model contribute information for predicting overhead costs? Test at α = 0.05.

    3. Which of the four assumptions we make about the random error ϵ may be inappropriate in this problem? Explain.


  1. Describe the slope of the least squares line if

    1. r = 0.7

    2. r = -0.7

    3. r = 0

    4. r2 = 0.64



  1. After the Minnesota Department of Transportation installed more than 20 years ago what was then a state-of-the-art weigh-in-motion scale in the concrete surface of the eastbound lanes of Interstate 94, a study was undertaken to determine whether the scale’s readings corresponded with the static weights of the vehicles being monitored. After some preliminary comparisons using a two-axle, six-tire truck carrying different loads, calibration adjustments were made in the scale’s software and the scales were reevaluated. The data are in the table below. Construct a simple linear regression of the data using SPSS.

    1. What is the correlation coefficient for both sets of data?

    2. How can the correlation coefficients be used to evaluate the weigh-in-motion scale?

Trial Number

Static Weight of Truck x (thousand pounds)

Weigh in Motion Prior to Calibration Adjustment y1 (thousand pounds)

Weigh in Motion After Calibration Adjustment y2 (thousand pounds)

27.9

26.0

27.8

29.1

29.0

29.1

38.0

39.5

37.8

27.0

25.1

27.1

30.3

31.6

30.6

34.5

36.2

34.3

27.8

25.1

26.9

29.6

31.0

29.6

33.1

35.6

33.0

10

35.5

40.2

35.0




  1. Work standards specify time, cost, and efficiency norms for the performance of work tasks. They are typically used to monitor job performance. In one distribution center, data were collected to develop work standards for the time to assemble or fill customer orders. The table below contains data for a random sample of 9 orders.

Time (mins.)

Order Size

27

36

15

34

71

255

35

103

60

555

10

60

10

96


    1. Construct a simple linear regression of the data using SPSS. What is the equation of the least squares line? Use time as the dependent variable.

    2. In general, we would expect the mean time to fill an order to increase with the size of the order. Do the data support this theory? Test using α = .05.


  1. Firms planning to build new plants or make additions to existing facilities have become very conscious of the energy efficiency of proposed new structures and are interested in the relation between annual energy consumption and the number of square feet of building shell. The table below lists the energy consumption in British thermal units (BTUs) for 22 buildings that were all subjected to the same climatic conditions.

    1. Construct a simple linear regression of the data using SPSS. What is the estimate of the intercept β0 and slope β1?

    2. Is annual energy consumption positively linearly related to the shell area of the building? Test using α = .10.

    3. What is the observed significance level of the test of part b? Interpret its value.

    4. What is the coefficient of determination? Interpret its meaning.

    5. A company wishes to build a new warehouse that will contain 8,000 sq. ft. of shell area. Find the predicted value of energy consumption. Calculate a 95% prediction interval. Comment on the usefulness of this interval.

BTU/Year (thousands)

Shell Area (square feet)

3,870,000

30,001

1,371,000

13,530

2,422,000

26,060

672,200

6,355

233,100

4,576

218,900

24,680

354,000

2,621

3,135,000

23,350

1,470,000

18,770

1,408,000

12,220

2,201,000

25,490

2,680,000

23,680

337,500

5,650

567,500

8,001

555,300

6,147

239,400

2,660

2,629,000

19,240

1,102,000

10,700

423,500

9,125

423,500

6,510

1,691,000

13,530

1,870,000

18,860