Instructions Final Project Now that you have completed the first six assignments, it is time to complete your research project for the course. Include the following sections in your submission. Title

Running head: DATA ANALYSIS 0

Descriptive Statistics Analysis

Name

Columbia Southern University

Data Analysis: Descriptive Statistics and Assumption Testing

Sun Coast data was investigated to choose whether normal distribution of data was shown, which would be compulsory for postulation for parametric statistical tests. All the investigation questions were known to have ratio data that is classifiable, and has a considerable distance between the data values. The questions have a true zero features. Mean (average), mode, and median are very useful in the study.

Correlation: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Frequency

Range

13

18

24

18

12

10

11

12


Histogram



Descriptive statistics table.

Sick Days

Mean

7.126214

Standard Error

0.186484

Median

Mode

Standard Deviation

1.892605

Sample Variance

3.581953

Kurtosis

0.124923

Skewness

0.14225

Range

10

Minimum

Maximum

12

Sum

734

Count

103

Measurement scale.

The four scales used to estimate the information are ratio, nominal, interval and ordinal scales. However, both the ordinal and nominal values do not have substantial distance between data values or true zero. Interval data scales have a considerable distance amid data values; however, they have no true zero. Ratios data can be grouped, well-arranged, and have considerable distance between data values, and have true zero (Simonsohn, Simmons & Nelson, 2019). Therefore, the measurement scale used in this scenario is ratio due to the data being utilized is the sick days.

The measure of central tendency.

Central tendency illustrates the degree to which data points are disseminated around the mid-point of the curve. The mid-point is estimated by average, median, and mode. The mean is 7.12, the median is seven, and the mode is seven.

Evaluation.

The parametric test demands that the assumption of normality be met. Normality is when data is distributed normally and when graphed looks like a bell shape. There are other different typical assumptions that should be met, depending on the statistical procedure used, include sample size, levels of measurement, consistency of variance, independence, absence of outliers, linearity, etc. In this scenario, there is, in fact, a bell-shaped graph, and the data were distributed normally.

Simple Regression: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Lost Time Hours

Frequency

10

35

60

85

110

17

135

18

160

24

185

27

210

37

235

24

260

21

285

15

310

12

335

More

Histogram.

Descriptive statistics table.

Lost Hours

Mean

188.0044843

Standard Error

4.803089447

Median

190

Mode

190

Standard Deviation

71.72542099

Sample Variance

5144.536016

Kurtosis

-0.501223533

Skewness

-0.081984874

Range

350

Minimum

10

Maximum

360

Sum

41925

Count

223

Measurement scale.

According to the information, there are safety training expenditures and lost time hours. The measurement scale used in the scenario is the ratio.

A measure of central tendency.

The mid-point is measured by mean, median, and mode. The mean is 188.0044843, the median is 190, and the mode is 190.

Evaluation.

In this scenario, the lost hours range from 10 to 335, with 10 and 335 having fewer numbers of lost hours. Lost time hours surge progressively, displaying a bell-shaped graph, and the data was distributed normally. Therefore, the assumptions for parametric statistical testing were met.

Multiple Regression: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Bin

Frequency

103.38

104.3697

105.3593

106.349

107.3386

108.3283

109.3179

110.3076

12

111.2973

18

112.2869

17

113.2766

26

114.2662

22

115.2559

27

116.2456

47

117.2352

36

118.2249

44

119.2145

47

120.2042

53

121.1938

61

122.1835

60

123.1732

62

124.1628

74

125.1525

70

126.1421

81

127.1318

92

128.1214

73

129.1111

105

130.1008

80

131.0904

88

132.0801

67

133.0697

50

134.0594

56

135.0491

35

136.0387

30

137.0284

19

138.018

139.0077

139.9973

More

Histogram.

Descriptive statistics table.

Decibels

Mean

124.8359428

Standard Error

0.177944692

Median

125.721

Mode

127.315

Standard Deviation

6.898656622

Sample Variance

47.59146318

Kurtosis

-0.3141873

Skewness

-0.418952188

Range

37.607

Minimum

103.38

Maximum

140.987

Sum

187628.422

Count

1503

Measurement scale

In every contract, employees are subject to noises during the job. These noises eventually lead to injuries. The louder the noises, the more they are susceptible to injuries. The information provided utilizes the ratio scale.

The measure of central tendency

The mean decibels are 124.8359428, the median 125.721 and the mode 127.315

Evaluation

The decibels employees are exaggerated by range from 103 to 140. This histogram is bell-shaped with the most decibels at 129.1111; thus, assumptions for parametric statistical testing observed.

Independent Samples t-Test:

Prior Training

Bin

Frequency

50

55.85714

61.71429

67.57143

73.42857

14

79.28571

10

85.14286

More

Revised Training

Bin

Frequency

75

78.14286

81.28571

10

84.42857

12

87.57143

14

90.71429

10

93.85714

More


Histogram.

Descriptive statistics table.

Prior Training

Revised Training

Mean

69.79032258

Mean

84.77419355

Standard Error

1.402788093

Standard Error

0.659478888

Median

70

Median

85

Mode

80

Mode

85

Standard Deviation

11.04556449

Standard Deviation

5.192741955

Sample Variance

122.004495

Sample Variance

26.96456901

Kurtosis

-0.77667598

Kurtosis

-0.352537913

Skewness

-0.086798138

Skewness

0.144084526

Range

41

Range

22

Minimum

50

Minimum

75

Maximum

91

Maximum

97

Sum

4327

Sum

5256

Count

62

Count

62

Measurement scale.

Test scores from prior training and revised training were reorded. The ratio measurement is the scale used in this scenario.

Measure of central tendency.

The preceding training scores group had a mean of 69.79032258, a median of 70, and a mode of 80. In the revised training with group b, the mean was 84.77419355, a median of 85 and mode 85.

Evaluation.

The histogram shows a normal bell-shaped figure from the data provided. Therefore, assumptions for parametric statistical testing were met.

Dependent Samples (Paired-Samples) t-Test: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Pre-Exposure

Post Exposure

Bin

Frequency

Bin

Frequency

16

16

24

24

32

32

40

13

40

11

48

12

48

14

More

More

Histogram.

Descriptive statistics table.

Pre-Exposure

Post Exposure

Mean

32.85714

Mean

33.28571429

Standard Error

1.752307

Standard Error

1.781423416

Median

35

Median

36

Mode

36

Mode

38

Standard Deviation

12.26615

Standard Deviation

12.46996391

Sample Variance

150.4583

Sample Variance

155.5

Kurtosis

-0.57604

Kurtosis

-0.654212507

Skewness

-0.42511

Skewness

-0.483629097

Range

50

Range

50

Minimum

Minimum

Maximum

56

Maximum

56

Sum

1610

Sum

1631

Count

49

Count

49

Measurement scale.

The data provided is pre-exposure numbers and post-exposure using the ratio measurement.

Measure of central tendency.

The pre-exposure mean is 32.85714; the median is 35 and mode 36. The post-exposure mean is 33.28571429, median 36, and mode 38.

Evaluation.

The histogram demonstrates a normal bell-shaped figure from the data provided. Therefore, assumptions for parametric statistical testing were met.

ANOVA: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Air

Soil

Bin

Frequency

Bin

Frequency

5.75

7.75

8.5

9.5

10

11.25

11.25

More

More

Water

Training

Bin

Frequency

Bin

Frequency

5.25

4.25

7.5

5.5

9.75

6.75

More

More

Histogram.

Descriptive statistics table

Air

Soil

Mean

8.9

Mean

9.1

Standard Error

0.684028316

Standard Error

0.390006748

Median

Median

Mode

11

Mode

Standard Deviation

3.059067625

Standard Deviation

1.744163199

Sample Variance

9.357894737

Sample Variance

3.042105263

Kurtosis

-0.62830092

Kurtosis

0.119230317

Skewness

-0.360849171

Skewness

0.492001831

Range

11

Range

Minimum

Minimum

Maximum

14

Maximum

13

Sum

178

Sum

182

Count

20

Count

20

Water

Training

Mean

Mean

5.4

Standard Error

0.575828922

Standard Error

0.265567912

Median

Median

Mode

Mode

Standard Deviation

2.575185226

Standard Deviation

1.187655807

Sample Variance

6.631578947

Sample Variance

1.410526316

Kurtosis

-0.237524639

Kurtosis

0.253746631

Skewness

0.760206271

Skewness

0.159183094

Range

Range

Minimum

Minimum

Maximum

12

Maximum

Sum

140

Sum

108

Count

20

Count

20

Measurement scale

The information given is using the ratio measurement.

Measure of central tendency

The mean for air is 8.9, the median nine, and mode 11. For air, the mean is 9.1, median 8, and mode 8. Water means 7, the median is 6, and mode is 6. The central training tendency has a mean of 5.4, median five, and mode 5.

Evaluation

The histogram shows a normal bell-shaped figure from the data provided. Therefore, assumptions for parametric statistical testing were observed.

References

George, D., & Mallery, P. (2016). Descriptive statistics. In IBM SPSS Statistics 23 Step by Step (pp. 126-134). Routledge.

Simonsohn, U., Simmons, J. P., & Nelson, L. D. (2019). Specification curve: Descriptive and inferential statistics on all reasonable specifications. Available at SSRN 2694998.