
Running head: DATA ANALYSIS

Descriptive Statistics Analysis

Name

Columbia Southern University

Data Analysis: Descriptive Statistics and Assumption Testing

The Sun Coast data were examined to determine whether they are normally distributed, which is a required assumption for parametric statistical tests. All of the research questions involve ratio data, which can be categorized and ordered, have meaningful distances between data values, and have a true zero. The mean (average), median, and mode are used to summarize each variable.

Correlation: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Frequency		Range



13
18
24
18
12
		10
		11
		12

Histogram

Descriptive statistics table.

Sick Days

Mean	7.126214
Standard Error	0.186484
Median	7
Mode	7
Standard Deviation	1.892605
Sample Variance	3.581953
Kurtosis	0.124923
Skewness	0.14225
Range	10
Minimum	2
Maximum	12
Sum	734
Count	103

Measurement scale.

The four scales of measurement are nominal, ordinal, interval, and ratio. Nominal and ordinal values have neither meaningful distances between data values nor a true zero. Interval data have meaningful distances between data values but no true zero. Ratio data can be categorized and ordered, have meaningful distances between data values, and have a true zero (Simonsohn, Simmons, & Nelson, 2019). Because the variable in this scenario is the number of sick days, the measurement scale is ratio.
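The properties that separate the four scales can be laid out as a small lookup table. The following is a minimal sketch only; the `Scale` class, its field names, and the `supports_ratios` helper are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Scale:
    name: str
    ordered: bool          # values can be ranked
    equal_intervals: bool  # distances between values are meaningful
    true_zero: bool        # zero means "none of the quantity"


SCALES = {
    "nominal": Scale("nominal", False, False, False),
    "ordinal": Scale("ordinal", True, False, False),
    "interval": Scale("interval", True, True, False),
    "ratio": Scale("ratio", True, True, True),
}


def supports_ratios(scale_name: str) -> bool:
    """Statements such as 'twice as many' need equal intervals and a true zero."""
    scale = SCALES[scale_name]
    return scale.equal_intervals and scale.true_zero


print(supports_ratios("ratio"))     # sick days: a count with a true zero
print(supports_ratios("interval"))
```

Under this sketch, only the ratio scale passes the check, which is why a statement such as "twice as many sick days" is meaningful for this variable.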

Measure of central tendency.

Central tendency describes the mid-point around which the data points cluster. The mid-point is estimated by the mean, median, and mode. The mean is 7.13, the median is 7, and the mode is 7.
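The figures in the Sick Days table are tied together by simple identities, so they can be cross-checked against one another. The sketch below uses only values copied from that table; the `close` helper and the tolerance of 1e-5 are assumptions made to allow for rounding in the reported figures.

```python
import math

# Values copied from the Sick Days descriptive statistics table.
total, count = 734, 103
mean, sd = 7.126214, 1.892605
variance, std_error = 3.581953, 0.186484


def close(a: float, b: float, tol: float = 1e-5) -> bool:
    """True when two reported figures agree up to rounding."""
    return abs(a - b) <= tol


checks = {
    "mean = sum / count": close(total / count, mean),
    "variance = sd squared": close(sd ** 2, variance),
    "standard error = sd / sqrt(count)": close(sd / math.sqrt(count), std_error),
}

for label, ok in checks.items():
    print(f"{label}: {'consistent' if ok else 'inconsistent'}")
```

The same three identities apply to every descriptive statistics table in this paper.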

Evaluation.

Parametric tests require that the assumption of normality be met. Data are normally distributed when their histogram approximates a bell shape. Depending on the statistical procedure used, other assumptions may also need to be met, including sample size, level of measurement, homogeneity of variance, independence, absence of outliers, and linearity. In this scenario, the histogram is bell-shaped and the skewness (0.14) and kurtosis (0.12) are both close to zero, so the data are approximately normally distributed.
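Alongside the visual check of the histogram, the reported skewness and kurtosis give a quick numeric screen. The sketch below rests on two assumptions that are not stated in this paper: that the kurtosis figures are excess kurtosis (zero for a normal curve), as spreadsheet descriptive statistics tools report it, and that values between -2 and +2 are acceptable, which is a common rule of thumb rather than a formal test. The function name `roughly_normal` is hypothetical.

```python
# (skewness, kurtosis) copied from the descriptive statistics tables in this paper.
reported = {
    "Sick Days": (0.14225, 0.124923),
    "Lost Hours": (-0.081984874, -0.501223533),
    "Decibels": (-0.418952188, -0.3141873),
    "Prior Training": (-0.086798138, -0.77667598),
    "Revised Training": (0.144084526, -0.352537913),
    "Pre-Exposure": (-0.42511, -0.57604),
    "Post Exposure": (-0.483629097, -0.654212507),
    "Air": (-0.360849171, -0.62830092),
    "Soil": (0.492001831, 0.119230317),
    "Water": (0.760206271, -0.237524639),
    "Training": (0.159183094, 0.253746631),
}


def roughly_normal(skewness: float, kurtosis: float, limit: float = 2.0) -> bool:
    """Rule-of-thumb screen: both statistics lie within +/- limit (assumed cutoff)."""
    return abs(skewness) <= limit and abs(kurtosis) <= limit


for variable, (skewness, kurtosis) in reported.items():
    verdict = "within limits" if roughly_normal(skewness, kurtosis) else "outside limits"
    print(f"{variable}: {verdict}")
```

A screen like this supplements the histogram; it does not replace a formal normality test.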

Simple Regression: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Lost Time Hours	Frequency
10
35
60
85
110	17
135	18
160	24
185	27
210	37
235	24
260	21
285	15
310	12
335
More

Histogram.

Descriptive statistics table.

Lost Hours

Mean	188.0044843
Standard Error	4.803089447
Median	190
Mode	190
Standard Deviation	71.72542099
Sample Variance	5144.536016
Kurtosis	-0.501223533
Skewness	-0.081984874
Range	350
Minimum	10
Maximum	360
Sum	41925
Count	223

Measurement scale.

The variables in this scenario are safety training expenditures and lost time hours. Both are measured on the ratio scale.

Measure of central tendency.

The mid-point is measured by mean, median, and mode. The mean is 188.0044843, the median is 190, and the mode is 190.

Evaluation.

In this scenario, lost time hours range from 10 to 360, with few observations near either extreme. Frequencies rise steadily toward the middle of the range, peak in the 210 bin (37 observations), and then decline, producing a bell-shaped histogram consistent with normally distributed data. Therefore, the assumptions for parametric statistical testing were met.
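The frequency table lists upper bin limits in steps of 25 followed by a "More" row. The sketch below shows one way such a table can be counted, assuming each value falls in the first bin whose limit is greater than or equal to it and anything above the last limit falls in "More" (the convention used by spreadsheet histogram tools). The function name and the sample values are hypothetical; they are not the Sun Coast data.

```python
from bisect import bisect_left


def count_into_bins(values, upper_limits):
    """Return one count per upper limit plus a final count for 'More'."""
    counts = [0] * (len(upper_limits) + 1)
    for value in values:
        # bisect_left finds the first limit that is >= value.
        counts[bisect_left(upper_limits, value)] += 1
    return counts


upper_limits = list(range(10, 336, 25))   # 10, 35, 60, ..., 335
sample = [10, 35, 36, 190, 335, 360]      # hypothetical lost time hours
counts = count_into_bins(sample, upper_limits)

for limit, frequency in zip(upper_limits + ["More"], counts):
    print(limit, frequency)
```

Under this rule, a value of 360 lands in the "More" row because it exceeds the last limit of 335.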

Multiple Regression: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Bin	Frequency
103.38
104.3697
105.3593
106.349
107.3386
108.3283
109.3179
110.3076	12
111.2973	18
112.2869	17
113.2766	26
114.2662	22
115.2559	27
116.2456	47
117.2352	36
118.2249	44
119.2145	47
120.2042	53
121.1938	61
122.1835	60
123.1732	62
124.1628	74
125.1525	70
126.1421	81
127.1318	92
128.1214	73
129.1111	105
130.1008	80
131.0904	88
132.0801	67
133.0697	50
134.0594	56
135.0491	35
136.0387	30
137.0284	19
138.018
139.0077
139.9973
More

Histogram.

Descriptive statistics table.

Decibels

Mean	124.8359428
Standard Error	0.177944692
Median	125.721
Mode	127.315
Standard Deviation	6.898656622
Sample Variance	47.59146318
Kurtosis	-0.3141873
Skewness	-0.418952188
Range	37.607
Minimum	103.38
Maximum	140.987
Sum	187628.422
Count	1503

Measurement scale

On every contract, employees are exposed to noise on the job, and this noise can eventually lead to injury. The louder the noise, the more susceptible employees are to injury. The data provided use the ratio scale.

The measure of central tendency

The mean decibel level is 124.8359428, the median is 125.721, and the mode is 127.315.

Evaluation

The noise levels to which employees are exposed range from 103.38 to 140.987 decibels. The histogram is bell-shaped, with the highest frequency (105 observations) in the 129.1111 bin; thus, the assumptions for parametric statistical testing were met.
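The bin limits in the Decibels frequency table are evenly spaced and start at the minimum. They are consistent with a step equal to the range divided by 38, and 38 is the whole-number part of the square root of the count. The sketch below reproduces the limits on that assumption; the rule itself is inferred from the table and is not stated in the source.

```python
import math

# Minimum, range, and count copied from the Decibels descriptive statistics table.
minimum, data_range, count = 103.38, 37.607, 1503

n_bins = math.isqrt(count)      # assumed rule: floor of the square root of the count
step = data_range / n_bins      # width of each bin
limits = [minimum + k * step for k in range(n_bins)]

print(n_bins)
print(round(step, 4))
print([round(limit, 4) for limit in limits[:3]])
print(round(limits[-1], 4))
```

Values above the last limit are counted in the "More" row, which is why the maximum of 140.987 does not appear as a bin.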

Independent Samples t-Test: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Prior Training
Bin	Frequency
50
55.85714
61.71429
67.57143
73.42857	14
79.28571	10
85.14286
More

Revised Training
Bin	Frequency
75
78.14286
81.28571	10
84.42857	12
87.57143	14
90.71429	10
93.85714
More

Histogram.

Descriptive statistics table.

Prior Training		Revised Training

Mean	69.79032258	Mean	84.77419355
Standard Error	1.402788093	Standard Error	0.659478888
Median	70	Median	85
Mode	80	Mode	85
Standard Deviation	11.04556449	Standard Deviation	5.192741955
Sample Variance	122.004495	Sample Variance	26.96456901
Kurtosis	-0.77667598	Kurtosis	-0.352537913
Skewness	-0.086798138	Skewness	0.144084526
Range	41	Range	22
Minimum	50	Minimum	75
Maximum	91	Maximum	97
Sum	4327	Sum	5256
Count	62	Count	62

Measurement scale.

Test scores from the prior training and the revised training were recorded. The ratio scale is the measurement scale used in this scenario.

Measure of central tendency.

The prior training group had a mean of 69.79032258, a median of 70, and a mode of 80. The revised training group had a mean of 84.77419355, a median of 85, and a mode of 85.

Evaluation.

The histograms for both groups show an approximately normal, bell-shaped distribution. Therefore, the assumptions for parametric statistical testing were met.

Dependent Samples (Paired-Samples) t-Test: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Pre-Exposure		Post Exposure
Bin	Frequency	Bin	Frequency

16		16
24		24
32		32
40	13	40	11
48	12	48	14
More		More

Histogram.

Descriptive statistics table.

Pre-Exposure		Post Exposure

Mean	32.85714	Mean	33.28571429
Standard Error	1.752307	Standard Error	1.781423416
Median	35	Median	36
Mode	36	Mode	38
Standard Deviation	12.26615	Standard Deviation	12.46996391
Sample Variance	150.4583	Sample Variance	155.5
Kurtosis	-0.57604	Kurtosis	-0.654212507
Skewness	-0.42511	Skewness	-0.483629097
Range	50	Range	50
Minimum	6	Minimum	6
Maximum	56	Maximum	56
Sum	1610	Sum	1631
Count	49	Count	49

Measurement scale.

The data provided are pre-exposure and post-exposure measurements on the ratio scale.

Measure of central tendency.

The pre-exposure mean is 32.85714, the median is 35, and the mode is 36. The post-exposure mean is 33.28571429, the median is 36, and the mode is 38.
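Because each pre-exposure value is paired with a post-exposure value, the mean of the paired differences equals the difference between the two means. A brief sketch using only the sums, count, and means from the table above (the variable names are illustrative):

```python
# Sums, count, and means copied from the Pre-Exposure / Post Exposure table.
n = 49
pre_sum, post_sum = 1610, 1631
pre_mean, post_mean = 32.85714, 33.28571429

# Mean of the paired differences (post minus pre), computed from the sums.
mean_difference = (post_sum - pre_sum) / n

print(round(mean_difference, 4))
print(round(post_mean - pre_mean, 4))
```

Both lines print the same value to four decimal places, an average increase of about 0.43 from pre-exposure to post-exposure.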

Evaluation.

The histograms for both measurements show an approximately normal, bell-shaped distribution. Therefore, the assumptions for parametric statistical testing were met.

ANOVA: Descriptive Statistics and Assumption Testing

Frequency distribution table.

Air		Soil
Bin	Frequency	Bin	Frequency

5.75		7.75
8.5		9.5	10
11.25		11.25
More		More

Water		Training
Bin	Frequency	Bin	Frequency

5.25		4.25
7.5		5.5
9.75		6.75
More		More

Histogram.

Descriptive statistics table

Air		Soil

Mean	8.9	Mean	9.1
Standard Error	0.684028316	Standard Error	0.390006748
Median	9	Median	8
Mode	11	Mode	8
Standard Deviation	3.059067625	Standard Deviation	1.744163199
Sample Variance	9.357894737	Sample Variance	3.042105263
Kurtosis	-0.62830092	Kurtosis	0.119230317
Skewness	-0.360849171	Skewness	0.492001831
Range	11	Range
Minimum	3	Minimum
Maximum	14	Maximum	13
Sum	178	Sum	182
Count	20	Count	20
Water		Training

Mean	7	Mean	5.4
Standard Error	0.575828922	Standard Error	0.265567912
Median	6	Median	5
Mode	6	Mode	5
Standard Deviation	2.575185226	Standard Deviation	1.187655807
Sample Variance	6.631578947	Sample Variance	1.410526316
Kurtosis	-0.237524639	Kurtosis	0.253746631
Skewness	0.760206271	Skewness	0.159183094
Range		Range
Minimum		Minimum
Maximum	12	Maximum
Sum	140	Sum	108
Count	20	Count	20

Measurement scale

The data provided are measured on the ratio scale.

Measure of central tendency

For air, the mean is 8.9, the median is 9, and the mode is 11. For soil, the mean is 9.1, the median is 8, and the mode is 8. For water, the mean is 7, the median is 6, and the mode is 6. For training, the mean is 5.4, the median is 5, and the mode is 5.
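Each group mean follows from the Sum and Count rows of the descriptive statistics table, and because the four groups are the same size, the grand mean is the total of all sums divided by the total count. A short sketch using only those table values (the variable names are illustrative):

```python
# Sums and per-group count copied from the ANOVA descriptive statistics table.
sums = {"Air": 178, "Soil": 182, "Water": 140, "Training": 108}
n_per_group = 20

group_means = {group: total / n_per_group for group, total in sums.items()}
grand_mean = sum(sums.values()) / (n_per_group * len(sums))

for group, mean in group_means.items():
    print(f"{group}: {mean}")
print(f"Grand mean: {grand_mean}")
```

The spread of the group means around the grand mean is what the ANOVA later compares with the variation within each group.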

Evaluation

The histograms show an approximately normal, bell-shaped distribution. Therefore, the assumptions for parametric statistical testing were met.

References

George, D., & Mallery, P. (2016). Descriptive statistics. In IBM SPSS Statistics 23 Step by Step (pp. 126-134). Routledge.

Simonsohn, U., Simmons, J. P., & Nelson, L. D. (2019). Specification curve: Descriptive and inferential statistics on all reasonable specifications. Available at SSRN 2694998.