Instructions Final Project Now that you have completed the first six assignments, it is time to complete your research project for the course. Include the following sections in your submission. Title
Running head: DATA ANALYSIS 0
Descriptive Statistics Analysis
Name
Columbia Southern University
Data Analysis: Descriptive Statistics and Assumption Testing
Sun Coast data was investigated to choose whether normal distribution of data was shown, which would be compulsory for postulation for parametric statistical tests. All the investigation questions were known to have ratio data that is classifiable, and has a considerable distance between the data values. The questions have a true zero features. Mean (average), mode, and median are very useful in the study.
Correlation: Descriptive Statistics and Assumption Testing
Frequency distribution table.
Frequency | Range | |
13 | ||
18 | ||
24 | ||
18 | ||
12 | ||
10 | ||
11 | ||
12 |
Histogram
Descriptive statistics table.
Sick Days | |
Mean | 7.126214 |
Standard Error | 0.186484 |
Median | |
Mode | |
Standard Deviation | 1.892605 |
Sample Variance | 3.581953 |
Kurtosis | 0.124923 |
Skewness | 0.14225 |
Range | 10 |
Minimum | |
Maximum | 12 |
Sum | 734 |
Count | 103 |
Measurement scale.
The four scales used to estimate the information are ratio, nominal, interval and ordinal scales. However, both the ordinal and nominal values do not have substantial distance between data values or true zero. Interval data scales have a considerable distance amid data values; however, they have no true zero. Ratios data can be grouped, well-arranged, and have considerable distance between data values, and have true zero (Simonsohn, Simmons & Nelson, 2019). Therefore, the measurement scale used in this scenario is ratio due to the data being utilized is the sick days.
The measure of central tendency.
Central tendency illustrates the degree to which data points are disseminated around the mid-point of the curve. The mid-point is estimated by average, median, and mode. The mean is 7.12, the median is seven, and the mode is seven.
Evaluation.
The parametric test demands that the assumption of normality be met. Normality is when data is distributed normally and when graphed looks like a bell shape. There are other different typical assumptions that should be met, depending on the statistical procedure used, include sample size, levels of measurement, consistency of variance, independence, absence of outliers, linearity, etc. In this scenario, there is, in fact, a bell-shaped graph, and the data were distributed normally.
Simple Regression: Descriptive Statistics and Assumption Testing
Frequency distribution table.
Lost Time Hours | Frequency |
10 | |
35 | |
60 | |
85 | |
110 | 17 |
135 | 18 |
160 | 24 |
185 | 27 |
210 | 37 |
235 | 24 |
260 | 21 |
285 | 15 |
310 | 12 |
335 | |
More |
Histogram.
Descriptive statistics table.
Lost Hours | |
Mean | 188.0044843 |
Standard Error | 4.803089447 |
Median | 190 |
Mode | 190 |
Standard Deviation | 71.72542099 |
Sample Variance | 5144.536016 |
Kurtosis | -0.501223533 |
Skewness | -0.081984874 |
Range | 350 |
Minimum | 10 |
Maximum | 360 |
Sum | 41925 |
Count | 223 |
Measurement scale.
According to the information, there are safety training expenditures and lost time hours. The measurement scale used in the scenario is the ratio.
A measure of central tendency.
The mid-point is measured by mean, median, and mode. The mean is 188.0044843, the median is 190, and the mode is 190.
Evaluation.
In this scenario, the lost hours range from 10 to 335, with 10 and 335 having fewer numbers of lost hours. Lost time hours surge progressively, displaying a bell-shaped graph, and the data was distributed normally. Therefore, the assumptions for parametric statistical testing were met.
Multiple Regression: Descriptive Statistics and Assumption Testing
Frequency distribution table.
Bin | Frequency |
103.38 | |
104.3697 | |
105.3593 | |
106.349 | |
107.3386 | |
108.3283 | |
109.3179 | |
110.3076 | 12 |
111.2973 | 18 |
112.2869 | 17 |
113.2766 | 26 |
114.2662 | 22 |
115.2559 | 27 |
116.2456 | 47 |
117.2352 | 36 |
118.2249 | 44 |
119.2145 | 47 |
120.2042 | 53 |
121.1938 | 61 |
122.1835 | 60 |
123.1732 | 62 |
124.1628 | 74 |
125.1525 | 70 |
126.1421 | 81 |
127.1318 | 92 |
128.1214 | 73 |
129.1111 | 105 |
130.1008 | 80 |
131.0904 | 88 |
132.0801 | 67 |
133.0697 | 50 |
134.0594 | 56 |
135.0491 | 35 |
136.0387 | 30 |
137.0284 | 19 |
138.018 | |
139.0077 | |
139.9973 | |
More |
Histogram.
Descriptive statistics table.
Decibels | |
Mean | 124.8359428 |
Standard Error | 0.177944692 |
Median | 125.721 |
Mode | 127.315 |
Standard Deviation | 6.898656622 |
Sample Variance | 47.59146318 |
Kurtosis | -0.3141873 |
Skewness | -0.418952188 |
Range | 37.607 |
Minimum | 103.38 |
Maximum | 140.987 |
Sum | 187628.422 |
Count | 1503 |
Measurement scale
In every contract, employees are subject to noises during the job. These noises eventually lead to injuries. The louder the noises, the more they are susceptible to injuries. The information provided utilizes the ratio scale.
The measure of central tendency
The mean decibels are 124.8359428, the median 125.721 and the mode 127.315
Evaluation
The decibels employees are exaggerated by range from 103 to 140. This histogram is bell-shaped with the most decibels at 129.1111; thus, assumptions for parametric statistical testing observed.
Independent Samples t-Test:
Prior Training | |
Bin | Frequency |
50 | |
55.85714 | |
61.71429 | |
67.57143 | |
73.42857 | 14 |
79.28571 | 10 |
85.14286 | |
More | |
Revised Training | |
Bin | Frequency |
75 | |
78.14286 | |
81.28571 | 10 |
84.42857 | 12 |
87.57143 | 14 |
90.71429 | 10 |
93.85714 | |
More |
Histogram.
Descriptive statistics table.
Prior Training | Revised Training | |||
Mean | 69.79032258 | Mean | 84.77419355 | |
Standard Error | 1.402788093 | Standard Error | 0.659478888 | |
Median | 70 | Median | 85 | |
Mode | 80 | Mode | 85 | |
Standard Deviation | 11.04556449 | Standard Deviation | 5.192741955 | |
Sample Variance | 122.004495 | Sample Variance | 26.96456901 | |
Kurtosis | -0.77667598 | Kurtosis | -0.352537913 | |
Skewness | -0.086798138 | Skewness | 0.144084526 | |
Range | 41 | Range | 22 | |
Minimum | 50 | Minimum | 75 | |
Maximum | 91 | Maximum | 97 | |
Sum | 4327 | Sum | 5256 | |
Count | 62 | Count | 62 |
Measurement scale.
Test scores from prior training and revised training were reorded. The ratio measurement is the scale used in this scenario.
Measure of central tendency.
The preceding training scores group had a mean of 69.79032258, a median of 70, and a mode of 80. In the revised training with group b, the mean was 84.77419355, a median of 85 and mode 85.
Evaluation.
The histogram shows a normal bell-shaped figure from the data provided. Therefore, assumptions for parametric statistical testing were met.
Dependent Samples (Paired-Samples) t-Test: Descriptive Statistics and Assumption Testing
Frequency distribution table.
Pre-Exposure | Post Exposure | |||
Bin | Frequency | Bin | Frequency | |
16 | 16 | |||
24 | 24 | |||
32 | 32 | |||
40 | 13 | 40 | 11 | |
48 | 12 | 48 | 14 | |
More | More |
Histogram.
Descriptive statistics table.
Pre-Exposure | Post Exposure | ||
Mean | 32.85714 | Mean | 33.28571429 |
Standard Error | 1.752307 | Standard Error | 1.781423416 |
Median | 35 | Median | 36 |
Mode | 36 | Mode | 38 |
Standard Deviation | 12.26615 | Standard Deviation | 12.46996391 |
Sample Variance | 150.4583 | Sample Variance | 155.5 |
Kurtosis | -0.57604 | Kurtosis | -0.654212507 |
Skewness | -0.42511 | Skewness | -0.483629097 |
Range | 50 | Range | 50 |
Minimum | Minimum | ||
Maximum | 56 | Maximum | 56 |
Sum | 1610 | Sum | 1631 |
Count | 49 | Count | 49 |
Measurement scale.
The data provided is pre-exposure numbers and post-exposure using the ratio measurement.
Measure of central tendency.
The pre-exposure mean is 32.85714; the median is 35 and mode 36. The post-exposure mean is 33.28571429, median 36, and mode 38.
Evaluation.
The histogram demonstrates a normal bell-shaped figure from the data provided. Therefore, assumptions for parametric statistical testing were met.
ANOVA: Descriptive Statistics and Assumption Testing
Frequency distribution table.
Air | Soil | |||
Bin | Frequency | Bin | Frequency | |
5.75 | 7.75 | |||
8.5 | 9.5 | 10 | ||
11.25 | 11.25 | |||
More | More | |||
Water | Training | |||
Bin | Frequency | Bin | Frequency | |
5.25 | 4.25 | |||
7.5 | 5.5 | |||
9.75 | 6.75 | |||
More | More |
Histogram.
Descriptive statistics table
Air | Soil | ||
Mean | 8.9 | Mean | 9.1 |
Standard Error | 0.684028316 | Standard Error | 0.390006748 |
Median | Median | ||
Mode | 11 | Mode | |
Standard Deviation | 3.059067625 | Standard Deviation | 1.744163199 |
Sample Variance | 9.357894737 | Sample Variance | 3.042105263 |
Kurtosis | -0.62830092 | Kurtosis | 0.119230317 |
Skewness | -0.360849171 | Skewness | 0.492001831 |
Range | 11 | Range | |
Minimum | Minimum | ||
Maximum | 14 | Maximum | 13 |
Sum | 178 | Sum | 182 |
Count | 20 | Count | 20 |
Water | Training | ||
Mean | Mean | 5.4 | |
Standard Error | 0.575828922 | Standard Error | 0.265567912 |
Median | Median | ||
Mode | Mode | ||
Standard Deviation | 2.575185226 | Standard Deviation | 1.187655807 |
Sample Variance | 6.631578947 | Sample Variance | 1.410526316 |
Kurtosis | -0.237524639 | Kurtosis | 0.253746631 |
Skewness | 0.760206271 | Skewness | 0.159183094 |
Range | Range | ||
Minimum | Minimum | ||
Maximum | 12 | Maximum | |
Sum | 140 | Sum | 108 |
Count | 20 | Count | 20 |
Measurement scale
The information given is using the ratio measurement.
Measure of central tendency
The mean for air is 8.9, the median nine, and mode 11. For air, the mean is 9.1, median 8, and mode 8. Water means 7, the median is 6, and mode is 6. The central training tendency has a mean of 5.4, median five, and mode 5.
Evaluation
The histogram shows a normal bell-shaped figure from the data provided. Therefore, assumptions for parametric statistical testing were observed.
References
George, D., & Mallery, P. (2016). Descriptive statistics. In IBM SPSS Statistics 23 Step by Step (pp. 126-134). Routledge.
Simonsohn, U., Simmons, J. P., & Nelson, L. D. (2019). Specification curve: Descriptive and inferential statistics on all reasonable specifications. Available at SSRN 2694998.