Using t Test and ANOVA With Sun Coast Remediation Data Set
Using the Sun Coast Remediation data set, perform an independent samples t Test, dependent samples t Test, and ANOVA, and interpret the res

7/1/2020 Originality Report https://online.columbiasouthern.edu/webapps/mdb-sa-bb_bb60/originalityReport/ultra?attemptId=ad314e9f-8589-4631-86f0-46ffd8155113&course_id= …
SafeAssign Originality Report
Submission: Inferential Statistics.edited.docx (George Landry), submitted on 07/01/20 07:39 AM EDT
Submission UUID: aafb1804-f8f5-e0d4-fc66-2b742245f484
Total score: high risk. Highest match: 100 %. Average match: 100 %. Word count: 809.

Running head: INFERENTIAL STATISTICS

Inferential Statistics
George Landry
Columbia Southern University

Data Analysis: Hypothesis Testing
This paper conducts an independent samples t-test, a dependent samples (paired samples) t-test, and an ANOVA on the Sun Coast Remediation data set to support hypothesis testing.

Independent Samples t-Test
Sun Coast's employee training program was revised six months ago. To evaluate the program's effectiveness, one group of workers (Group A) participated in the earlier training program, whereas another group (Group B) participated in the new program.
Null and Alternative Hypothesis:
Ho4: There is no statistically significant difference in the mean training scores of Group A and Group B.
Ha4: There is a statistically significant difference in the mean training scores of Group A and Group B.

Excel Output Results:
t-Test: Two-Sample Assuming Unequal Variances

                                Group A Prior      Group B Revised
                                Training Scores    Training Scores
Mean                            69.79032258        84.77419355
Variance                        122.004495         26.96456901
Observations                    62                 62
Hypothesized Mean Difference    0
df                              87
t Stat                          -9.666557191
P(T<=t) one-tail                9.69914E-16
t Critical one-tail             1.662557349
P(T<=t) two-tail                1.93983E-15
t Critical two-tail             1.987608282

The results indicate that the mean for Group B (revised training scores) is higher than the mean for Group A (prior training scores). Moreover, the two-tailed p-value is 1.93983E-15, far below the alpha level of 0.05, so the difference between the means is statistically significant (Bashir, 2018). We therefore reject the null hypothesis that there is no statistically significant difference in the mean training scores of Group A and Group B, and accept the alternative hypothesis.

Dependent Samples (Paired Samples) t-Test
Workers remediating lead on the job sites were observed. Data were gathered on the workers' lead levels before and after exposure.

Null and Alternative Hypothesis:
Ho5: There is no statistically significant difference in mean lead levels between pre-exposure and post-exposure.
Ha5: There is a statistically significant difference in mean lead levels between pre-exposure and post-exposure.
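A paired test works on the per-worker differences, so its t statistic can be formed from summary figures alone once the variance of the differences is recovered from the two variances and the Pearson correlation. The following is a minimal sketch of that calculation, assuming Python with SciPy is available (the paper itself used Excel). The function name `paired_t_from_summary` is illustrative, and the input values are copied from the Excel output that follows.

```python
import math
from scipy import stats


def paired_t_from_summary(mean_pre, mean_post, var_pre, var_post, r, n):
    """Paired t-test from summary statistics (illustrative helper).

    Var(pre - post) = Var(pre) + Var(post) - 2 * r * SD(pre) * SD(post)
    """
    var_diff = var_pre + var_post - 2 * r * math.sqrt(var_pre * var_post)
    std_err = math.sqrt(var_diff / n)        # standard error of the mean difference
    t_stat = (mean_pre - mean_post) / std_err
    df = n - 1
    p_two_tail = 2 * stats.t.sf(abs(t_stat), df)
    return t_stat, df, p_two_tail


# Values taken from the "t-Test: Paired Two Sample for Means" output.
t_stat, df, p_two_tail = paired_t_from_summary(
    mean_pre=32.85714286,
    mean_post=33.28571429,
    var_pre=150.4583333,
    var_post=155.5,
    r=0.992236043,
    n=49,
)

alpha = 0.05
print(f"t = {t_stat:.4f}, df = {df}, two-tail p = {p_two_tail:.4f}")
print("reject Ho5" if p_two_tail < alpha else "fail to reject Ho5")
```

Because the calculation subtracts two nearly equal quantities, it is sensitive to rounding in the reported variances and correlation; with raw before and after measurements, `scipy.stats.ttest_rel` would be the more direct route.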
Excel Output Results:
t-Test: Paired Two Sample for Means

                                Pre-Exposure μg/dL    Post-Exposure μg/dL
Mean                            32.85714286           33.28571429
Variance                        150.4583333           155.5
Observations                    49                    49
Pearson Correlation             0.992236043
Hypothesized Mean Difference    0
df                              48
t Stat                          -1.929802563
P(T<=t) one-tail                0.029776357
t Critical one-tail             1.677224196
P(T<=t) two-tail                0.059552714
t Critical two-tail             2.010634758

The two-tailed p-value of 0.0596 is higher than the alpha level of 0.05, so there is no statistically significant difference between the means of the two variables. We therefore fail to reject the null hypothesis that there is no statistically significant difference in mean lead levels between pre-exposure and post-exposure; the data do not support the alternative hypothesis.

ANOVA
Sun Coast has gathered return on investment (ROI) data for four service lines: air monitoring, soil remediation, water reclamation, and health and safety training. ANOVA is therefore used to test whether mean ROI differs between the service lines.
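One-way ANOVA compares the variation between group means with the variation within groups. The following is a minimal sketch of how the F statistic is formed, assuming Python with SciPy is available (the paper itself used Excel). The function name `anova_from_summary` is illustrative, and the per-group count, mean, and variance are copied from the Excel output reported below.

```python
from scipy import stats

# (count, mean, variance) per service line, copied from the Excel SUMMARY table.
groups = {
    "Air": (20, 8.9, 9.357894737),
    "Soil": (20, 9.1, 3.042105263),
    "Water": (20, 7.0, 6.631578947),
    "Training": (20, 5.4, 1.410526316),
}


def anova_from_summary(groups):
    """One-way ANOVA from group summaries (illustrative helper)."""
    n_total = sum(n for n, _, _ in groups.values())
    grand_mean = sum(n * mean for n, mean, _ in groups.values()) / n_total

    # Between-groups: spread of the group means around the grand mean.
    ss_between = sum(n * (mean - grand_mean) ** 2 for n, mean, _ in groups.values())
    # Within-groups: pooled spread of observations around their own group mean.
    ss_within = sum((n - 1) * var for n, _, var in groups.values())

    df_between = len(groups) - 1
    df_within = n_total - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    p_value = stats.f.sf(f_stat, df_between, df_within)
    return ss_between, ss_within, df_between, df_within, f_stat, p_value


ss_between, ss_within, df_between, df_within, f_stat, p_value = anova_from_summary(groups)
print(f"SS between = {ss_between:.1f}, SS within = {ss_within:.1f}")
print(f"F({df_between}, {df_within}) = {f_stat:.4f}, p = {p_value:.3e}")
```

A significant F only indicates that at least one service line differs from the others; it does not say which one.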
Null and Alternative Hypothesis:
Ho6: There is no statistically significant difference in mean ROI between the four service lines.
Ha6: There is a statistically significant difference in mean ROI between the four service lines.

Excel Output Results:
ANOVA: Single Factor

SUMMARY
Groups          Count    Sum    Average    Variance
A = Air         20       178    8.9        9.357894737
B = Soil        20       182    9.1        3.042105263
C = Water       20       140    7          6.631578947
D = Training    20       108    5.4        1.410526316

ANOVA
Source of Variation    SS       df    MS          F              P-value        F crit
Between Groups         182.8    3     60.93333    11.92310333    1.75888E-06    2.72494392
Within Groups          388.4    76    5.110526
Total                  571.2    79

From the Excel output, soil remediation has the highest mean ROI, while training has the lowest. Moreover, the p-value is 1.75888E-06, far below the alpha level of 0.05, which indicates a statistically significant difference in mean values (Su et al., 2017). We therefore reject the null hypothesis that there is no statistically significant difference in mean ROI between the four service lines and accept the alternative hypothesis.
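As a closing cross-check, the independent samples t-test reported earlier can also be recomputed from its summary statistics. The following is a minimal sketch, assuming Python with SciPy is available (the paper itself used Excel); it uses `scipy.stats.ttest_ind_from_stats` with `equal_var=False` to match the "Unequal Variances" option, and the variable names are illustrative.

```python
import math
from scipy import stats

# Summary statistics copied from the "Two-Sample Assuming Unequal Variances" output.
mean_a, var_a, n_a = 69.79032258, 122.004495, 62   # Group A, prior training
mean_b, var_b, n_b = 84.77419355, 26.96456901, 62  # Group B, revised training

# Welch's t-test from summary statistics; SciPy expects standard deviations.
result = stats.ttest_ind_from_stats(
    mean1=mean_a, std1=math.sqrt(var_a), nobs1=n_a,
    mean2=mean_b, std2=math.sqrt(var_b), nobs2=n_b,
    equal_var=False,
)
t_stat, p_two_tail = result.statistic, result.pvalue

# Welch-Satterthwaite degrees of freedom, computed by hand for comparison.
a, b = var_a / n_a, var_b / n_b
welch_df = (a + b) ** 2 / (a ** 2 / (n_a - 1) + b ** 2 / (n_b - 1))

print(f"t = {t_stat:.4f}, df = {welch_df:.2f}, two-tail p = {p_two_tail:.3e}")
```

The Excel output lists df as the whole number 87, while this sketch keeps the fractional degrees of freedom, so the p-value may differ slightly from the reported one without changing the conclusion.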

References
Bashir, J. (2018). Hypothesis testing. Scientific Journal of India, 3(1), 62-63. https://doi.org/10.21276/24565644/2018.v3.i1.21
Su, Y., Di, C., & Hsu, L. (2017). Hypothesis testing in linear functional models. Biometrics, 73(2), 551-561. https://doi.org/10.1111/biom.12624