Provide examples of how the t test and ANOVA could be used to compare means within your work environment or domain of interest. Discuss the appropriateness of using the t test versus ANOVA. Use an exa

MBA 5652, Research Methods 1 Cou rse Learning Outcomes for Unit VI Upon completion of this unit, students should be able to: 6. Differentiate between various research -based tools commonly used in businesses. 6.1 Identify the most appropriate statistical procedure to use from among t test s or ANOVA to test hypotheses. 7. Test data for a business research project. 7.1 Determine whether to accept or reject null and alternative hypotheses by using t tests and ANOVA. Course/Unit Learning Outcomes Learning Activity 6.1 Unit Lesson Video: ANOVA with Excel Video: Performing an Independent Samples t test in Excel Video: How to do a Two Sample t test Paired Two Sample for Means in Excel 2013 Video: t-test in Microsoft Excel Article: “Encyclopedia of Research Design : Causal -Comparative Design” Unit VI Scholarly Activity 7.1 Unit Lesson Unit VI Scholarly Activity Reading Assignment In order to access the following resources, click the links below: Badii, M. (2009 , November 23 ). ANOVA with Excel [Video file ]. Retrieved from https://www.youtube.com/watch?v=leHOBf_ -9kM Click here for a transcript of the video. Blake, K. (2011 , April 9 ). Performing an independent samples t test in Excel [Video file ]. Retrieved from https://www.youtube.com/watch?v=norKDF0MH0M Click here for a transcript of the video. Brewer, E. W., & Kubn, J. (2010) . Casual -comparative design. In N. J. Salkind (Ed.) , Encyclopedia of research design: Causal -comparative design . Retrieved from http ://methods.sagepub.com.libraryresources.columbiasouthern.edu/reference/encyc -of-research - design/n42.xml Glen, S. (2013 , December 21 ). How to do a two sample t test paired two sample for means in Excel 2013 [Video file ]. Retrieved from https://www.youtube.com/watch?v=RHBIQ2reACM&t=8s Click here for a transcript of the video. UNIT VI STUDY GUIDE Data Analysis: t Test and ANOVA MBA 5652, Research Methods 2 UNIT x STUDY GUIDE Title Grange, J. (2011 , April 7 ). t-test in Microsoft Excel [Video file ]. Retrieved from https://www.youtube.com/watch?v=BlS11D2VL_U Click here for a transcr ipt of the video. Unit Lesson Data Analysis of t Test and ANO VA Unit V focused on the use of parametric statistical procedures, correlation analysis and regression analysis, to test hypotheses. Unit VI focuses on two additional parametric statistical procedures used to test hypotheses. Those two additional procedures are the t test and ANOVA. Like correlation analysis and regression analysis, the t test and ANOVA are forms of inferential statistics. Predictions are stated in the form of hypotheses, sample data are collected and tested, and statistically sig nificant results are used to make inferences about the population of interest (Zikmund, Babin, Carr, & Griffin, 2013). As was pointed out several times throu ghout the course, hypothesis testing either looks for statistically significant relationships between variables or statistically significant differences between variables or groups.

While correlation analysis and regression analysis look for relationships between variables, the t test and ANOVA look for differences between variables or groups. Consider these examples of research questions that may be answered using t test s and ANOVA .  Are there differences in air quality between sites A, B, C, and D?  Are th ere differences in employee safety training scores before and after completion of a training course?  Are there differences in box weight for cereal coming off production Line 1, Line 2, Line 3, and Line 4?  Are there differences in product satisfaction betw een male and female consumers? On the surface, it would seem these questions could easily be answered by simply comparing means. If samples show that the mean weights of cereal box es coming off production Line 1, Line 2, Line 3, and Line 4 are 18.2 oz., 1 8.4 oz., 17.8 oz. , and 18 oz. respectively, it would appear the answer is “yes,” there are differences in box weights for product s coming off Line 1, Line 2, Line 3, and Line 4. There is, however, one important point that prevents drawing any hasty conclus ions about mean differences between variables or groups; mean averages must be tested to determine if statistically significant differences exist. Statistical procedures, like the t test and ANOVA, are interested in the mean, but they also analyze how the data points are dispersed around the means. The means from two samples may appear different, but because of the variance of each data set, there may, in fact, be no statistically significant differences in means. The t test is used to compare two means (e .g. , product satisfaction between male and female consumers) while ANOVA is used to compare more than two means (e.g. , air quality between sites A, B, C, and D). Experimental and quasi -experimental research designs often use t test s and ANOVA. These are e xtremely powerful and useful procedures since variables can be manipulated and controlled to make claims of causation. For this reason, these statistical procedures are the primary tools used to determine the efficacy of drugs. This could be seen in testin g for the efficacy of a new cancer drug. For example, a control group will receive a placebo ( independent variable 1 [IV1]) while an experimental group receive s a new drug (IV 2). After the IV 1 and IV 2 are administered, the control group’s tumor size ( dependent variable [ DV ]) will be compared to the experimental group’s tumor size. If the mean size of experimental group’s tumor is less than the mean size of the control group’s tumor and the differences are statistically significant, a claim can be made that the new drug (IV 2) caused a reduction in tumor size (assuming all other variables were held constant). In many business and social science settings, it is impractical, impossible, unethical, or cost -prohibitive to use experimental research designs. A s an alternative, researchers use causal -comparative research designs ( i.e., ex post facto designs) to similarly look for differences between groups by comparing means. Since causal - comparative designs are ex post facto, meaning events and variables are an alyzed after the fact, it is not possible to manipulate and control variables. Since causal -comparative designs cannot control variables, as do experimental designs, causation cannot be claimed (even though the name causal -comparative would MBA 5652, Research Methods 3 UNIT x STUDY GUIDE Title suggest otherwi se). Nevertheless, causal -comparative designs are effective and frequently use the t test and ANOVA in business and social science research because they can make s trong inferences about the effect the IV has on the DV (Brewer & Kubn , 2010). The t Test (i.e., Student’s t Test ) The t test is the simplest form of a test of differences where only two means are compared. There are two types of t test s: independent test ( i.e., between -groups) and dependent test ( i.e ., within group, paired, or repeated -measures test). Independent sample t test (i.e., between groups): Using an independent t test , a DV is measured for two different groups of people or things that were exposed to different IVs. For example, Plant A employees were trained using a new safety program that the company is considering purchasing and rolling out to all of their plants across the United States (IV 1). Plant B employees are trained using the same safety program the employer has used for the past five years (IV 2). Lost -time -hours (DV) are then compared between the two plants to determine if a statistically significant difference exists. The hypotheses would be stated as below. Ho1: There is no statistically significant difference in lost -time -hours (DV) between Plant A (IV 1) and Plant B (IV 2). Ha1: There is a statistically significant difference in lost -time -hours (DV) between Plant A (IV 1) and Plant B (IV 2). Interpreting the independent sample t test output results : If statistically significant differences exist and lost -time -hours are lower in Plant A, management can make an informed decision, with certainty, about whether to invest in the new training program for all plants. The independent sample t test looked for a statistically significant difference in lost -time -hours between Plant A t raining (IV 1) and Plant B training (IV 2). The results below indicate that the mean lost -time -hours are indeed lower for Plant A (Variable 1) ; however, the results also indicate a p value of .37627 > .05. Therefore, the null hypothesis is accepted that ther e is no statistically significant difference in lost -time -hours (DV) between Plant A (IV 1) and Plant B (IV 2). Given these results, the company would not benefit from purchasing the new training program for all plants. Accept Ho1: There is no statistically significant difference in lost -time -hours (DV) between Plant A (IV 1) and Plant B (IV 2). Reject Ha1: There is a statistically significant difference in lost -time -hours (DV) between Plant A (IV 1) and Plant B (IV 2). Dependent samp le t test (i.e., within -group, paired , or repeated measure) : In a dependent t test , a DV is measured for one group of people or thing both before and after exposure to an IV to determine if statistically MBA 5652, Research Methods 4 UNIT x STUDY GUIDE Title significant differences exist. For example, assume a company has several years of data for the dependent variable (DV) of lost -time -hours. A year ago, they implemented a new safety training program (IV) that was purchased from a third party vendor. Management is now interested to know if there has been an improvement in safety. They compare the mean lost -time -hours before implementing the training to the mean lost -time -hours after implementing the training. The hypotheses would be stated as below. Ho1: There is no statistically significant difference in los t-time -hours (DV) before and after safety training (IV). Ha1: There is a statistically significant difference in lost -time -hours (DV) before and after safety training (IV). Interpreting the dependent sample t test : If there is a statistically significant difference in mean lost -time - hours, and the mean has declined after the training, the company can infer that the training (IV) was successful. Additionally, this information would be helpful in cost -justifying the expenditure for continued use of the safe ty program. The dependent t test looked for a statistically significant difference in in lost -time -hours (DV) before and after safety training (IV). The results below indicate that the mean lost -time -hours are indeed lower post -training and, more importan tly, the results also indicate a p value of .0445 < .05. Therefore, the null hypothesis is rejected , and the alternative hypothesis is accepted that there is a statistically significant difference in lost - time -hours (DV) before and after safety training (IV). Given these results, the company would benefit from continuing the use of the new training progr am implemented last year. Reject Ho1: There is no statistically significant difference in lost -time -hours (DV) before and after safety training (IV). Accept Ha1: There is a statistically significant difference in lost -time -hours (DV) before and after safety training (IV). The t test is both effective and easy to use when there are only two levels (or groups) of the independent variables (e.g. , Plant A training and Plant B training). If, however, the business problem requires comparing the means at more than two levels of the independent variable (e.g. , Plant A training, Plant B training, and Plant C training), the t test cannot be used. In this case, ANOVA would be used t o compare the DV means for more than two groups of the IV (Field, 2005). t test (G. McClelland, personal communication, May 29, 2018) MBA 5652, Research Methods 5 UNIT x STUDY GUIDE Title ANOVA (Analysis of Variance, i.e., One -Way ANOVA, Single -Factor ANOVA) Although the calculations are different, the purpose of using ANOVA and the t test is the same. The researcher is interested in determining if there are statistically significant differences in the mean scores for the dependent variable for different groups of people or things. The difference between the t test and ANOVA is that ANOVA m ust be used when there are more than two groups or more than two levels of the independent variable. Interpreting ANOVA: Interpreting ANOVA output is very similar to how other parametric test results have been interpreted thus far in the course. Like corr elation, regression, and the t test , the main interest interpreting ANOVA output is whether the p value is less than the .05 alpha level. This determines whether to accept or reject the null hypothesis. Consider the following example. A s ales organization has used four different lead generation programs (IV). In the table below, columns A, B, C, and D represent the different groups of the lead generation programs , and the data represent the number of leads produced each month (DV) for a te n-month period. The ANOVA output indicates that there is a statistically significant difference in leads (DV) between lead generation programs (IV) at a p value of .00091< .05. The null hypothesis would, therefore, be rejected , and the alternative hypothesis accepted. At this stage, when using correlation, regression, and the t test , the results provide sufficient information to answer the research questions and inform decision -making regarding the business problem . This is no t true for ANOVA. Although the ANOVA results provide enough information to accept or reject the null, they do not provide complete information for decision -making. Significant ANOVA results only inform the researcher that differences exist between means. W hile the above ANOVA results indicate that differences exist between means, the results do not indicate where the differences exist (Field, 2005) . Therefore, it is unknown whether differences exist between mean leads produced for all of the methods or only some of the methods. Fortunately, there is a post -hoc test, Tukey's HSD that can be conducted to determine exactly where the significant differences exist between means. Once these differences are known, the sales organization can eliminate the least effe ctive lead generation method(s). Tukey’s will not be covered in this course, but it is important to be aware of this second stage in using ANOVA to be able to make a completely informed decision. Old Navy Scenario Using t Test and ANOVA To elucidate the differences between t test and ANOVA, consider the following scenario . Assume Old Navy is interested in running a multimillion -dollar promotional campaign. Their business problem is how to best allocate the promotional dollars. Since their target market i ncludes both males and females in a ANOVA example One-way ANOVA Data Set: Statistical Results: Program 1 Program 2 Program 3 Program 4 Anova: Single Factor 98 83 79 69 68 110 78 68 SUMMARY 56 98 84 76 Groups Count Sum Average Variance 72 88 57 94 Program 1 10 750 75 176.8889 76 103 85 63 Program 2 10 951 95.1 77.87778 70 99 97 65 Program 3 10 830 83 123.1111 85 89 96 102 Program 4 10 737 73.7 187.1222 90 84 87 66 58 96 82 73 77 101 85 61 ANOVA Source of Variation SS df MS F P-value F crit Between Groups 2901.4 3 967.1333 6.846962 0.00091 2.866266 Within Groups 5085 36 141.25 Total 7986.4 39 MBA 5652, Research Methods 6 UNIT x STUDY GUIDE Title range of ages, they want to determine if statistically significant differences exist in the average expenditure made in their stores each year. If statistically significant differences exist, they can allocate their prom otional dollars to more heavily target the market that spends significantly more. Thanks to their loyalty program, they have in -house data that have been collected at the point -of-purchase and at the time of purchase. The loyalty program database includes demographic data, such as gender, age, and expenditures. The independent variable (IV) in this scenario is the target market. The target market can be segmented into groups by gender and age range. Old Navy’s researcher is also interested in mean scores of annual expenditures, which is the dependent variable (DV). The t test is limited to using two groups, or two levels of the independent variable. In this scenario, assume that the independent variable that will be tested is gender, and the two levels ar e males (IV 1) and females (IV 2). Since Old Navy’s database contains data for both gender and expenditure, the DV is tested for statistically significant differences in mean expenditure between males and females. The research question and hypotheses for thi s t test would be as follows. RQ1 : Are there differences in expenditure by gender? Ho1: There is no statistically significant difference in expenditure (DV) between males (IV 1) and females (IV 2). Ha1: There is a statistically significant difference in e xpenditures (DV) between males (IV 1) and females (IV 2). If the t test results indicate that there is a statistically significant difference in mean expenditure by gender, the null hypothesis can be rejected, and Old Navy can make an informed decision about how best to allocate promotional dollars. The researcher could similarly use a t test to analyze the IV of age to determine if there are statistically significant differences in the DV of expenditure. However, the t test is limited to only two group s, or two levels of the independent variable. The researcher would have to select only two levels of the IV, such as below 34 and above 35, or 25 –34 and 35 –44, or any other combination of two levels of age ranges. This restriction limits value that can be gleaned from the data set. A solution that may seem intuitive would be to conduct multiple t test s, but there are problems with this approach. If only three levels of the independent variable were tested (e.g. , 25 –34, 35 –44, 45 –54) , it would require three separate t test s to compare group means (e.g. , t test 1: 25 –34 and 35 –44 ; t test 2: 25 –34 and 45 –54 ; and t test 3: 35 –44 and 45 –54). If the level of independent variable was increased from 3 to 5, the number of t test s that would need to be conducted would increase to 10. It should be evident that this is a very cumbersome and impractical approach to testing for mean differences when the number of groups of the independent variable exceeds two. Another more important problem with this approach is that the c hance of committing a Type I error (rejecting the null when it should not be rejected) is magnified dramatically as the number of t test s incr ease (Field, 2005). The lesson here is that multiple t test s should not be used. Fortunately, ANOVA avoids the pro blem of both complexity and magnification of the chance of Type I errors. ANOVA does not limit the number of groups of the independent variable that can be tested. In the same scenario, each of the groups of the IV variable can be analyzed at once to dete rmine if there are statistically significant differences in mean expenditures (DV). The different groups of the IV would be described as follows: IV1: Under 12 years old IV2: 12 –17 years old IV3: 18 –24 years old IV4: 25 –34 years old IV5: 35 –44 years old IV6: 45 –54 years old IV7: 55 –64 years old IV8: 65 –74 years old IV9: 75 years or older MBA 5652, Research Methods 7 UNIT x STUDY GUIDE Title The research question and hypotheses for the ANOVA analysis would be: RQ2 : Are there differences in expenditure by age range? Ho2: There is no statistically significant difference in expenditures (DV) among IV1, IV 2, IV 3, IV 4, IV 5, IV6, IV 7, IV8, or IV 9. Ha2: There is a statistically significant difference in expenditures (DV) among IV1, IV 2, IV 3, IV 4, IV 5, IV6, IV 7, IV8, or IV 9. If the ANOVA results indicate that there are statistically significant differences in mean expenditure by age group, the null hypothesis is rejected. As mentioned above, the Tukey’s post -hoc test would be required to determine exactly where those signific ant differences exist. W hen Old Navy has these results, they would be empowered to make an informed decision about how best to allocate promotion dollars by age group. Food for Thought This unit discusse s the use of t test and ANOVA. Each example used on ly one independent variable and one dependent variable although different groups of the independent variable were used (e.g. , male and female for the independent variable of gender, and Methods 1, 2, 3, and 4 for the independent variable of lead generation method). Similarly, the examples only used one dependent variable (e.g. , lost -time -hours, leads produced, and expenditures). Business problems are not always this straight -forward. There are times when the researcher will need to analyze multiple independ ent variables and multiple groups of those independent variables to determine if mean differences exist for the dependent variable. For those instances, a different parametric procedure called factorial ANOVA would be used. Still, there are times when the research will need to analyze multiple independent and multiple dependent variables to determine if mean differences exist. For those instances, a parametric procedure called MANOVA (multivariate analysis of variance) would be used. These parametric tests are beyond the scope of this course, but it is important to know that there is a parametric or non -parametric test to answer any research question imaginable. References Brewer, E. W., & Kubn, J. (2010). Casual -comparative design. In N. J. Salkind (Ed.) , Encyclopedia of research design: Causal -comparative design . Retrieved from http://methods.sagepub.com.libraryresources.columbiasouthern.edu/reference/encyc -of-research - design/n42.xml Field, A. (2005). Discovering stats using SPSS (2nd ed.). London, England: Sage. Zikmund, W. G., Babin, B. J., Carr, J. C., & Griffin, M. (2013). Business research methods (9th ed.). Mason, OH: Cengage. Suggested Reading Review the following videos to learn about t test s. StatisticalLab. (2011 , December 6 ). t test paired two sample for means by using Excel [Video file ]. Retrieved form https://www.youtube.com/watch?v=KQGlxG6cmcA Click here for a transcript of the video. MBA 5652, Research Methods 8 UNIT x STUDY GUIDE Title Below is a practical example of using ANOVA. Annenberg Learner (Producer) . (2013). One -way ANOVA: Against all odds — inside statistics [Video fi le]. Retrieved from https://libraryresources.columbiasouthern.edu/login?auth=CAS&url=https://fod.infobase. com/PortalPl aylists.aspx?wID=273866&xtid=111550 The transcript for the video can be fo und by clicking the Transcript tab next to the video in the Films on Demand database.