Data analysis project. Using Citrix workspace In this project assignment, you will need to conduct a hypothesis test about two population means. You can use the General Social Survey (gss.sav) datase

Data Analysis Project


In this project assignment, you will need to conduct a hypothesis test about two population means. You can use the General Social Survey (gss.sav) dataset posted on the Blackboard or your own dataset.

You will need to engage in the following tasks: (1) describe your data sources, (2) develop one or multiple hypotheses and (3) conduct an appropriate t-test to test your hypotheses. Write up to eight pages to describe your data sample, descriptive statistics and t-test results (double space, 12 Times New Roman, and 1 inch margins, including tables and figures).

Describe your data sources.

Brief descriptions of data: data source (first hand or second hand; where did you get it, if second hand), year of collection, unit of analysis, sample size, etc.

T-test

Since the t-test for two population means compares the means for two populations, your dataset should contain a binary (dummy/indicator) variable that divides your sample into two groups (e.g. female vs. male, developing countries versus developed countries). If the variable is not a binary variable, you may create one based on other variables. For example, using the variable “age”, you can generate a binary variable called “senior” by coding individuals with age older than 64 as senior people, if your research question is to examine whether there exists a difference between senior’s income and non-senior’s income.

Your dataset should also contain another interval or ratio variable, for which you want to compare the means. A nominal variable, unless it is dummy (binary) variable, generally would not be used as a dependent variable.

In your report, you need to state the research question you try to answer, your null hypothesis and alternative hypothesis. You should also briefly explain why the question is of your interest.

Next, you should discuss whether you will use a paired-sample t-test or a t-test for two independent means. And explain why this particular test method, instead of the alternative, is chosen.

Then you should present your test results in a table and interpret them in the text, followed by a conclusion on whether the analysis supports or rejects your null hypothesis. Please also explain whether the conclusion is line with your expectation, what the theory predicts or what the literature has found. If it is not, please provide some plausible explanations.