Regression Assignment for Statistics for Business
Statistics for Business II Regression Assignment
Due: Tuesday, April 1 8, 201 7 by 6 :00pm
No late assignments will be accepted. NO EXCEPTIONS!
READ ALL INSTRUCTIONS!
You must turn in your regression assignment by 6:00pm on April 18, 2017 to the CourseDen
assignment folder (formerly called “dropbox ”). Requests to turn in the regression assignment
after this time will be denied . NO EXCEPTIONS ! DO NOT EMAIL ANY FILES TO ME. Only
assign ments submitted to the CourseDen assignment s folder (formerly called “dropbox ”)
will be graded. It is your responsibility to ensure you have submitted the file you wish to
have graded. If you upload an incorrect file, you will NOT be allowed to resubmit the file
after the deadline has pa ssed, and your grade for the assignment will be based on the
assignment submitted to the CourseDen folder.
Final assignment submissions should be clearly labeled and neatly organized. Answers should be labeled using
the question number and letter (f or example, 1a) and should be in the same order as the questions. Each Excel
worksheet must be clearly labeled. Poin ts will be deducted for assignments that are not neatly put together or
for answers not clearly labeled .
You are to work on this assignment individually. Students who turn in the same /similar
assignment s will each receive a grade of zero. These students will also be reported to the
Student Conduct Office for violation of the Academic Honor Code.
Please Note:
Emailed ass ignments will NOT be accepted. If you have trouble uploading your completed
ass ign ment to the CourseDen assignment folder (formerly called “dropbox ”) or have questions
regarding the file y ou have submitted , you must notify me in writing no later than 2:00 pm on
the scheduled due date, failure to do so will result in a grade of ze ro if the assignment is not
turned in to the dropbox (assignment) folder by the deadline.
You should ensure that your Excel file is saved properly and can be reopened for grading. If
you are unsure that your file has been saved properly, you should notify me no later than
2:00 pm on the scheduled due date. Once the deadline has passed, i f I cannot open your
submitted Exce l file for grading, a grade of zero will be given and no resubmission will be
allowed .
IT IS YOUR RESPONSIBILITY TO SUBMIT THE FILE YOU WISH TO HAVE
GRADED BEFORE THE SUBMISSION DEADLINE. ONLY FILES SUB MITTED TO
THE COURSE DEN ASSIGNMENTS FOLDER (formerly called “dropbox ”) WILL BE
GRADED. NO RESUBMISSIONS WILL BE ALLOWED FOR ANY REASON. NO
EXCEPTIONS!
Read and follow all instructions before beginnin g your assignment.
Failure to follow instructions will result in a loss of points.
Question 1 : In this assignment you are provided with data from a research paper by Ella Hartenian and
Nicholas Horton, who examined whether or not there was a relationship between rail trails a nd property
values in Northamp ton, MA. The da ta are located in the file `house prices.xlsx' which is in you r e -mail on D2L
(course den email) . Please see the accompanying word document, `documentation.docx,' which describes
what each of the columns in the data set mean and how they are scaled. For th is assignment you will learn
how to estimate a common regression model used in economics, finance , and real estate called a “ hedonic
price model."
1. Create a copy of the data set by right -clicking the tab at the botto m and clicking `Move or Copy...’, then
create another copy of the data set in the same workbook. Label this copy “Modified Data" and the original
tab “ Original Data." N ow click on the tab for “Modified Data." Delete all variables except `price2014,'
`bedrooms,' `garage space', `nofullbath', ` norooms', `squarefeet,' and `walkscore.' You should have seven
total variables.
(a) M ake sure `price 2014' is the far l eft variable, then estimate the model where `price2014' is a function
of the other six variables .
(b) For each variable, explain what the sign of the regression coefficient was, what the magnitude was
(i.e., if x goes up by one unit how much does y increase by), and whether or not the results of your
regr ession were consistent with what you believed the sign and magnitude to be. Then explain if any
variables were insignificant.
(c) Write down the equation of your regression line, including all variables. Next, calculate the average
value for each of your six explanatory variables using the AVERAGE() function in Excel. Finally, make a
prediction for the price if each of the variables take their average value.
(d) Explain briefly if there are any variables in the data set you think should have been included and, i f so,
why.
Question 2 : In this assignment you are provided with data from a research paper by Ella Hartenian and
Nicholas Horton, who examined whether or not there was a relationship between rail trails a nd property
values in Northamp ton, M A. The data are located in the fi le `house prices.xlsx' which is in you r e -mail on D2L
(course den email) . Please see the accompanying word document, `documentation.docx,' which describes
what each of the columns in the data set mean and how they are scaled. For this assi gnment you will assess
whether rail trails in fluence property values.
2. Create a copy of the data set by right -clicking the tab at the bottom and clicking `Move or Copy...’, then
create another copy of the data set in the same workbook. Label this copy “M odified Data" and the original
tab “ Original Data." Now click on the tab for “Modifi ed Da ta 2." Delete all variables you consider to be
irrelevant to the price of a house. Make sure `price2014' is the far left variable. Then answer the questions:
(a) Estimate the model where house prices are affected by rail distance only. What is the sign and
magnitude of the coefficient on rail distance, and what is the fit of the regression?
(b) Estimate a new model where house prices are affected by both ra il distance and previ ous prices. What
is the sign and magnitude of the coefficient on rail distance, and what is the fi t of the regression?
(c) Estimate a new model where house prices are affected by both rail distance and previous prices, as
well as any other variable you consid er relevant. What is the sign and magnitude of the co efficient on
rail distance, and what is the fit of the regression?
(d) Write a paragraph tha t answers whether rail trails aff ect housing prices based on your three models.