BUSINESS STATISTICS 1

Instructions:

This is a group assignment with a minimum group size of two and a maximum group size of three. All

group members will receive the same marks for the assignment.   All group members must be enrolled in

the same tutorial.  The assignment must be provided in the form of a (brief) business report

approximately 5-9 pages (including this cover page). You must submit an electronic copy of your

assignment in Blackboard.  Hard copies will not be accepted. SHOW YOUR WORK for Calculation based

questions.

This assignment requires the use of Microsoft Excel.  If you have Windows, you will also need to use

the Data Analysis ToolPak.  If you have a Mac, you will need to use StatPlus:MAC LE.

Problem Description:

You are consultants working with an online real estate appraiser, onthehouse.com.au.  In order to

better calibrate their models to predict housing prices, your supervisor has asked your group to

develop a model to appraise the price of homes in a capital city of Australia based on characteristics

of the home and the surrounding neighbourhood.  In economics, this is commonly called a “Hedonic

Regression.”

You will use descriptive statistics, inferential statistics and your knowledge of multiple linear

regression to complete this task.

Housing data for 100 single-family units lists housing price data (in $000s) (Dependent Variable) and

several characteristics of the home and neighbourhood (Independent Variable) for a capital city in

Australia are given in the Excel file: Monday.xlsx.

Here is a table describing the variables in the data set:
Variable    Definition
Price    Price of sold single-family home is $000s
Bed    Number of bedrooms in the house
Dis    Distance to nearest CBD in kilometres
Floor    Area of home is square metres
School    State ranking of nearby public secondary school.  Varies from 0 to 100 points.
Train    Dummy Variable indicating whether a train station is located within 500 metres

Required:

A.    Calculate the descriptive statistics from the data and display in a table.  Be sure to comment

on the central tendency, variability and shape for housing price and two additional variables. (1 Mark)
B.    Draw a graph that displays the relative share of bedrooms in the sample. (1 Mark)
C.    Create a box-and-whisker plot for the distribution of the price of the homes and describe the

shape.  Is there evidence of outliers in the data? (1 Mark)
D.    What is the likelihood that a house is both over $600,000 and more than 10 kilometres from the

CBD? Is the price statistically independent of distance?  Use a Contingency Table. (2 Marks)
E.    Estimate the 90% confidence interval for the population mean housing price. (1 Mark)
F.    Your supervisor recently stated that it is obvious that the mean housing price is greater than

$610,000, which was the average price of housing sold last year.  Test his claim at the 5% level of

significance. (1 Mark)
G.    Run a multiple linear regression using the data and show the output from Excel. (1 Mark)
H.    Is the coefficient estimate for the number of bedrooms statistically different than zero at the

5% level of significance?  Set-up the correct hypothesis test using the results found in the table in

Part (G) using both the critical value and p-value approach.  Interpret the coefficient estimate of the

slope. (2 Marks)
I.    Interpret the remaining slope coefficient estimates. Comment on whether the signs are what you

are expecting. (2 Marks)
J.    Interpret the value of the Adjusted R2. Is the overall model statistically significant at the

1% level of significance?  Use the p-value approach. (1 Mark)
K.    Do the results suggest that the data satisfy the assumptions of a linear regression: Linearity,

Normality of the Errors, and Homoscedasticity of Errors?  Show using scatter diagrams, normal

probability plots and/or histograms and Explain. (3 Marks)
L.    Based on the results of the regressions, is it likely that other factors have influenced

housing prices?  If so, provide a couple possible examples and indicate whether these would likely

influence the regression results if they were included. (1 Mark)
M.    If a community housing organisation asked for information regarding the characteristics of

housing targeting the households of Aboriginal and Torres Strait islanders, explain whether a simple

random sampling technique would provide an accurate representation of these households. (Note: This

question does not use the data) (1 Mark)

Allocation of Marks:
Professional Business Report        2 Marks
Part A                    1 Mark
Part B                    1 Mark
Part C                    1 Mark
Part D                    2 Marks
Part E                    1 Mark
Part F                    1 Mark
Part G                    1 Mark
Part H                    2 Marks
Part I                    2 Marks
Part J                    1 Mark
Part K                    3 Marks
Part L                    1 Mark
Part M                    1 Mark
Total:                    20 Marks

http://www.investopedia.com/terms/h/hedonicpricing.asp