This problem involves the Boston
data set from the MASS
library, which we saw in the lab for this chapter.
We will now try to predict per capita crime rate using the other variables in this data set.
In other words, per capita crime rate is the response, and the other variables are the predictors.
Some of the exercises are not tested by Dodona (for example the plots), but it is still useful to try them.
For each predictor, fit a simple linear regression model to predict the response
(remember that chas
is a categorical variable).
In which of the models is there a statistically significant (p < 0.05) association between the predictor and the response?
Delete the not signficant variables from the list significant.variables.single
below.
Create some plots to back up your assertions.
Fit a multiple regression model to predict the response using all of the predictors.
For which predictors can we reject the null hypothesis \(H_0 : \beta_j = 0\) (p < 0.05)?
Delete the not signficant variables from the list significant.variables.multiple
below.
How do your results from 1 compare to your results from 2? Create a plot displaying the univariate regression coefficients from 1 on the x-axis, and the multiple regression coefficients from 2 on the y-axis. That is, each predictor is displayed as a single point on the plot. Its coefficient in a simple linear regression model is shown on the x-axis, and its coefficient estimate in the multiple linear regression model is shown on the y-axis.
significant.variables.cubic
below.