All analytics tasks must be completed using R, and should be reproducible from the R code, which must be submitted as an appendix.
This log book entry will focus on using association and regression to examine the relationships between multiple variables. Using the ames_train dataset (and the test data for step 3), complete the following tasks:
- Pick 5 independent variables that you believe will be related to the dependent variable. Calculate appropriate measures of association for the bivariate relationships between the independent and dependent variables.
- Produce a regression model to investigate whether the relationships you expected exist. You can also include additional variables to improve the model.
- Use the test data to make house price predictions, and evaluate the model's accuracy.
- Summarise the key insights that emerged from the regression and prediction. (600 words)
- Reflect on your personal development in the use of statistics and R. (300 words)
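
As a rough illustration of how the three analytical tasks above might fit together in R, the sketch below runs through association, regression, and prediction on simulated data. It is only a sketch under stated assumptions: the data frame `toy`, its columns (`living_area`, `quality`, `sale_price`), and the 150/50 split are hypothetical stand-ins, not the real ames_train or test data, and Pearson and Spearman correlations are just two of several measures of association that could be appropriate depending on the variable types.

```r
# Simulated stand-in data. This is NOT ames_train: all column names and
# values are hypothetical and exist only to make the sketch runnable.
set.seed(1)
n <- 200
toy <- data.frame(
  living_area = runif(n, min = 50, max = 300),    # continuous predictor
  quality     = sample(1:10, n, replace = TRUE)   # ordinal predictor
)
toy$sale_price <- 50000 + 800 * toy$living_area +
  12000 * toy$quality + rnorm(n, sd = 20000)

# Hypothetical train/test split standing in for the supplied train and test data.
train_idx <- sample(seq_len(n), size = 150)
toy_train <- toy[train_idx, ]
toy_test  <- toy[-train_idx, ]

# Task 1: bivariate measures of association with the dependent variable.
# Pearson for a continuous predictor, Spearman for an ordinal one.
assoc_area    <- cor(toy_train$living_area, toy_train$sale_price,
                     method = "pearson")
assoc_quality <- cor(toy_train$quality, toy_train$sale_price,
                     method = "spearman")

# Task 2: a linear regression model on the training data.
fit <- lm(sale_price ~ living_area + quality, data = toy_train)
print(summary(fit))

# Task 3: predict on the held-out data and measure accuracy.
pred <- predict(fit, newdata = toy_test)
rmse <- sqrt(mean((toy_test$sale_price - pred)^2))  # root mean squared error
mae  <- mean(abs(toy_test$sale_price - pred))       # mean absolute error
print(c(RMSE = rmse, MAE = mae))
```

With the real data, the same pattern would apply with `toy_train` and `toy_test` swapped for the supplied training and test sets, and the formula extended to the five chosen independent variables.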