This project is used for forecasting the ROI of new home clusters that will be constructed in the next year in Melbourne by taking past data such as reviews, prices, etc.
The details of this project could be accessed through this document: Problem Identification.pdf
Cleaned Melbourne Housing Market
This project is will implementing Cross Industry Standard Process for Data Mining (CRISP - DM), which includes:
- Business Understanding
- Data Understanding
- Data Preparation
- Modeling
- Evaluation
- Deployment
However, we wil separate it into several phases as follows:
- Data Cleaning
- Data Reduction
We could say that the data is not well-suited for regression analysis, and this issue indicates that the data is not linear. You could see the result of this dataset analysis in here.
While using the Neural Network, we obtain better results in its accuracy by looking at its MSE, MAE, and RMSE. However, there's something unique in this Regression Analysis: the R-Squared. Below is the summary result of both models:
Performance Measure | Multiple Linear Regression | Neural Network |
---|---|---|
R-Squared | 0.5926448818238725 | 0.5237713739630026 |
MSE | 1.2604419499246601e+26 | 84018888704.0 |
MAE | 579344653273.0378 | 192391.421875 |
RMSE | 11226940589157.227 | 289860.11920234904 |
Thus, in short the best model for this data is a non-linear model.