Overview/Goal

This project focuses on predicting apartment prices and visualizing data related to apartment listings. It combines various datasets to create predictive models and interactive visualizations using geographic and demographic information. Models tested include custom random forest, XGBoost, and H2O autoML. The final dashboard is built in JavaScript using D3.js.

Data Sources

Key Files

  • dataJoining.py: Joins datasets, generates zip codes, and integrates IRS data.
  • dataFeatureEng.py: Cleans data and creates dummy variables for analysis.
  • modelCreation.ipynb: Builds predictive models including random forest and XGBoost.
  • aptVis.html: Interactive visualization tool using D3.js for predicting apartment prices.

How to Use

  1. Clone the repository and set up the required dependencies.
  2. Run dataJoining.py to combine datasets and generate zip codes.
  3. Execute dataFeatureEng.py to clean data and create dummy variables.
  4. Open and run modelCreation.ipynb to build predictive models and save predicted prices.
  5. Launch aptVis.html in a web browser to interact with the visualization tool.

Screenshot