This project utilises open data from Data.gov.sg to build several Machine Learning (ML) models that help predict HDB Resale Prices.

sienlonglim/ml_webapp

Machine Learning Web App 🏠

This project utilises open data from Data.gov.sg to build several Machine Learning (ML) models that help predict HDB Resale Prices. Live project

The main focus of this project is to complete a full cycle of:

  • Extract Transform Load (ETL)
  • ML Model building
  • Deployment
  • Live dashboarding

The project uses a large dataset (>40k data points), including geodata, that covers all Singapore HDB resale prices for 2022 and 2023.

The following steps were taken in the project (all steps can be found in the Jupyter Notebook `.ipynb` files):

  1. Data extraction through REST API calls to Data.gov.sg, followed by data wrangling
  2. Feature creation and selection (K-best selection on mutual information, L1 regularisation)
  3. Hyperparameter tuning (randomised search with cross-validation)
  4. Model selection and testing of non-ensemble and ensemble models (gradient boosting, random forest)
  5. Front-end web application development (Flask) with Bootstrap 5
  6. Dashboarding (Tableau & Streamlit)
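
As a rough illustration of steps 2 to 4, the sketch below chains K-best feature selection on mutual information with a randomised, cross-validated hyperparameter search over a random forest, using scikit-learn. It is not the code from the notebooks: the synthetic data, the parameter grid, and every variable name are assumptions made for the example.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_selection import SelectKBest, mutual_info_regression
from sklearn.model_selection import RandomizedSearchCV, train_test_split
from sklearn.pipeline import Pipeline

# Synthetic stand-in for the wrangled resale data (assumption: NOT the real dataset).
X, y = make_regression(
    n_samples=300, n_features=10, n_informative=4, noise=10.0, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Feature selection and model in one pipeline, so selection is re-fitted
# inside every cross-validation fold and does not see the validation data.
pipeline = Pipeline([
    ("select", SelectKBest(score_func=mutual_info_regression, k=5)),
    ("model", RandomForestRegressor(random_state=0)),
])

# Hypothetical search space; the values are illustrative only.
param_distributions = {
    "select__k": [3, 5, 8],
    "model__n_estimators": [20, 50, 100],
    "model__max_depth": [None, 5, 10],
}

# Randomised search: samples n_iter combinations and scores each with 3-fold CV.
search = RandomizedSearchCV(
    pipeline,
    param_distributions=param_distributions,
    n_iter=5,
    cv=3,
    scoring="r2",
    random_state=0,
)
search.fit(X_train, y_train)

# Evaluate the refitted best pipeline on the held-out test split.
test_r2 = search.score(X_test, y_test)
selected = search.best_estimator_.named_steps["select"].get_support()
print("Best parameters:", search.best_params_)
print("Held-out R^2:", round(test_r2, 3))
print("Features kept:", int(selected.sum()))
```

Swapping `RandomForestRegressor` for `GradientBoostingRegressor` (with a matching search space) and comparing the held-out scores is one way to carry out the model comparison in step 4.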

Updates: 20240306

  • Completed refactoring of code into modules
  • Model and other objects are now loaded once at app startup, instead of on every prediction
  • Rounded up predictions
  • Configured logging for individual modules
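
The load-once and per-module logging changes above can be sketched with the standard library alone. This is a minimal illustration, not the app's actual code: `load_artifact`, `predict`, and the dictionary standing in for a trained model are all hypothetical, and the Flask wiring is left out.

```python
import logging
import pickle
import tempfile
from functools import lru_cache
from pathlib import Path

# One logger per module, named after the module, so that log levels and
# handlers can be configured for each module separately.
logger = logging.getLogger(__name__)


@lru_cache(maxsize=None)
def load_artifact(path: str):
    """Unpickle an object from disk; cached, so each path is read only once."""
    logger.info("Loading artifact from %s", path)
    with open(path, "rb") as fh:
        return pickle.load(fh)


def predict(path: str, features: list) -> float:
    """Hypothetical prediction using a toy linear 'model' stored as a dict."""
    model = load_artifact(path)  # cache hit on every call after the first
    return sum(w * x for w, x in zip(model["weights"], features)) + model["bias"]


logging.basicConfig(level=logging.INFO)

with tempfile.TemporaryDirectory() as tmp:
    # Write a stand-in model file (assumption: the real app stores a trained model).
    model_path = str(Path(tmp) / "model.pkl")
    with open(model_path, "wb") as fh:
        pickle.dump({"weights": [2.0, 3.0], "bias": 1.0}, fh)

    first = predict(model_path, [1.0, 1.0])
    second = predict(model_path, [2.0, 0.0])

# The file was read once even though two predictions were made.
print(first, second, load_artifact.cache_info())
```

In a web app the same effect is often achieved by loading the objects at module import time or in the application factory; the cache shown here is just one convenient way to guarantee a single read.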
