Second-Hand Cars in the PH: A Buyer's Guide is a capstone project presented for FTW Foundation Data Science Program that predicts the prices of second-hand/used cars in the Philippines. This is created by Elyse Go, Nicole Lumagui, Bernadette Misa and Jero Santos, and sponsored by Carlove.
See our model in action here: http://nicolelumagui.pythonanywhere.com/
(This site will be disabled on Wednesday 26 February 2020)
Traffic. Congestion. Carmaggedon. Metro Manila is one of the densest cities. What is one of the government’s way of solving this? The TRAIN Law. Specifically, Auto Tax Reform. Modeled after Singapore, TRAIN Law is put in place to limit “new” cars on the road. Because of this, people are discouraged to buy brand new cars to help alleviate traffic. And because of this, a window of opportunity opens: an increase in market activity for second-hand cars.
Buying used cars has many advantages:
- More savings
- Cheaper insurance cost
- Slower depreciation
- Extended warranty
- Good for the environment
But there are also risks:
- Unknown reliability or treatment
- More frequent maintenance
- Hard to find an exact match of what you want
- Untouched warranty
- Lemon Car / Overpriced Car
- Scraped data from the websites Carmudi, Philkotse, Priceprice Auto and AutoSearch Manila
- Joined the scraped data sets and cleaned the data
- Used K-Nearest Neighbors to impute for mileage
- Exploratory Data Analysis
- Tried Decision Trees to get the first glance on feature importance
- Used Random Forest Regressor then XGBoost to predict the price of second-hand cars
- Created Web App using Django and applying the pickled Random Forest Regressor model on backend
The data used for this project are car listings scraped from Carmudi and Philkotse. While, the retail price of the cars are scraped from Priceprice Auto and AutoSearch Manila.
- Age of car
- Retail Price of Car
- Mileage of Car in km
- Car Brand/Make - Toyota, Honda, Hyundai, Ford, etc...
- Car Model - Civic, Adventure, Ranger, Vios, etc...
- Car Body Type - Saloon/Sedan, Hatchbak/Wagon, SUV, MPV/AUV, etc...
- Fuel Type of Car - Diesel, Gasoline, Electric
- Transmission Type of Car - Automanual, Automatic, CVT, Manual, Shiftable Automatic
- Car's Color
- Seller Type - Individual/Private Owner or Dealer
- Seller's Location/City
- Age of Post/Listing in Days
Model | Cross-Val Score |
---|---|
XGBoost | 79.63% |
Random Forest Regressor | 80% |
- Add more data from other marketplaces and banks, and compile a dataset with balanced distribution of car brands and models.
- Improve user interface of web app.
- More fine tuning of the model.
- Add more features, such as number of doors.
- https://www2.bc.edu/thomas-chemmanur/phdfincorp/MF891%20papers/Ackerlof%201970.pdf
- https://www.business-standard.com/article/news-cd/5-reasons-why-you-should-buy-a-used-car-116051000836_1.html
- https://www.investopedia.com/articles/pf/07/neworusedcar.asp
- https://www.carmudi.com.ph/journal/pros-and-cons-of-buying-a-used-or-new-car/
-
Elyse Go (@fur-elyse)
[email protected] -
Nicole Lumagui (@nicolelumagui)
[email protected] -
Bernadette Misa (@bernablues)
[email protected] -
Jero Santos (@jerosantos)
[email protected]