- Food 101
- Yelp Dataset
One existing issue in the dataset is that in yelp, there is no cleaned data. The solution we came up is to use simply NLP to classify the food.
In other words, if the comments have a matching food crictical word, we will use this to label the food.
If the critical word appears more than 1 times, the food will be falled into multiple category.
The original label came from another problem(human emotional classification). However, that model is not a good one for our question due to overfit. So we have took way to optimze the model.
Fine tune Using Incerception v3
The environment is configured using Google colab
- Compare the dataset we generated manually with the 'good' dataset the is already lablled
- Compare the model for different CV problem, and took ways to reduce overfitting.