Skip to content

Understand the working of Recommendation system. The end goal was to predict what rating a user would give for a particular movie.

Notifications You must be signed in to change notification settings

o-Senpai-o/Recommendation-system-for-Movies

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Recommendation-system-for-Movies

Understand the working of Recommendation system. The end goal was to predict what rating a user would give for a particular movie.

What is a Recommendation System?

  • A recommendation system generates a compiled list of items in which a user might be interested, in the reciprocity of their current selection of item(s). It expands users’ suggestions without any disturbance or monotony, and it does not recommend items that the user already knows.

  • For instance, the Netflix recommendation system offers recommendations by matching and searching similar users' habits and suggesting movies that share characteristics with films that users have rated highly.

  • The recommendation system workflow shown in the diagram above shows the user’s collaboration regarding the ratings of different movies or shows. New users get their recommendations based on the recommendations of existing users

problem Statement

  • Netflix provided a lot of anonymous rating data, and a prediction accuracy bar that is 10% better than what Cinematch can do on the same training data set. (Accuracy is a measurement of how closely predicted ratings of movies match subsequent actual ratings.)

Real world/Business Objectives and constraints

  • Predict the rating that a user would give to a movie that he ahs not yet rated.
  • Minimize the difference between predicted and actual rating (RMSE and MAPE)

Machine Learning Problem

  • given user, movie predict what rating this use would give to this movie
  • This boils down to a Regression problem
  • ![netflix](images/netflix.png

Data Overview

  • Source : https://www.kaggle.com/netflix-inc/netflix-prize-data/data
  • Overview
    • combined_data_1.txt

    • combined_data_2.txt

    • combined_data_3.txt

    • combined_data_4.txt

    • movie_titles.csv

    • CustomerID,Rating,Date


    • MovieIDs range from 1 to 17770 sequentially.
    • CustomerIDs range from 1 to 2649429, with gaps. There are 480189 users.
    • Ratings are on a five star (integral) scale from 1 to 5.
    • Dates have the format YYYY-MM-DD.

Features

features

Modelling

  • based on item-item similarity we got similar movies
  • Below is a list of similar movies for a movie with same genre (DRACULA) movie
  • model tree

    models
  • we use Surprise models output as features for XGBoost regressor

References

About

Understand the working of Recommendation system. The end goal was to predict what rating a user would give for a particular movie.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published