Skip to content

Built a generalized recommender based on popularity and genre using full data set. Also created content based recommender based on user's taste and movie description.

Notifications You must be signed in to change notification settings

ruccii/Movie-Recommendation-system-clone

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Movies-recommendation-system-clone

  • This Project repository is based on Building a movies recommendation system clone

Dataset Details:

The Dataset used for building this recommendation engine is mentioned as below:

  • Dataset used : MovieLens dataset
  • Download Dataset : Download Dataset from these following links
    • Download MovieLens dataset hosted on Kaggle then use kaggle link
    • Download MovieLens dataset from its official website then use GroupLens link
  • Dataset File Format : CSV File (Comma-separated values). NOTE: Download and save dataset inside input_data folder
  • Types of dataset :
    • The full dataset : This dataset consists of 26,000,000 ratings and 750,000 tag applications applied to 45,000 movies by 270,000 users. Includes tag genome data with 12 million relevance scores across 1,100 tags.
      • NOTE: We will build a simple Recommendation for movies using The full dataset.
    • The small dataset : This dataset comprises of 100,000 ratings and 1,300 tag applications applied to 9,000 movies by 700 users.
      • NOTE: All personalised recommender systems will make use of the small dataset (due to the limited computing power of our system).
  • Data description : It contains 100004 ratings and 1296 tag applications across 9125 movies. These data were created by 671 users between January 09, 1995 and October 16, 2016. This dataset was generated on October 17, 2016.Users were selected at random for inclusion. All selected users had rated at least 20 movies. No demographic information is included. Each user is represented by an id, and no other information is provided.
  • Data Files Content :
    • credits.csv
    • keywords.csv
    • links.csv
    • links_small.csv
    • movies_metadata.csv
    • ratings.csv
    • ratings_small.csv
  • List of other dataset available :
    • MovieLens - Movie Recommendation Data Sets click link
    • Netflix Prize Dataset click link
    • Yahoo! - Movie, Music, and Images Ratings Data Sets click link
    • Cornell University - Movie-review data for use in sentiment-analysis experiments click link
    • MovieTweetings - click link

Dependencies Details:

  • Python >=3.5
  • pandas
  • numpy
  • scipy
  • scikit-learn
  • scikit-surprise
  • matplotlib
  • seaborn
  • jupyter notebook
  • jupyter lab
  • textblob

Install dependencies :

Windows OS :

  • Install Python3 (install python 3.6.4)
    • Step 1: Download python form this link
    • Step 2: Refer this link or this link in order to install python to your system.
  • Install anaconda
    • Step 1: Download Anaconda 5.1 (python 3.6 version) using this link
    • Step 2: See the installation instruction given on this link
    • Note: If you have any other version of python then install anaconda which supports that particular version of python.
  • Install dependencies using conda
    • nltk: In-built installed with anaconda
    • numpy: In-built installed with anaconda
    • scipy: In-built installed with anaconda
    • scikit-learn: In-built installed with anaconda
    • scikit-surprise: $ conda install -c conda-forge scikit-surprise
    • Pandas: In-built installed with anaconda
    • matplotlib: In-built installed with anaconda
    • seaborn: In-built installed with anaconda
    • jupyter notebook: In-built installed with anaconda
    • jupyter lab: In-built installed with anaconda
    • textblob: $ conda install -c conda-forge textblob

Issues with installing surprise package :

Install Pycharm IDE:

  • Step 1: Download pycharm IDE community edition by using this link
  • Step 2: Install .exe file.

Code credit:

About

Built a generalized recommender based on popularity and genre using full data set. Also created content based recommender based on user's taste and movie description.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published