Definition: A subclass of information filtering systems
The goal is to predict the rating or preference that a user would give to an item (e.g. a movie) or social element (e.g. people) they have not yet considered.
- Mapping Function: $f: U \times I \rightarrow R$
    - Input:
        - User Model ($U$)
        - Item ($I$)
    - Calculate:
        - Relativity ($R$) - used for sorting
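As a minimal sketch (the function and type names are illustrative, not from the source), the mapping function can be realized as a scoring callable over (user, item) pairs whose output $R$ is used to sort candidate items:

```python
from typing import Callable, List

# f: U x I -> R, realized as a scoring function over (user, item) pairs.
Score = Callable[[str, str], float]  # (user_id, item_id) -> relativity score

def recommend(user_id: str, candidates: List[str], score: Score, top_n: int = 10) -> List[str]:
    """Rank candidate items for a user by the relativity score and keep the top-N."""
    return sorted(candidates, key=lambda item_id: score(user_id, item_id), reverse=True)[:top_n]
```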
Search: fulfilling users' active needs
- the user knows what they want
- the user knows how to describe it
Recommend: mining and fulfilling users' potential needs
- the user doesn't know where to find it
- the user doesn't know how to describe it
- Prediction perspective
- Interaction perspective
- Conversion perspective
Power laws / long-tailed distribution in the statistical sense
customized recommendation / personalization
- Collaborative Filtering (協同過濾) - based on the users' social environment
- Item-based
- User-based
- Content-based Recommendation
- analyzes the nature of each item (characteristics of items)
- Knowledge-based Recommendation
The process of filtering for information using techniques involving collaboration among multiple agents, viewpoints, etc.
It works by taking a dataset of one user's data and comparing it to the data of other users.
The key idea behind CF is that similar users share the same interests and that a user likes similar items.
Basic assumption: users who shared common interests in the past will still prefer similar products/items in the future (those who agreed in the past tend to agree again in the future).
Item-based or user-based similarity?
Comparing the distance between items is known as item-based similarity.
Comparing the distance between users is known as user-based similarity.
The choice depends on how many users and how many items you have
(if you have a lot of users, you'll probably want to go with item-based similarity).
Item-based: measure the similarity between the items the target user rates/interacts with and other items.
User-based: measure the similarity between the target user and other users.
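A minimal NumPy sketch contrasting the two options on a toy rating matrix (the matrix values and shapes are illustrative assumptions):

```python
import numpy as np

# Toy user-item rating matrix (rows = users, columns = items); values are illustrative.
R = np.array([
    [5, 3, 0, 1],
    [4, 0, 0, 1],
    [1, 1, 0, 5],
    [0, 0, 5, 4],
], dtype=float)


def cosine_sim(M: np.ndarray) -> np.ndarray:
    """Pairwise cosine similarity between the rows of M."""
    norms = np.linalg.norm(M, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # guard against all-zero rows
    X = M / norms
    return X @ X.T


user_sim = cosine_sim(R)    # user-based: compare rows (users) of the rating matrix
item_sim = cosine_sim(R.T)  # item-based: compare columns (items) of the rating matrix

print(user_sim.round(2))
print(item_sim.round(2))
```

With many users and relatively few items, the item-item similarity matrix is smaller and more stable, which is why item-based similarity is usually preferred in that setting.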
Evaluation (of association rules)
- Confidence of A to B (fraction of transactions containing A that also contain B) $$ \operatorname{confidence}(A\Rightarrow B) = P(B|A) $$
- Support of A to B (fraction of transactions containing all items of both A and B) $$ \operatorname{support}(A\Rightarrow B) = P(A \cup B) $$
Example:
- bread, milk
- bread, diaper, beer, eggs
- milk, diaper, beer, coke
- bread, milk, diaper, beer
- bread, milk, diaper, coke
e.g. {Milk, Diaper} => Beer
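Plugging the five transactions above into these definitions for the rule {Milk, Diaper} => Beer: {Milk, Diaper} appears in 3 of the 5 transactions, and {Milk, Diaper, Beer} in 2 of them, so

$$ \operatorname{support}(\{Milk, Diaper\}\Rightarrow Beer) = \frac{2}{5} = 0.4, \qquad \operatorname{confidence}(\{Milk, Diaper\}\Rightarrow Beer) = \frac{2}{3} \approx 0.67 $$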
Approximate the full matrix by observing only the most important features, i.e. those with the **largest singular values**.
Latent Factor Model
Objective Function: minimize squared error
Probabilistic Matrix Factorization (PMF)
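A common written-out form of the squared-error objective (the L2 regularization term with weight $\lambda$ is the usual addition, not stated explicitly above) is

$$ \min_{P, Q} \sum_{(u,i)\,\text{observed}} \left( r_{ui} - p_u^{\top} q_i \right)^2 + \lambda \left( \lVert p_u \rVert^2 + \lVert q_i \rVert^2 \right) $$

A minimal SGD sketch of this model follows; the toy ratings and hyperparameters are illustrative assumptions:

```python
import numpy as np

# Toy observed ratings as (user_index, item_index, rating); values are illustrative.
ratings = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 1.0), (2, 1, 2.0), (2, 2, 5.0)]
n_users, n_items, k = 3, 3, 2        # k = number of latent factors (assumed)
lr, reg, epochs = 0.01, 0.1, 200     # assumed hyperparameters

rng = np.random.default_rng(0)
P = rng.normal(scale=0.1, size=(n_users, k))  # user latent factors
Q = rng.normal(scale=0.1, size=(n_items, k))  # item latent factors

# SGD on the regularized squared error of the observed entries.
for _ in range(epochs):
    for u, i, r in ratings:
        err = r - P[u] @ Q[i]
        p_u = P[u].copy()                      # keep the old value for the item update
        P[u] += lr * (err * Q[i] - reg * P[u])
        Q[i] += lr * (err * p_u - reg * Q[i])

# Predict an unobserved entry as the dot product of the learned factors.
print("predicted rating for (user 0, item 2):", float(P[0] @ Q[2]))
```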
Limitations of Collaborative Filtering:
- cold start
- data sparsity
- popularity bias
Solution => Content-based Recommendation
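A minimal sketch of that content-based remedy: build item vectors from the items' characteristics (here TF-IDF over short item descriptions) and recommend items similar to what the target user already liked. The descriptions and the liked set are illustrative assumptions:

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Illustrative item descriptions (the "characteristics of items" a content-based RS analyzes).
items = {
    "m1": "space opera science fiction adventure",
    "m2": "romantic comedy set in paris",
    "m3": "science fiction thriller about artificial intelligence",
    "m4": "historical drama romance",
}
liked = ["m1"]  # items the target user has interacted with (assumed history)

ids = list(items)
X = TfidfVectorizer().fit_transform([items[i] for i in ids])  # TF-IDF item vectors

# User profile = mean vector of the liked items; rank unseen items by similarity to it.
profile = np.asarray(X[[ids.index(i) for i in liked]].mean(axis=0))
scores = cosine_similarity(profile, X).ravel()
ranking = [i for i in sorted(ids, key=lambda j: scores[ids.index(j)], reverse=True) if i not in liked]
print(ranking)  # 'm3' comes first: it shares "science fiction" terms with the liked item
```

Because only item content and the target user's own history are needed, new items can be scored immediately, which is how this approach addresses the cold-start and sparsity issues above.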
Evaluation Experiment
- Offline experiments - based on historical data
- prediction accuracy, coverage
- Laboratory studies - Controlled experiments
- e.g. questionnaires (surveys)
- Test with real users - A/B tests
- e.g. sales increase, click through rates
Rating Prediction (Regression Evaluation)
- Mean Absolute Error (MAE)
- Mean Squared Error (MSE)
- Root Mean Squared Error (RMSE)
- Normalized Mean Absolute Error (NMAE)
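For reference, with $\hat{r}_{ui}$ the predicted and $r_{ui}$ the true rating over $N$ test ratings (NMAE shown with the common normalization by the rating range):

$$ \text{MAE} = \frac{1}{N}\sum_{(u,i)} \lvert \hat{r}_{ui} - r_{ui} \rvert \qquad \text{MSE} = \frac{1}{N}\sum_{(u,i)} \left( \hat{r}_{ui} - r_{ui} \right)^2 $$

$$ \text{RMSE} = \sqrt{\text{MSE}} \qquad \text{NMAE} = \frac{\text{MAE}}{r_{\max} - r_{\min}} $$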
Top-N Prediction (Classification Evaluation)
- Precision
- Recall
- Accuracy
- F1-score
- AUC (Area Under Curve)
- ROC Curve (Receiver Operating Characteristic Curve) - plots the true positive rate (sensitivity) against the false positive rate
- Average Precision (AP)
- Mean Average Precision (MAP)
- Precision@N
- e.g. P@5, P@10, P@20
- HR@N (Hit Rate)
- e.g. HR@1, HR@5, HR@10
- Cumulative Gain (CG)
- Discounted Cumulative Gain (DCG)
- normalized Discounted Cumulative Gain (nDCG)
- Ideal DCG (IDCG)
- Intra-List Similarity (ILS)
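A small Python sketch of a few of these ranking metrics under binary relevance; the recommendation list and ground-truth set are illustrative, and DCG uses the common $1/\log_2(\text{position}+1)$ discount:

```python
import numpy as np

def precision_at_n(recommended, relevant, n):
    """P@N: fraction of the top-n recommended items that are relevant."""
    return len(set(recommended[:n]) & set(relevant)) / n

def hit_rate_at_n(recommended, relevant, n):
    """HR@N: 1 if at least one relevant item appears in the top-n, else 0."""
    return int(bool(set(recommended[:n]) & set(relevant)))

def dcg_at_n(recommended, relevant, n):
    """Binary-relevance DCG; rank is 0-based, so rank + 2 == position + 1."""
    return sum(1.0 / np.log2(rank + 2)
               for rank, item in enumerate(recommended[:n]) if item in relevant)

def ndcg_at_n(recommended, relevant, n):
    """DCG normalized by the ideal DCG (IDCG) of a perfect ranking."""
    idcg = sum(1.0 / np.log2(rank + 2) for rank in range(min(len(relevant), n)))
    return dcg_at_n(recommended, relevant, n) / idcg if idcg > 0 else 0.0

# Illustrative example: a ranked recommendation list and the user's true positives.
recs, truth = ["i3", "i7", "i1", "i9", "i4"], {"i1", "i4", "i8"}
print(precision_at_n(recs, truth, 5), hit_rate_at_n(recs, truth, 5), round(ndcg_at_n(recs, truth, 5), 3))
```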
Datasets
- MovieLens
- Netflix
- Book-Crossing
- Jester Joke
- Epinions
- Yelp
- BibSonomy
- Foursquare
- Flixster
- facebookresearch/dlrm - Deep Learning Recommendation Model for Personalization and Recommendation Systems
Recommender Systems - The Textbook
- Ch3 Model-Based Collaborative Filtering
- Ch4 Content-Based Recommender System
- Ch5 Knowledge-Based Recommender System
- A Glimpse into Deep Learning for Recommender Systems
- Machine Learning for Recommender systems
- Introduction to Recommender System.