ProtoMF: Prototype-based Matrix Factorization for Effective and Explainable Recommendations

This repository hosts the code and the additional materials for the paper "ProtoMF: Prototype-based Matrix Factorization for Effective and Explainable Recommendations" by Alessandro B. Melchiorre, Navid Rekabsaz, Christian Ganhör, and Markus Schedl at RecSys 2022.

@inproceedings{melchiorre2022protomf,
    title = {ProtoMF: Prototype-based Matrix Factorization for Effective and Explainable Recommendations},
    author = {Melchiorre, Alessandro B. and Rekabsaz, Navid and Ganhör, Christian and Schedl, Markus},
    booktitle = {Sixteenth ACM Conference on Recommender Systems},
    year = {2022},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    series = {RecSys '22},
    doi = {10.1145/3523227.3546756},
    isbn = {978-1-4503-9278-5/22/09}
}

Repository Structure

The code is written in Python and relies on Pytorch to compute the model's gradients. We further use ray tune for hyperparameter optimization and weight and biases for logging, among many cool packages.

Code is structured in the following way and briefly described below:

.
├── confs
│   └── hyper_params.py
├── data
│   ├── amazon2014
│   │   └── amazon2014_splitter.py
│   ├── lfm2b-1mon
│   │   └── lfm2b-2020_splitter.py
│   ├── ml-1m
│   │   └── movielens_splitter.py
│   └── README.md
├── experiment_helper.py
├── feature_extraction
│   ├── feature_extractor_factories.py
│   └── feature_extractors.py
├── pdfs and images
│   ├── protomf_appendix.pdf
│   ├── protomf_diagram.png
│   └── ProtoMF__Prototype_based_Matrix_Factorization.pdf
├── protomf.yml
├── README.md
├── rec_sys
│   ├── protomf_dataset.py
│   ├── rec_sys.py
│   ├── tester.py
│   └── trainer.py
├── start.py
└── utilities
    ├── consts.py
    ├── eval.py
    └── utils.py

where the files/directories contain:

protomf.yml: environment (install with conda env create -f protomf.yml)
start.py: starting point to run the experiments (check with python start.py --help)
experiment_helper.py: hyperparameter optimization procedure
confs/hyper_params.py: hyperparameters of all the models
data/*: data folder and data splitting procedure
feature_extraction/*: code of the models
rec_sys/protomf_dataset.py: general code for handling the dataset (including negative sampling)
rec_sys/rec_sys.py: a general recommender system, used as base for all models
rec_sys/tester.py and rec_sys/trainer.py: testing and training procedure respectively
utilities/*: constants, evaluation metrics code, generic code

Installation and configuration

Environment

Install the environment with conda env create -f protomf.yml
Activate the environment with conda activate protomf

Data

Follow the instructions to download all three datasets in ./data/README.md. Place the datasets files in the right folders and use the respective splitter.py files to pre-process the datasets. To pre-process the data, generally you just need to

move into the folder with cd <dataset_folder>
run python <dataset_name>_splitter.py -lh <folder_where_the_data_is> (usually the data is already in the folder so ./ will suffice). Leave the saving path to ./.

After the scripts finish running you will have 5 files:

3 files with the listening_history of the users for train, val, and test
2 files containing the user and item ids (as index in the rating matrix)

Configuration

In utilities/consts.py you need to set:

DATA_PATH: absolute path to the ./data folder
WANDB_API_KEY: API key of Weight and Biases, currently the results are only printed there.

Run

The experiments can be started with

python start.py -m <model> -d <dataset>

For checking the options for model and dataset, use

python start.py --help

By default, python start.py runs the hyperparameter optimization for a single seed (check utilities/consts.py). If the flag -mp is provided together with the model and dataset, then the script will run all three hyperparameter optimization procedures and average the test results.

Results and progress can be checked on the Weight&Biases webpage.

License

The code in this repository is licensed under the Apache 2.0 License. For details, please see the LICENSE file.

Acknowledgments

This work received financial support by the Austrian Science Fund (FWF): P33526 and DFH-23; and by the State of Upper Austria and the Federal Ministry of Education, Science, and Research, through grant LIT-2020-9-SEE-113 and LIT-2021-YOU-215.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ProtoMF: Prototype-based Matrix Factorization for Effective and Explainable Recommendations

Repository Structure

Installation and configuration

Environment

Data

Configuration

Run

License

Acknowledgments

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
confs		confs
data		data
feature_extraction		feature_extraction
pdfs_and_images		pdfs_and_images
rec_sys		rec_sys
utilities		utilities
LICENSE		LICENSE
README.md		README.md
experiment_helper.py		experiment_helper.py
protomf.yml		protomf.yml
start.py		start.py

License

karapostK/ProtoMF

Folders and files

Latest commit

History

Repository files navigation

ProtoMF: Prototype-based Matrix Factorization for Effective and Explainable Recommendations

Repository Structure

Installation and configuration

Environment

Data

Configuration

Run

License

Acknowledgments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages