MultiMod-Popularity-Bias

This is the official implementation of the paper On Popularity Bias of Multimodal-aware Recommender Systems: a Modalities-driven Analysis, accepted as full paper at MMIR’23@ACM Multimedia.

This repository is heavily dependent on the framework Elliot, so we suggest you refer to the official GitHub page and documentation.

Pre-requisites

We implemented and tested our models in PyTorch==1.12.0, with CUDA 10.2 and cuDNN 8.0. Additionally, some of graph-based models require PyTorch Geometric, which is compatible with the versions of CUDA and PyTorch we indicated above.

Installation

Local

If you have the possibility to install CUDA on your workstation (i.e., 10.2), you may create the virtual environment with the requirement files we included in the repository, as follows:

# PYTORCH ENVIRONMENT (CUDA 10.2, cuDNN 8.0)

python3.8 -m venv venv
source venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
pip install -r requirements_torch_geometric.txt

Docker

A more convenient way of running experiments is to instantiate a docker container having CUDA 10.2 already installed. Make sure you have Docker and NVIDIA Container Toolkit installed on your machine (you may refer to this guide). Then, you may use the following Docker image to instantiate the container equipped with CUDA 10.2 and cuDNN 8.0: link.

Datasets

We provide already processed and split datasets. You can download them from the links below, and then unzip them within the ./data/ folder:

Each dataset folder comes with the following files/subfolders:

data/
  <dataset_name>/
    image_feat/
      0.npy
      1.npy
      ...
    text_feat/
      0.npy
      1.npy
      ...
    train.txt
    test.txt
    val.txt

where <dataset_name> is one of office, toys, clothing.

Hyperparameters

You may find the complete configuration file for each dataset at:

config_files/
  <dataset_name>_<modality>.yml

where <dataset_name> is one of office, toys, clothing, while <modality> is one of multimodal, visual, textual.

Please, set the gpu field at your convenience (e.g., -1 is the default on the CPU).

Training and evaluation

To train all baselines on a specific dataset, run the following command:

python3.8 start_experiments.py --dataset <dataset_name> --modalities <modality>

where <dataset_name> is one of office, toys, clothing, while <modality> is one of multimodal, visual, textual.

Please consider that this may take some time depending on the machine you are running the experiments on.

Once the training has ended for all models, you may find the tsv with all results at:

results/
  <dataset_name>/
    performance/
      rec_cutoff_<cutoff>_relthreshold_0_<datetime>.tsv

where <dataset_name> is one of office, toys, clothing, <cutoff> is one of 10, 20, 50, and <datetime> depends on the date and time the results tsv file was created.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
config_files		config_files
data		data
elliot		elliot
external/models		external/models
README.md		README.md
requirements.txt		requirements.txt
requirements_torch_geometric.txt		requirements_torch_geometric.txt
start_experiments.py		start_experiments.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MultiMod-Popularity-Bias

Table of Contents

Pre-requisites

Installation

Local

Docker

Datasets

Hyperparameters

Training and evaluation

About

Releases

Packages

Languages

sisinflab/MultiMod-Popularity-Bias

Folders and files

Latest commit

History

Repository files navigation

MultiMod-Popularity-Bias

Table of Contents

Pre-requisites

Installation

Local

Docker

Datasets

Hyperparameters

Training and evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages