Popular repositories Loading
-
meshed-memory-transformer
meshed-memory-transformer PublicMeshed-Memory Transformer for Image Captioning. CVPR 2020
-
dress-code
dress-code PublicDress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
-
multimodal-garment-designer
multimodal-garment-designer PublicThis is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
-
show-control-and-tell
show-control-and-tell PublicShow, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
-
novelty-detection
novelty-detection PublicLatent space autoregression for novelty detection.
Repositories
- awesome-human-visual-attention Public
This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.
aimagelab/awesome-human-visual-attention’s past year of commit activity - ReflectiVA Public
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
aimagelab/ReflectiVA’s past year of commit activity