Add support for LLaVA model #482
Comments
Hi @youssefadr, thanks for opening this issue. LLaVA is definitely something we're interested in adding, and we would be happy to have you contribute. Is there a specific portion of the model you're especially interested in helping out with?
Thanks for your answer @ebsmothers! I would like to add the model to the library.
That sounds reasonable to me. We already have CLIP visual encoders in the library here, so feel free to reuse those. Then the bulk of the work for the model should be to add the LLM. A couple of pointers to help with that: TransformerDecoderLayer and RMSNorm. We also have an open PR for rotary positional embeddings (#450) that might be useful. Let me know if this makes sense; happy to provide more details as needed.
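For context, a LLaVA-style model is essentially three pieces composed in sequence: a vision encoder producing patch embeddings, a projection module mapping those into the LLM's embedding space, and a transformer decoder LLM. Below is a minimal PyTorch sketch of that composition. The class names (`MultimodalProjector`, `LlavaStyleModel`), constructor signatures, and the `nn.Identity` stand-ins are illustrative assumptions, not the library's actual API; in a real implementation the encoder would be the CLIP component and the decoder would be built from the TransformerDecoderLayer/RMSNorm pieces mentioned above.

```python
import torch
import torch.nn as nn


class MultimodalProjector(nn.Module):
    """Maps vision-encoder patch embeddings into the LLM's embedding space.

    The original LLaVA used a single linear projection; LLaVA-1.5 uses a
    small two-layer MLP, which is what this sketch follows.
    """

    def __init__(self, vision_dim: int, llm_dim: int):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(vision_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, patch_embeddings: torch.Tensor) -> torch.Tensor:
        return self.proj(patch_embeddings)


class LlavaStyleModel(nn.Module):
    """Composes a vision encoder, a projector, and an LLM decoder.

    `vision_encoder` and `llm` are placeholders for the library's CLIP
    encoder and a TransformerDecoderLayer-based LLM, respectively.
    """

    def __init__(self, vision_encoder: nn.Module, llm: nn.Module,
                 token_embedding: nn.Module, vision_dim: int, llm_dim: int):
        super().__init__()
        self.vision_encoder = vision_encoder
        self.projector = MultimodalProjector(vision_dim, llm_dim)
        self.token_embedding = token_embedding
        self.llm = llm

    def forward(self, images: torch.Tensor, input_ids: torch.Tensor) -> torch.Tensor:
        # [batch, num_patches, vision_dim] -> [batch, num_patches, llm_dim]
        image_tokens = self.projector(self.vision_encoder(images))
        text_tokens = self.token_embedding(input_ids)
        # Prepend image tokens to the text sequence, as in LLaVA.
        hidden = torch.cat([image_tokens, text_tokens], dim=1)
        return self.llm(hidden)


# Smoke test with stand-in components (purely illustrative).
vision_dim, llm_dim, vocab = 512, 1024, 32000
model = LlavaStyleModel(
    vision_encoder=nn.Identity(),  # stand-in: assume inputs are already patch features
    llm=nn.Identity(),             # stand-in for the transformer decoder stack
    token_embedding=nn.Embedding(vocab, llm_dim),
    vision_dim=vision_dim,
    llm_dim=llm_dim,
)
patches = torch.randn(2, 196, vision_dim)        # pretend CLIP output
ids = torch.randint(0, vocab, (2, 16))
out = model(patches, ids)                        # [2, 196 + 16, llm_dim]
```

One design note, for what it's worth: in LLaVA's published training recipe the vision encoder stays frozen throughout, with stage one training only the projector and stage two fine-tuning the projector plus the LLM, which is why keeping the projector as a separate module is convenient.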
Nice! I'll come back to you with more questions later; I'm not sure I'll start working on it this week.
@youssefadr have you worked on this to any capacity? I'm interested in picking this up if not.
@theadamsabra if not, you are more than welcome to take it up |
@ebsmothers thanks! If I don't get a response by tomorrow I'll just pick it up myself |
🚀 The feature, motivation and pitch
LLaVA currently seems to be a strong open-source competitor to GPT-4V, but it doesn't seem to be supported by the library. Do you plan on adding it? If yes, is there something I could contribute to help?