Multimodal capabilities of Llama3 #309

fyang064 · 2024-08-08T21:13:49Z

I saw the compositional approach to adding multimodal capabilities to Llama3 in the report, and I am curious about the details of the image encoder and adaptor. Could you please share any of the model config files for the vision experiments?
