Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

error:RuntimeError: Error(s) in loading state_dict for CLIPVisionModel: size mismatch for vision_model.embeddings.class_embedding: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([768]). #175

Open
zapqqqwe opened this issue Jun 29, 2024 · 3 comments

Comments

@zapqqqwe
Copy link

为什么报错呢,下载的LanguageBind/LanguageBind_Video_merge 和 LanguageBind/LanguageBind_Image 在本地同时,在config.json文件修改了mm_video_tower 和 mm_image_tower 分别为本地的位置,但是报错,我看好像clip的隐藏层768但是设置的为1024,怎么解决呢

@zapqqqwe
Copy link
Author

huggface里面的LanguageBind/LanguageBind_Video_merge 和 LanguageBind/LanguageBind_Image

@Liu98C
Copy link

Liu98C commented Jul 8, 2024

huggface里面的LanguageBind/LanguageBind_Video_merge 和 LanguageBind/LanguageBind_Image

请问您解决了吗

@Liu98C
Copy link

Liu98C commented Jul 8, 2024

huggface里面的LanguageBind/LanguageBind_Video_merge 和 LanguageBind/LanguageBind_Image

#57 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants