fix: handle SDPA attention implementation for vision encoder #59

xffxff · 2024-11-11T05:53:18Z

ref: #54

AutoModelForCausalLM.from_pretrained(model_id_or_path, device_map="auto", torch_dtype=torch.bfloat16, trust_remote_code=True, attn_implementation="sdpa") currently raises an error because our ViT model does not support the sdpa attention implementation. This PR introduces a fallback mechanism: when attn_implementation="sdpa" is set, the ViT model will automatically use "flash_attention_2" instead, while the language model continues to use sdpa. A warning will be issued to inform the user of this fallback behavior.

xffxff added 2 commits November 11, 2024 05:44

fix: handle SDPA attention implementation for vision encoder

ecff81d

make format happy

1ad2996

xffxff merged commit b84e928 into main Nov 11, 2024
1 check passed

xffxff deleted the sdpa branch November 11, 2024 07:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: handle SDPA attention implementation for vision encoder #59

fix: handle SDPA attention implementation for vision encoder #59

xffxff commented Nov 11, 2024 •

edited

Loading

fix: handle SDPA attention implementation for vision encoder #59

fix: handle SDPA attention implementation for vision encoder #59

Conversation

xffxff commented Nov 11, 2024 • edited Loading

xffxff commented Nov 11, 2024 •

edited

Loading