You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The whisper example has only been optimized and tested for cpu and cuda ep. It uses multiple onnxruntime custom operators that have not been implemented on other EPs.
Describe the bug
When running optimized whisper medium int8 on macOS coreml I get the error
CreateSession(Msg("Could not find an implementation for DecoderMaskedMultiHeadAttention(1) node with name 'Attention_0'"))
To Reproduce
Enable CoreML and use medium.int8 gpu model from https://huggingface.co/thewh1teagle/whisper-olive/tree/main
Expected behavior
It should run without error
Olive config
Olive logs
Add logs here.
Other information
Additional context
Related: #1213
Optimized on ubuntu server with cuda
The text was updated successfully, but these errors were encountered: