You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Using the HF pipeline with the bitsandbytes quantization doesn't work on MPS yet. However, the llama.cpp runtime works well on a Mac, so that can be leveraged. We need to dynamically load/use a runtime based on the platform.
The text was updated successfully, but these errors were encountered:
Using the HF pipeline with the bitsandbytes quantization doesn't work on MPS yet. However, the llama.cpp runtime works well on a Mac, so that can be leveraged. We need to dynamically load/use a runtime based on the platform.
The text was updated successfully, but these errors were encountered: