How to use as replacement of chatgpt with 4o #1366
-
Hi there, great work. I have enjoyed playing with gguf models a lot. However, I've a question, how can koboldcpp be used as replacement of gpt4o (multi modal API). Currently this model runs at 5001, however, all I can do is have send text to text conversation. But I also want to be able to send audio files or use an endpoint where I can have live conversation through speech (both me and bot) as well as upload images which it can explain by speaking, like chatgpt (gpt4o) can. I can provide it model files by uploading text, audio as well as mmproj file, however, I can only interact with it through chat interface, or at least I don't know what endpoint I should use and how to send post requests for audio/image files (or live audio stream for speech recognition). I use python but bash and cmd commands are also welcomed. Thank you. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
There isn't an all-in-one "omni" model for KoboldCpp, instead you need to use various smaller models combined to get what you need.
Additionally, it also provides openai, xtts and various other APIs like ComfyUI compatibility. Generally you can access all basic features from the Web KoboldAI lite interface, just check various settings and observe the API calls used. You can find all model links on the wiki. Please refer to it for more info. |
Beta Was this translation helpful? Give feedback.
There isn't an all-in-one "omni" model for KoboldCpp, instead you need to use various smaller models combined to get what you need.
images
…