Update user docs for running llm server + upgrade gguf to 0.11.0 #463
Annotations
2 errors
Llama3.1 8B FP16 (3.11, llama-mi300x-3)
Canceling since a higher priority waiting request for 'CI - sharktank perplexity short-676' exists
Llama3.1 8B FP16 (3.11, llama-mi300x-3)
The operation was canceled.