Skip to content

Commit

Permalink
support model quanitze option
Browse files Browse the repository at this point in the history
  • Loading branch information
lebrunel committed May 11, 2024
1 parent b56089d commit 9a95334
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions lib/ollama.ex
Original file line number Diff line number Diff line change
Expand Up @@ -251,10 +251,6 @@ defmodule Ollama do
required: true,
doc: "List of messages - used to keep a chat memory.",
],
template: [
type: :string,
doc: "Prompt template, overriding the model default.",
],
format: [
type: :string,
doc: "Set the expected format of the response (`json`).",
Expand Down Expand Up @@ -419,6 +415,10 @@ defmodule Ollama do
required: true,
doc: "Contents of the Modelfile.",
],
quantize: [
type: :string,
doc: "Quantize f16 and f32 models when importing them.",
],
stream: [
type: {:or, [:boolean, :pid]},
default: false,
Expand Down

0 comments on commit 9a95334

Please sign in to comment.