chore: support customized OpenAI model.json #3961
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Describe Your Changes
This PR allows users to customize model.json to point to a compatible OpenAI endpoint. It applies the legacy model scanning for these cases.
Changes made
The
git diff
highlights the following changes in themodel-json.ts
file:Added Local Engines Array:
LocalEngines
has been added. It contains specific inference engines from theInferenceEngine
enum:cortex
,cortex_llamacpp
,cortex_tensorrtllm
,cortex_onnx
,nitro_tensorrt_llm
, andnitro
.Modified Conditional Logic:
scanModelsFolder
function was altered.model
if allexistFiles
evaluated to true.LocalEngines
list or if allexistFiles
are true before returning themodel
. If either of these conditions is met, the model is returned.These changes adjust how models are processed based on their engine type and the existence of corresponding files.