-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Chore: model hub v0.5.1 update (#3036)
* init model * init qwen2 * version bump * refactor: correct icon * chore: Refactor/issue template feature request (#3037) * refactor: add issue template form for bug * refactor: config blank_issues_enabled: false * refactor: config feature request * refactor: config feature request --------- Co-authored-by: Van-QA <[email protected]> * refactor: correct icon * refactor: allow blank issue --------- Co-authored-by: Van-QA <[email protected]> Co-authored-by: Van Pham <[email protected]>
- Loading branch information
1 parent
34ea178
commit f702506
Showing
6 changed files
with
45 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
36 changes: 36 additions & 0 deletions
36
extensions/inference-nitro-extension/resources/models/qwen2-7b/model.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,36 @@ | ||
{ | ||
"sources": [ | ||
{ | ||
"filename": "Qwen2-7B-Instruct-Q4_K_M.gguf", | ||
"url": "https://huggingface.co/bartowski/Qwen2-7B-Instruct-GGUF/resolve/main/Qwen2-7B-Instruct-Q4_K_M.gguf" | ||
} | ||
], | ||
"id": "qwen2-7b", | ||
"object": "model", | ||
"name": "Qwen 2 Instruct 7B Q4", | ||
"version": "1.0", | ||
"description": "Qwen is optimized at Chinese, ideal for everyday tasks.", | ||
"format": "gguf", | ||
"settings": { | ||
"ctx_len": 32768, | ||
"prompt_template": "<|im_start|>system\n{system_message}<|im_end|>\n<|im_start|>user\n{prompt}<|im_end|>\n<|im_start|>assistant", | ||
"llama_model_path": "Qwen2-7B-Instruct-Q4_K_M.gguf", | ||
"ngl": 28 | ||
}, | ||
"parameters": { | ||
"temperature": 0.7, | ||
"top_p": 0.95, | ||
"stream": true, | ||
"max_tokens": 32768, | ||
"stop": [], | ||
"frequency_penalty": 0, | ||
"presence_penalty": 0 | ||
}, | ||
"metadata": { | ||
"author": "Alibaba", | ||
"tags": ["7B", "Finetuned"], | ||
"size": 4680000000 | ||
}, | ||
"engine": "nitro" | ||
} | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters