-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve model download speed, progress display, etc #58
Comments
Partially related context:
|
I ve been downloading the 8b for almost an hour now... So yeah any improvement in this regard will be great. Or maybe allow to just download the models then place them (I don't know where the model will be located as I didn't analyse the source code yet) But having them shared with other softwares like Ollama would be great (Symbolic link) |
Lines 13 to 25 in 14e4ae5
|
Yeah I figured that out and updated to Phi 3.5 but the PR shows an error (unrelated I guess as many dependabot PRs are rejected as well) My mistake (I am extremely exhausted lately and I cannot focus)... Still I want to make some progress with this and create a workflow that would allow me to puruse some new venues in the near future by learning from some extensions how they operate internally... alter some and analyse some other for malwares. This is the error of the PR, I thought it would be a drop from 3.1 to 3.5 but I guess I have to learn more about the difference between the tokenization of both.
|
Some more relevant links/functions/etc that could be used here:
|
I was reading the
node-llama-cpp
docs, and they mention that theipull
package can be useful for improved model download speeds:I can see that the current
download
command callsdownloadModel
:humanify/src/commands/download.ts
Lines 1 to 6 in 14e4ae5
Which is defined here, and seems to just use
fetch
currently, as well as implementing its own download progress tracking inshowProgress
:humanify/src/local-models.ts
Lines 38 to 65 in 14e4ae5
humanify/src/progress.ts
Lines 4 to 13 in 14e4ae5
I wonder if using iPull might make sense, both in increased download speed, as well as better download progress visibility/etc.
The text was updated successfully, but these errors were encountered: