-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
log_artifact
: external and non-DVC tracked files support
#551
Comments
Feels like the questions are assuming As of today, it is decoupled from an implementation perspective ( For those scenarios, why would you want to use If we still want a Python API, should we make it part of |
The use case I can think of is huggingfaces integration. Is that what you have in mind? Would we also make
@daavoo If we import the model, isn't that part of the core functionality of |
I don't know yet, at the end people might decide on their own also how exactly bring the artifact from a commit. |
Can we come up with a use case where model registry is needed in this scenario? |
To be honest, I don't see the difference is it DVC-tracked or not. All the same scenarios apply, no? Find a specific version of a model (by a tag) and fetch it to deploy. Assign stages, etc, etc. In this case Could you may be clarify your question, @dberenbaum ? |
How do you envision this workflow if the artifact is managed by git lfs? What commands would I run in my deploy script? |
@dberenbaum I'm not that familiar with Git lfs, but from what I remember you could probably manage it with |
There are two mechanisms we could use in dvc for this:
Neither of these automatically detect whether the files are version-aware today. It would be great if we can add support for that in dvc since I see it in other loggers, but I can't remember the obstacles to doing it (cc @pmrowla).
Neptune is the only logger I have found that supports tracking local files without uploading them, so I'm not sure it should be a high priority, but it's possible to support it with option 2 above. How should we expose this functionality in dvclive? Some options:
Some other loggers for comparison (note that mlflow does not support this pattern at all AFAICT): |
More of a question for now:
dvc.yaml
should support them I think. Do we need to create import file.dvc
or not?The text was updated successfully, but these errors were encountered: