Include code snippets for other libraries? #2986

Open
severo opened this issue Jul 12, 2024 · 4 comments
Labels
P2 Nice to have question Further information is requested

Comments

@severo
Collaborator

severo commented Jul 12, 2024

For example, in huggingface/huggingface.js#797, we add distilabel, fiftyone and argilla to the list of libraries the Hub knows. However, the aim is only to handle the user-defined tags better, not to show code snippets.

In this issue, I propose we discuss whether to expand the list of dataset libraries for which we show code snippets. For now, we support pandas, HF datasets, webdataset, mlcroissant and dask.

We already mentioned polars as a potential new lib, I think. Maybe duckdb too?
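To make the proposal concrete, here is a minimal sketch of a snippet registry keyed by library name. The `SNIPPET_TEMPLATES` registry, the `render_snippet` helper and the template texts are hypothetical illustrations, not the Hub's actual implementation. The polars and duckdb templates assume both libraries can read `hf://` paths, and the repo and file names are placeholders.

```python
from string import Template

# Hypothetical registry: library name -> code template shown to the user.
# The template bodies are assumptions, not what the Hub actually renders.
SNIPPET_TEMPLATES = {
    "pandas": Template(
        "import pandas as pd\n\n"
        'df = pd.read_parquet("hf://datasets/$repo_id/$path")'
    ),
    "polars": Template(
        "import polars as pl\n\n"
        'df = pl.read_parquet("hf://datasets/$repo_id/$path")'
    ),
    "duckdb": Template(
        "import duckdb\n\n"
        "rel = duckdb.sql(\"SELECT * FROM 'hf://datasets/$repo_id/$path'\")"
    ),
}


def render_snippet(library, repo_id, path):
    """Return the snippet for `library`, or None if the library is not registered."""
    template = SNIPPET_TEMPLATES.get(library)
    if template is None:
        return None
    return template.substitute(repo_id=repo_id, path=path)


if __name__ == "__main__":
    # Placeholder repo and file names, for illustration only.
    print(render_snippet("polars", "<user>/<repo>", "data/train.parquet"))
```

With a registry like this, supporting a new library would mostly mean adding one template entry.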

@burtenshaw

burtenshaw commented Jul 12, 2024

With this feature PR, Argilla will be able to load predefined dataset repos that contain a .argilla config dir. The dataset could then be loaded in Argilla like this:

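import argilla as rg

# Connect to a running Argilla instance (fill in your own URL and API key).
client = rg.Argilla(api_url="<api_url>", api_key="<api_key>")
# Load a dataset from a Hub repo that contains a .argilla config dir.
dataset = rg.Dataset.from_hub(repo_id="<repo_id>")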

Could we show this snippet based on the presence of .argilla ?
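As a rough sketch of what "based on the presence of .argilla" could mean: given the list of file paths in a dataset repo, show the Argilla snippet only when at least one path sits under a `.argilla` directory. The helper names `has_argilla_config` and `argilla_snippet` are hypothetical, and the sketch assumes the config dir lives at the repo root.

```python
from pathlib import PurePosixPath

# Assumption: the config directory sits at the root of the dataset repo.
ARGILLA_CONFIG_DIR = ".argilla"


def has_argilla_config(repo_files):
    """Return True if any path lives under a top-level .argilla directory."""
    return any(
        PurePosixPath(path).parts[:1] == (ARGILLA_CONFIG_DIR,)
        for path in repo_files
    )


def argilla_snippet(repo_id, repo_files):
    """Return the Argilla loading snippet, or None if the repo has no .argilla dir."""
    if not has_argilla_config(repo_files):
        return None
    return (
        "import argilla as rg\n\n"
        'client = rg.Argilla(api_url="<api_url>", api_key="<api_key>")\n'
        f'dataset = rg.Dataset.from_hub(repo_id="{repo_id}")'
    )


if __name__ == "__main__":
    # Hypothetical file listing, for illustration only.
    files = ["README.md", ".argilla/settings.json", "data/train.parquet"]
    print(argilla_snippet("<repo_id>", files))
```

The file listing could come, for instance, from `huggingface_hub.list_repo_files(repo_id, repo_type="dataset")`.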

@julien-c
Member

polars/duckdb 👍

@julien-c
Member

> Could we show this snippet based on the presence of .argilla ?

sounds reasonable!

@dvsrepo
Member

dvsrepo commented Jul 12, 2024

> > Could we show this snippet based on the presence of .argilla ?
>
> sounds reasonable!

This would be awesome, eventually!

As from_hub will be released along with argilla 2.0 in a few days, I think we need to make it bulletproof with some iteration and further testing with the community.

4 participants