We created a CLIP index for the datacomp-12.8M dataset 🔍 🖼️ #78
RobbeSneyders
started this conversation in
Show and tell
Replies: 1 comment 1 reply
-
Very cool, thanks for sharing! Are you also planning to make indices for the larger pools we have?
-
We created a CLIP index for the datacomp-12.8M dataset to make it easier to search and use. We built it with Fondant and published it on the Hugging Face Hub. You can find more details, including how to use it, in this short post.
Let us know what you think. We might create more indices for the larger datasets later.