BM25 support with Typescript #219
Comments
We need BM25 encoder support in TypeScript to support hybrid search as well. Could you please add it?
+1 please
+1
Yes please! This is exactly what I need. +5
For right now, we encourage you to use our Python library for generating the sparse vectors (note: it's in beta): https://github.com/pinecone-io/pinecone-text (But noted re: a TS implementation!)
+1
+1
+1
Hi, all! Audrey from Pinecone here. Can you provide some clarification around this feature request and an example use case? For instance, when you say Thanks in advance!
The problem is that there isn't a way to do a hybrid upsert or search on Pinecone in Node.js, primarily because the BM25 library is only available in Python. The real problem is converting sentences into sparse vectors. As a result, you basically have to set up a Python API endpoint to do the conversion (pass in text, get back sparse vectors) when you could just do it inline. This is a big blocker for me at the moment for setting up a hybrid solution in my 100% Node.js environment. I think what would help tremendously is an example showing how to populate a hybrid Pinecone index using JS and then how to query against it in JS. If this isn't possible, I think making it possible is what everyone is asking for. This exemplifies the problem here:
Same issue here. There should be a full hybrid option from the TS client.
Gotcha, thanks for the clarification, y'all. So the crux is in generating sparse vectors via the TS client, so you can upsert and search all in one go, got it. Re: an example @BenBrewerBowman, there is not currently a way to do this while remaining in the TS client ecosystem (you can only currently use
Hi, I'd echo @BenBrewerBowman's comment. Right now my team is managing our own API endpoint, which uses the Python SDK to compute BM25 sparse vectors. It would be great to see both a hosted BM25 (or equivalent sparse vector model) available through Pinecone and BM25 support via the TS SDK.
@aulorbe thanks for letting them know! Just to 100% clarify This would be very exciting to have, so keep us updated with timeline! Thanks 🙏
@BenBrewerBowman I'm a little confused -- right now, you can upsert sparse vectors (BM25-encoded or otherwise) and search them with all of our clients via hybrid search. Is that not what you are trying to do? The thing users cannot currently do is generate those sparse vectors via the TS client.
"The thing users cannot currently do is generate those sparse vectors via the TS client." Yes, correct. The other operations are possible through the clients.
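For anyone who needs a stopgap while staying in Node.js, here is a minimal sketch of a BM25-style sparse encoder in TypeScript. It is not Pinecone's implementation and will not produce the same indices or weights as pinecone-text. The names `Bm25Encoder`, `tokenize`, `hashToken`, and `countTerms`, the naive tokenizer, the FNV-1a hash, the smoothed IDF formula, and the defaults `k1 = 1.2` and `b = 0.75` are all assumptions made for illustration.

```typescript
// Sketch only: every name below is hypothetical and not part of any Pinecone SDK.
type SparseVector = { indices: number[]; values: number[] };

// Naive tokenizer (assumption): lowercase, split on anything that is not a-z or 0-9.
function tokenize(text: string): string[] {
  return text.toLowerCase().split(/[^a-z0-9]+/).filter((t) => t.length > 0);
}

// FNV-1a 32-bit hash (assumption): maps a token to an unsigned 32-bit index.
function hashToken(token: string): number {
  let h = 0x811c9dc5;
  for (let i = 0; i < token.length; i++) {
    h ^= token.charCodeAt(i);
    h = Math.imul(h, 0x01000193);
  }
  return h >>> 0;
}

// Count how often each hashed token id occurs.
function countTerms(ids: number[]): Map<number, number> {
  const counts = new Map<number, number>();
  for (const id of ids) counts.set(id, (counts.get(id) || 0) + 1);
  return counts;
}

class Bm25Encoder {
  private k1: number;
  private b: number;
  private docFreq = new Map<number, number>();
  private numDocs = 0;
  private avgDocLen = 0;

  constructor(k1 = 1.2, b = 0.75) {
    this.k1 = k1;
    this.b = b;
  }

  // Learn document frequencies and the average document length from a corpus.
  fit(corpus: string[]): void {
    this.docFreq = new Map<number, number>();
    let totalLen = 0;
    for (const doc of corpus) {
      const ids = tokenize(doc).map((t) => hashToken(t));
      totalLen += ids.length;
      // Each distinct term counts once per document.
      countTerms(ids).forEach((_count, id) => {
        this.docFreq.set(id, (this.docFreq.get(id) || 0) + 1);
      });
    }
    this.numDocs = corpus.length;
    this.avgDocLen = corpus.length > 0 ? totalLen / corpus.length : 0;
  }

  // Document side: saturating term-frequency weight, tf / (tf + k1 * (1 - b + b * len / avgLen)).
  encodeDocument(text: string): SparseVector {
    const ids = tokenize(text).map((t) => hashToken(t));
    const lengthRatio = this.avgDocLen > 0 ? ids.length / this.avgDocLen : 1;
    const norm = this.k1 * (1 - this.b + this.b * lengthRatio);
    const out: SparseVector = { indices: [], values: [] };
    countTerms(ids).forEach((tf, id) => {
      out.indices.push(id);
      out.values.push(tf / (tf + norm));
    });
    return out;
  }

  // Query side: smoothed IDF per distinct term, normalized to sum to 1.
  encodeQuery(text: string): SparseVector {
    const out: SparseVector = { indices: [], values: [] };
    let total = 0;
    countTerms(tokenize(text).map((t) => hashToken(t))).forEach((_tf, id) => {
      const df = this.docFreq.get(id) || 0;
      const idf = Math.log(1 + (this.numDocs - df + 0.5) / (df + 0.5));
      out.indices.push(id);
      out.values.push(idf);
      total += idf;
    });
    if (total > 0) out.values = out.values.map((v) => v / total);
    return out;
  }
}
```

The idea would be to call `fit` once on your corpus, use `encodeDocument` for the sparse part of each record you upsert, and use `encodeQuery` for the sparse part of each query. The same encoder has to be used on both sides, since the indices only line up when the tokenizer and hash match. The `{ indices, values }` shape follows the sparse vector format discussed above; check the TS client docs for the exact field names it expects.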
Is this your first time submitting a feature request?
Describe the feature
Maybe I just cannot find it, but we need BM25 encoder support in TypeScript to support hybrid search.
Searching for BM25 in the issues and codebase did not return anything.
Describe alternatives you've considered
No response
Who will this benefit?
No response
Are you interested in contributing this feature?
No response
Anything else?
No response