Uploading openparsed docs to Vector DB #25
Unanswered
GedeonIsezerano
asked this question in
Q&A
Replies: 1 comment
I'm not familiar with the details of Pinecone, but each node has a `text` field that you can embed yourself:

```python
import openparse

basic_doc_path = "./sample-docs/mobile-home-manual.pdf"
parser = openparse.DocumentParser()
parsed_basic_doc = parser.parse(basic_doc_path)

# Convert the parsed document to a plain dict so each node can be modified.
content_dict = parsed_basic_doc.dict()
for node in content_dict["nodes"]:
    # your_function_to_embed_text is a placeholder for your own embedding call.
    node["vector"] = your_function_to_embed_text(node["text"])
```

We will likely add the vector attribute directly to the node, but currently it gets thrown out after the pipeline finishes running. I will update this answer when that is implemented.
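Once each node has a vector, the remaining step is to shape the nodes into records a vector DB can accept. The following is a minimal sketch, not the library's own method. It assumes each node dict exposes a `text` key, as in the snippet above. `toy_embed_text`, `nodes_to_records`, and the `doc_id` naming scheme are hypothetical names introduced for illustration, and the `{"id", "values", "metadata"}` layout follows the record format Pinecone's upsert call accepts.

```python
import hashlib
from typing import Dict, List

EMBEDDING_DIM = 8  # toy size; a real embedding model fixes its own dimension


def toy_embed_text(text: str, dim: int = EMBEDDING_DIM) -> List[float]:
    """Hypothetical stand-in for your_function_to_embed_text.

    Derives a deterministic pseudo-vector from a SHA-256 digest. It carries no
    semantic meaning; swap in a real embedding model before using it for search.
    """
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [byte / 255.0 for byte in digest[:dim]]


def nodes_to_records(nodes: List[Dict], doc_id: str) -> List[Dict]:
    """Turn node dicts into {"id", "values", "metadata"} records."""
    records = []
    for position, node in enumerate(nodes):
        records.append(
            {
                "id": f"{doc_id}-{position}",  # assumed ID scheme: document name plus node position
                "values": toy_embed_text(node["text"]),
                "metadata": {"text": node["text"]},  # keep the text so search hits are readable
            }
        )
    return records


# Stand-in for parsed_basic_doc.dict()["nodes"]; only the "text" key is assumed.
sample_nodes = [
    {"text": "Check the tie-downs every spring."},
    {"text": "Drain the water heater before winter storage."},
]
records = nodes_to_records(sample_nodes, doc_id="mobile-home-manual")

# With a Pinecone index object created elsewhere, the upload step would be
# roughly: index.upsert(vectors=records)
```

Storing the node text in the metadata means a query result can be shown to the user without going back to the parsed document. The index dimension has to match the length of the vectors your real embedding function returns.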
What would be the best way to upload the parsed data to a vector DB such as Pinecone? The output from DocumentParser provides nodes but not embedding values.
Here is my code