Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RAG (read PDFs and other documents) #9

Open
thisandthat1 opened this issue Jan 3, 2024 · 2 comments
Open

RAG (read PDFs and other documents) #9

thisandthat1 opened this issue Jan 3, 2024 · 2 comments

Comments

@thisandthat1
Copy link

thisandthat1 commented Jan 3, 2024

To enable more usability, can we add a feature to upload the docs and let the model understand the same, so that we can chat about the document.

Document formats like .txt, .docx, .csv, .pdf

@cztomsik
Copy link
Owner

cztomsik commented Jan 4, 2024

this is planned, .txt (and images) will happen first

@cztomsik cztomsik changed the title Option to read PDFs and other documents RAG (read PDFs and other documents) Apr 13, 2024
@cztomsik
Copy link
Owner

This will first happen with BM25 (classic fulltext) and then also with embeddings.

There will be 2 versions of this

  1. per-session RAG with uploaded PDF (scope of the original ticket)

    • either from the last message or from all uploaded files in the conversation
  2. global section, called datasets/datasources, where you upload files (or select folders) and it will be available for chatting (and namespaced, so you can also select which source(s) to use)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants