This project allows you to upload a PDF document and ask questions about its content. It uses langchain, openapi ai model and Facebook Ai Similarity Search(FAISS) library to process the text in the PDF and provide answers to questions pertaining the document.
-
Clone the repository:
git clone https://github.com/john-thuo1/chatWithPDF cd into your directory/ open with vscode
-
Create a Virtual Environment:
python -m venv env
-
Install the required dependencies:
pip install -r requirements.txt
-
Create OpenAI API Key and add it to your .env file: openai
Specify the variable as follows : OPENAI_API_KEY = "Secret Key"
-
Run the application:
streamlit run App.py
- Add support for multiple file formats
- Implement Document Indexing techniques by use of libraries such as Elasticsearch or Apache Solr
- Enhance question answering capabilities: Explore advanced question answering techniques, such as using transformer models like BERT or GPT, to improve the accuracy and comprehension of the system.
- Use a model that supports multiple languages, most notably some BERT models do support this.