This is a simple RAG pipeline template for adding context to Gemini prompts using a Chroma database and Gemini embeddings. 🌟
Clone the Repository
git clone https://github.com/Rohitmantha/RAG-Pipeline-using-Gemini-API.git
Go to the project directory
cd RAG-Pipeline-using-Gemini-API
Install dependencies
Make sure you have a virtual environment set up. Install the required packages using pip:
pip install -r requirements.txt
Configure Environment Variables
Create a .env file in the root directory of the project and add your Google API key:
GOOGLE_API_KEY=your_google_api_key_here
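For reference, here is a minimal sketch of how a script can pick up this key, assuming the python-dotenv package (the repository's scripts may load it differently):

```python
import os

from dotenv import load_dotenv  # provided by the python-dotenv package

load_dotenv()  # reads key=value pairs from .env into the environment
api_key = os.environ["GOOGLE_API_KEY"]
```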
To populate the Chroma database with your documents:
- Place your PDF documents in the ./resources/ directory.
- Run the populate_db.py script:
python populate_db.py
This script will load PDF documents, split them into manageable chunks, and save them to the Chroma vector database. 📚
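For orientation, the core of that flow might look roughly like the sketch below. It assumes LangChain's Chroma and Google integrations; the chunk size, overlap, embedding model name, and persist directory are illustrative values, not necessarily what populate_db.py uses:

```python
from langchain_community.document_loaders import PyPDFDirectoryLoader
from langchain_community.vectorstores import Chroma
from langchain_google_genai import GoogleGenerativeAIEmbeddings
from langchain_text_splitters import RecursiveCharacterTextSplitter

# Load every PDF in the resources directory.
documents = PyPDFDirectoryLoader("./resources/", glob="*.pdf").load()

# Split into overlapping chunks; these sizes are illustrative and tunable.
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents(documents)

# Embed the chunks with Gemini embeddings and persist them in Chroma.
embeddings = GoogleGenerativeAIEmbeddings(model="models/embedding-001")
Chroma.from_documents(chunks, embeddings, persist_directory="./chroma_db")
```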
To query the database and generate an answer grounded in the retrieved context, use the gen_context.py script and pass your query as an argument:
python gen_context.py "Your query here"
This script will retrieve relevant chunks from the Chroma database, generate a prompt based on the retrieved context, and pass it to an LLM. 🤖
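Under the same LangChain assumptions as the sketch above, the query side might look roughly like this (the persist directory, k value, prompt wording, and gemini-1.5-flash model name are all illustrative, not necessarily what gen_context.py uses):

```python
import sys

from langchain_community.vectorstores import Chroma
from langchain_google_genai import ChatGoogleGenerativeAI, GoogleGenerativeAIEmbeddings

query = sys.argv[1]

# Reopen the persisted Chroma database with the same embedding function.
embeddings = GoogleGenerativeAIEmbeddings(model="models/embedding-001")
db = Chroma(persist_directory="./chroma_db", embedding_function=embeddings)

# Retrieve the chunks most similar to the query.
results = db.similarity_search(query, k=5)
context = "\n\n".join(doc.page_content for doc in results)

# Build a context-grounded prompt and pass it to the LLM.
prompt = (
    "Answer the question using only the context below.\n\n"
    f"Context:\n{context}\n\nQuestion: {query}"
)
llm = ChatGoogleGenerativeAI(model="gemini-1.5-flash")
print(llm.invoke(prompt).content)
```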
populate_db.py reads and processes PDF documents, splits them into chunks, and saves them in the Chroma database.
You can tweak the chunking parameters (chunk size and chunk overlap) to find what works best for your documents. To read from another file type, change the *.pdf pattern in the load_documents() function in populate_db.py to the format you need, as in the example below.
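For example, a hypothetical variant that reads plain-text files instead of PDFs could swap in a generic directory loader (assuming LangChain's DirectoryLoader and TextLoader):

```python
from langchain_community.document_loaders import DirectoryLoader, TextLoader

# Hypothetical: load .txt files from the same resources directory.
documents = DirectoryLoader("./resources/", glob="*.txt", loader_cls=TextLoader).load()
```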
gen_context.py retrieves relevant chunks for a given query and generates a detailed answer using the Google Generative AI model.
Feel free to contribute to this project! Whether you have suggestions, improvements, or fixes, your input is welcome. Just fork the repo and create a pull request. 🚀