The goal of this repository is to provide a collection of architectures and techniques for leveraging Large Language Models (LLMs). These sample applications serve as starting points for building advanced LLM products. The material collected here is sourced from public resources, including various repositories and courses.
As the technology evolves, some packages and tools may become outdated. However, efforts will be made to update this repository with new tools and technologies whenever possible. At the very least, these notebooks demonstrate the product-level possibilities available for leveraging LLMs.
- Fundamentals
- Retrieval-Augmented Generation (RAG)
- Agents and Tools
- Case Studies
- Installation
- Contributing
- License
In the following notebooks, different databases and practical use cases are presented. The purpose is to expand your imagination towards the broad applications of LLMs in different domains. These sample scenarios include:
- Technical support as a chatbot technician
- Entity extraction from customer travel queries
- Customer support
- Healthcare consultation
- Patient inquiry analysis
- Patient symptom categorization
- Search and summary of clinical trials
- Search and retrieve information from the Novartis 2023 annual report
- Information retrieval from the Amazon Product Catalog
The Fundamentals section covers the essential concepts and techniques for working with Large Language Models (LLMs). This section provides a solid foundation for understanding and utilizing LLMs, including topics such as chatbots, conversational agents, chains, prompt templates, and output parsers. By exploring these basics, you will gain the necessary skills to build simple to advanced applications that leverage the power of LLMs for various conversational tasks and QA systems.
This section covers general topics related to QA, chatbots, chains, and other conversational topics with LLMs, ranging from simple to advanced.
- Introduction and Basics of Chatbots and Conversational Agents
- A direct call to the OpenAI API to perform chat completions
- Chat completions with LangChain APIs
- Using prompt templates to create custom prompts
- Using output parsers to extract key information from completions
- Use Cases:
- Technical support as a chatbot technician
- Entity extraction from customer travel queries
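The prompt-template and output-parser ideas above can be sketched in plain Python. This is an illustrative stand-in only (the notebooks use LangChain's `PromptTemplate` and output-parser classes); the template wording, field names, and example message below are made up for demonstration.

```python
# Minimal sketch of a prompt template and an output parser (pure Python;
# the notebooks use LangChain's PromptTemplate and output parsers instead).

TEMPLATE = (
    "You are a support technician. Extract the product and the issue "
    "from the customer message below.\n"
    "Message: {message}\n"
    "Reply as: product=<product>; issue=<issue>"
)

def build_prompt(message):
    """Fill the template slot to create a custom prompt."""
    return TEMPLATE.format(message=message)

def parse_completion(completion):
    """Parse 'product=...; issue=...' into a dict of key information."""
    fields = {}
    for part in completion.split(";"):
        key, _, value = part.partition("=")
        fields[key.strip()] = value.strip()
    return fields

prompt = build_prompt("My X200 router keeps dropping the WiFi connection.")
parsed = parse_completion("product=X200 router; issue=WiFi keeps dropping")
print(parsed)  # {'product': 'X200 router', 'issue': 'WiFi keeps dropping'}
```

The same pattern scales to the entity-extraction use case: the template constrains the model's output format, and the parser turns the completion back into structured data.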
- Using Memory to Have an Interactive Conversational Chatbot
- Complete memory recording
- Window-based memory
- Summary-based memory
- Use Cases:
- Technical support
- Customer support
- Healthcare consultation
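Window-based memory, one of the strategies listed above, can be illustrated with a small pure-Python sketch: keep only the last k exchanges so the prompt stays short. (The notebooks use LangChain's memory classes for this; the class and example dialogue below are illustrative only.)

```python
from collections import deque

class WindowMemory:
    """Keep only the last k exchanges, like a window-based chat memory.
    Illustrative stand-in for LangChain's window memory."""

    def __init__(self, k):
        # each exchange = one user turn + one assistant turn
        self.turns = deque(maxlen=2 * k)

    def add(self, role, text):
        self.turns.append((role, text))

    def as_context(self):
        """Render the remembered turns as the context fed to the LLM."""
        return "\n".join(f"{role}: {text}" for role, text in self.turns)

memory = WindowMemory(k=1)
memory.add("user", "My order is late.")
memory.add("assistant", "I can check that for you.")
memory.add("user", "It was order #123.")
memory.add("assistant", "Order #123 ships tomorrow.")
print(memory.as_context())  # only the last exchange survives
```

Complete-memory recording corresponds to an unbounded list, and summary-based memory replaces old turns with an LLM-written summary instead of dropping them.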
- Using LangChain to Generate LLM Chains
- Introduction to chains
- LLMChain and the LangChain Expression Language (LCEL)
- Intermediate outputs in chains
- Use Cases:
- Patient inquiry analysis
- Patient symptom categorization
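The chain idea, including keeping intermediate outputs, can be sketched without an LLM by composing ordinary functions. This is a plain-Python analogue of a sequential chain, assuming made-up step functions; in the notebooks each step is an actual LLM call.

```python
# Sketch of chaining steps while recording every intermediate output
# (plain-Python analogue of a LangChain sequential chain).

def extract_symptoms(inquiry):
    # stand-in for an LLM call that pulls symptoms out of a patient inquiry
    return inquiry.split(":", 1)[1].strip()

def categorize(symptoms):
    # stand-in for an LLM call that maps symptoms to a category
    return "respiratory" if "cough" in symptoms else "general"

def run_chain(inquiry):
    """Run each step in order, keeping the intermediate outputs."""
    symptoms = extract_symptoms(inquiry)
    category = categorize(symptoms)
    return {"inquiry": inquiry, "symptoms": symptoms, "category": category}

result = run_chain("Patient reports: persistent cough and mild fever")
print(result["category"])  # respiratory
```

Exposing the intermediate outputs (here, the extracted symptoms) is what makes multi-step chains debuggable, which is the point of the "intermediate outputs" notebook section.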
- Coming Soon:
- Using few-shot chains to instruct LLMs
- Numerical evaluation metrics: BLEU, ROUGE, METEOR, CER
Retrieval-Augmented Generation (RAG) is a powerful technique that combines the strengths of information retrieval and natural language generation. By integrating retrieval mechanisms with LLMs, RAG can access and utilize vast amounts of external knowledge, enabling the generation of more accurate, contextually relevant, and informative responses. This approach is particularly useful for tasks requiring up-to-date information, detailed explanations, or specific data points, significantly enhancing the capabilities and performance of LLM-based applications.
- Question Answering (QA) Over Documents Using RAG
- A simple in-memory RAG application using LangChain based on a CSV file
- Use Case:
- Search and summary of clinical trials
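The retrieval half of a simple in-memory RAG pipeline over a CSV file can be sketched as follows. The CSV content is invented, and real retrieval uses embeddings rather than the word-overlap score used here; this only shows the shape of the pipeline (load rows, score against the query, return the top matches for the LLM to summarize).

```python
import csv
import io

# Toy in-memory retrieval step of a RAG pipeline over a CSV file.
# The data and the overlap-based scoring are illustrative stand-ins
# for a document loader plus an embedding-based retriever.

CSV_DATA = """trial_id,summary
T1,Phase 2 trial of drug A for chronic migraine in adults
T2,Phase 3 trial of drug B for type 2 diabetes
T3,Observational study of sleep quality in shift workers
"""

rows = list(csv.DictReader(io.StringIO(CSV_DATA)))

def retrieve(query, k=1):
    """Return the k rows whose summaries share the most words with the query."""
    q_words = set(query.lower().split())
    scored = sorted(
        rows,
        key=lambda r: len(q_words & set(r["summary"].lower().split())),
        reverse=True,
    )
    return scored[:k]

hits = retrieve("trials about type 2 diabetes")
print(hits[0]["trial_id"])  # T2
```

The retrieved rows would then be pasted into the LLM prompt as context, which is what turns plain retrieval into retrieval-augmented generation.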
- RAG Application Using Persistent and Cloud-Based Vector Stores
- Using a public embedding model from Hugging Face
- Applying chunking strategy for analyzing large documents
- Using a persistent vector store from Chroma DB
- A cloud-based vector store from Pinecone
- Case Study:
- Search and retrieve information from the Novartis 2023 annual report
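The chunking strategy mentioned above can be sketched minimally: split a large document into fixed-size chunks that overlap, so relevant context is not cut off at chunk boundaries. (The notebooks use LangChain's text splitters; this function is a simplified stand-in with made-up parameters.)

```python
# Minimal sketch of a chunking strategy for large documents:
# fixed-size character chunks with overlap between consecutive chunks.

def chunk_text(text, chunk_size, overlap):
    """Split text into chunk_size-character chunks, each starting
    `overlap` characters before the previous chunk ended."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

doc = "A" * 25
chunks = chunk_text(doc, chunk_size=10, overlap=2)
print([len(c) for c in chunks])  # [10, 10, 9, 1]
```

Each chunk is then embedded and written to the vector store (Chroma locally, or Pinecone in the cloud), and queries retrieve the nearest chunks rather than whole documents.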
- Evaluation of Retrieval-Augmented Generation Using QAGenerator
- Using QAGenerator to create ground truth samples for QA applications
- Evaluating QA-based RAG using QAGeneratorChain and QAEvalChain
- Case Study:
- Information retrieval from the Amazon Product Catalog
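The evaluation loop above can be sketched as: generated question-answer pairs serve as ground truth, and each predicted answer is graded against its reference. In the notebooks the grader is itself an LLM chain; the normalized substring check and the product example below are illustrative stand-ins only.

```python
# Toy version of the QA evaluation loop: ground-truth QA pairs vs.
# predicted answers, with a simple normalized-substring grader standing
# in for an LLM-based evaluation chain.

def grade(prediction, reference):
    """Return CORRECT if the reference answer appears in the prediction
    after normalizing case and whitespace."""
    norm = lambda s: " ".join(s.lower().split())
    return "CORRECT" if norm(reference) in norm(prediction) else "INCORRECT"

examples = [
    {"question": "What material is the hiking backpack made of?",
     "answer": "ripstop nylon"},
]
predictions = ["The backpack is made of Ripstop  Nylon."]

grades = [grade(p, ex["answer"]) for p, ex in zip(predictions, examples)]
print(grades)  # ['CORRECT']
```

An LLM grader replaces the substring check with a semantic judgment, which is why it catches paraphrased answers that string matching would mark wrong.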
- Corrective RAG, Self RAG
- The sample application of Corrective RAG and Self RAG can be found in this Agentic LLM Demo Application
- Equipped with query re-writing, a retrieval grader, and a hallucination guardrail
- Coming Soon:
- Ranking retrieved documents using similarity scores
- Using UMAP to visualize retrieved document similarity
- Window-based, hierarchical retrieval, and MAP-RERANK to expand input length of RAG pipelines
- Graph RAG
Large Language Model (LLM) agents are crucial for creating dynamic, interactive, and context-aware applications. These agents can handle complex tasks by understanding and generating human-like text, making them invaluable in automating processes, providing intelligent responses, and integrating with various tools and services to perform specific actions. By leveraging LLM agents, developers can build robust applications that offer enhanced user experiences and operational efficiencies.
- Agent-Based Application of Large Language Models
- An LLM agent that calls a single Python interpreter tool
- A ReAct-based LLM agent that uses two built-in tools, a calculator and a Wikipedia search
- User-defined agent and its tools
- Based on LangChain v0.1: create a separate venv
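The core control flow of such an agent (decide on a tool, call it, return the observation) can be sketched without a model. Here the "decision" is faked with a rule and both tools are stand-ins, so only the dispatch loop is real; the notebooks build actual ReAct agents with LangChain tools.

```python
# Minimal sketch of an agent's tool-calling loop. The tool-selection
# "decision" is a hard-coded rule standing in for the LLM's reasoning.

def calculator(expression):
    # restricted eval as a stand-in calculator tool
    return str(eval(expression, {"__builtins__": {}}, {}))

def wikipedia_search(topic):
    # placeholder tool: no network call, just a stub observation
    return f"[stub] summary for '{topic}'"

TOOLS = {"calculator": calculator, "wikipedia": wikipedia_search}

def agent(query):
    """Pick a tool, call it, and return the observation."""
    if any(ch.isdigit() for ch in query):
        tool, arg = "calculator", query
    else:
        tool, arg = "wikipedia", query
    return TOOLS[tool](arg)

print(agent("2 + 3 * 4"))  # 14
```

A real ReAct agent repeats this loop, feeding each observation back to the LLM until it decides it has a final answer.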
- LLM Agents for Clinical Trial Management
- An agentic LLM application that automates the evaluation of patients for clinical trials.
- Patient database, trial database, and participation policies.
- Corrective RAG, Self RAG.
- Tool calling, LangGraph, and a hallucination grader.
- Based on LangChain v0.2: create a separate venv
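The hallucination-grader idea can be sketched as a groundedness check: accept an answer only if its content is supported by the retrieved context. In the notebooks this grader is an LLM node inside a LangGraph workflow; the word-overlap heuristic, threshold, and trial data below are invented for illustration.

```python
# Toy hallucination grader: an answer counts as grounded only if most of
# its words appear in the retrieved context. Stand-in for an LLM grader
# node in a LangGraph workflow.

def is_grounded(answer, context, threshold=0.7):
    """True if at least `threshold` of the answer's words occur in the context."""
    answer_words = [w.strip(".,").lower() for w in answer.split()]
    context_words = set(w.strip(".,").lower() for w in context.split())
    hits = sum(w in context_words for w in answer_words)
    return hits / max(len(answer_words), 1) >= threshold

context = "Trial NCT-001 enrolls adults aged 18 to 65 with type 2 diabetes."
print(is_grounded("The trial enrolls adults with type 2 diabetes.", context))
print(is_grounded("The trial pays participants 500 dollars per visit.", context))
```

When the grader rejects an answer, the graph can route back to retrieval or query re-writing instead of returning an unsupported claim, which is the Corrective RAG pattern in miniature.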
First, clone the repository:

```shell
git clone https://github.com/bab-git/llm-tutorials.git
cd llm-tutorials
```
Then install the dependencies. Here we use the provided `requirements.txt` file; for more complex applications, consider using Poetry to manage dependencies.

```shell
pip install -r requirements.txt
```
Note: The notebooks in this repository use the package versions stated in the `requirements.txt` file. However, some LangChain packages may become deprecated over time, and you may need to update them. To ensure you have the latest versions, run:

```shell
pip install -U langchain langchain-community langgraph <or other packages>
```

If you update packages, you may also need to update parts of the notebook code to match the latest package versions.
Requirements:
Make sure you have the following dependencies installed:
- Python 3.8 or higher
- pip
You need to set a few API keys as environment variables to use OpenAI, Pinecone, Hugging Face, and Tavily via API calls.
You also need to create accounts for these services on their websites.
Using LangSmith (and adding its env variables) is optional, but it is highly recommended for training and development purposes.
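A small check like the following can confirm the required keys are set before running the notebooks. The variable names below follow each provider's common convention (`OPENAI_API_KEY`, `PINECONE_API_KEY`, `HUGGINGFACEHUB_API_TOKEN`, `TAVILY_API_KEY`) but are assumptions here; a given notebook may expect different names.

```python
import os

# Check that the API keys needed by the notebooks are present in the
# environment. Key names are the providers' common conventions and may
# differ from what an individual notebook expects.

REQUIRED_KEYS = [
    "OPENAI_API_KEY",
    "PINECONE_API_KEY",
    "HUGGINGFACEHUB_API_TOKEN",
    "TAVILY_API_KEY",
]

def missing_keys(env=os.environ):
    """Return the required keys that are unset or empty."""
    return [k for k in REQUIRED_KEYS if not env.get(k)]

fake_env = {"OPENAI_API_KEY": "sk-..."}  # dummy value for illustration
print(missing_keys(fake_env))  # the three keys still to be set
```

Running `missing_keys()` without arguments checks your real environment; an empty list means all keys are set.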
This project is licensed under the MIT License. See the LICENSE file for details.