Skip to content

Commit

Permalink
updated diagram
Browse files Browse the repository at this point in the history
Signed-off-by: Francisco Javier Arceo <[email protected]>
  • Loading branch information
franciscojavierarceo committed May 11, 2024
1 parent c19e1b7 commit 7422a9c
Showing 1 changed file with 0 additions and 28 deletions.
28 changes: 0 additions & 28 deletions module_4_rag/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -50,33 +50,5 @@ flowchart TD;
A[Pull Data] --> B[Batch Score Embeddings];
B[Batch Score Embeddings] --> C[Materialize Online];
C[Materialize Online] --> D[Retrieval Augmented Generation];
D[Retrieval Augmented Generation] --> E[Store User Interaction];
E[Store User Interaction] --> F[Update Training Labels];
F[Update Training Labels] --> H[Fine Tuning];
H[Fine Tuning] -. Backpropagate .-> B[Batch Score Embeddings];
```


A simple example of the user experience:

```
Q: Can you tell me about Chicago?
A: Here's some wikipedia facts about Chicago...
```

# Limitations
A common issue with RAG and LLMs is hallucination. There are two common
approaches:

1. Prompt engineering
- This approach is the most obvious but is susceptible to prompt injection

2. Build a Classifier to return the "I don't know" response
- This approach is less obvious, requires another model, more training data,
and fine tuning

We can, in fact, use both approaches to further attempt to minimize the
likelihood of prompt injection.

This demo will display both.

0 comments on commit 7422a9c

Please sign in to comment.