GenAI Conversational RAG Reference

Codename: "Galileo"

All classes are under active development and subject to non-backward compatible changes or removal in any future version. These are not subject to the Semantic Versioning model.

While you can still use these classes, you may need to update your source code when upgrading to a newer version of this package.

Conversational generative AI applications that provide search and summarisation against a collection of private documents (also known as "retrieval augmented generation" or RAG) contain a number of complex components. These include:

An elastic document ingestion pipeline,
A special purpose vector store for document embeddings,
A performant embeddings inference engine,
API access to an aligned large language model, and
The combined functionality exposed via a user interface that maintains session persistance and is secured with authN.

Galileo was created to provide all of these things, integrated into a reference application.

Use case

The use case for this reference application is a virtual legal research assistant, capable of answering questions against US Supreme Court decisions.

For more information, refer to the Galileo Generative AI Reference Sample documentation.

Key documentation links

Overview
- Mental Model
- How it Works
- Security considerations
Getting started
Developer Guide
How to contribute

Disclaimer: Use of Third-Party models

By using this sample, you agree that you may be deploying third-party models (“Third-Party Model”) into your specified user account. AWS does not own and does not exercise any control over these Third-Party Models. You should perform your own independent assessment, and take measures to ensure that you comply with your own specific quality control practices and standards, and the local rules, laws, regulations, licenses and terms of use that apply to you, your content, and the Third-Party Models, and any outputs from the Third-Party Models. AWS does not make any representations or warranties regarding the Third-Party Models.

Disclaimer: Use of Prompt Engineering Templates

Any prompt engineering template is provided to you as AWS Content under the AWS Customer Agreement, or the relevant written agreement between you and AWS (whichever applies). You should not use this prompt engineering template in your production accounts, or on production, or other critical data. You are responsible for testing, securing, and optimizing the prompt engineering as appropriate for production grade use based on your specific quality control practices and standards. AWS may reuse this prompt engineering template in future engagements, but we will not share your confidential data nor your intellectual property with other customers.

Security Considerations

The sample code, software libraries, command line tools, proofs of concept, templates, or other related technology (including any of the foregoing that are provided by our personnel) is provided to you as AWS Content under the AWS Customer Agreement, or the relevant written agreement between you and AWS (whichever applies). You should not use this AWS Content in your production accounts, or on production or other critical data. You are responsible for testing, securing, and optimizing the AWS Content, such as sample code, as appropriate for production grade use based on your specific quality control practices and standards. Deploying AWS Content may incur AWS charges for creating or using AWS chargeable resources, such as running Amazon EC2 instances or using Amazon S3 storage.

There are a number of security considerations that should be taken into account prior to deploying and utilising this sample. The security section outlines each of these considerations.

Other useful samples

If you looking to benchmark multiple LLMs and RAG engines in a simple way, refer to the aws-samples/aws-genai-llm-chatbot. That project focuses more on experimentation with models and vector stores, while this project focuses more on building an extendable 3-tier application.

Name		Name	Last commit message	Last commit date
Latest commit History 226 Commits
.github		.github
.husky		.husky
.projen		.projen
.vscode		.vscode
demo		demo
docs		docs
packages		packages
projenrc		projenrc
scripts		scripts
.eslintrc.json		.eslintrc.json
.gitattributes		.gitattributes
.gitignore		.gitignore
.ncurc.json		.ncurc.json
.npmrc		.npmrc
.nxignore		.nxignore
.prettierignore		.prettierignore
.prettierrc.json		.prettierrc.json
.projenrc.ts		.projenrc.ts
.syncpackrc.json		.syncpackrc.json
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
HEADER		HEADER
LICENSE		LICENSE
LICENSE-THIRD-PARTY		LICENSE-THIRD-PARTY
NOTICE		NOTICE
README.md		README.md
approved-licenses.yaml		approved-licenses.yaml
nx.json		nx.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
prerequisite-check.sh		prerequisite-check.sh
selected-options.json		selected-options.json
tsconfig.dev.json		tsconfig.dev.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

GenAI Conversational RAG Reference

Use case

Key documentation links

Disclaimer: Use of Third-Party models

Disclaimer: Use of Prompt Engineering Templates

Security Considerations

Other useful samples

About

Licenses found

Releases 57

Contributors 9

Languages

License

Licenses found

aws-samples/aws-genai-conversational-rag-reference

Folders and files

Latest commit

History

Repository files navigation

GenAI Conversational RAG Reference

Use case

Key documentation links

Disclaimer: Use of Third-Party models

Disclaimer: Use of Prompt Engineering Templates

Security Considerations

Other useful samples

About

Resources

License

Licenses found

Code of conduct

Security policy

Stars

Watchers

Forks

Releases 57

Contributors 9

Languages