Spring AI Demo

Updated for YOW! Australia, December 2024.

Kotlin demo project with a simple HTMX UI, demonstrating Spring AI with Ollama, Open AI and Neo4j. Shows:

RAG using Spring AI's out of the box QuestionAnswerAdvisor
Mixing LLMs in a single application. Use the right LLM for each requirement.
The power of Spring AI advisors to instrument chats in a reusable way
The power of integrating LLM calls within a Spring application by exposing Spring beans as LLM-accessible functions.

This project features the following custom advisors:

TopicGuardAdvisor: Classifies user input and short-circuits to give a canned response if it's about a banned topic
CountMentionsAdvisor: Detects when a topic is mentioned in a chat and increments an entity counter and raises an application event
SavePerformanceAdvisor: Remembers mentions of upcoming performances and saves them to the database. Extraction runs asynchronously, so it doesn't slow responding to the user.

This project illustrates the following best practices:

Externalize your prompts. Prompts should not be in Kotlin, Java, Python/whatever programming language. They should be externalized so they can be edited easily and potentially shared.
Mix multiple models, using the best (or cheapest) LLM for each task.
Enable reuse via advisors, analogous to aspects in AOP
Return entities from the LLM
Use structured persistent data as well as vector search
Write in Kotlin!

Setup

This is a standard Spring Boot project, built with Maven and written in Kotlin.

Set the OPEN_AI_API_KEY environment variable to your Open AI token, or edit ChatConfiguration.kt to switch to a different premium chat model.

Use the Docker Compose file in this project to run Neo, or otherwise change the Neo credentials in application.properties to use your own database.

Run Ollama on your machine. Make sure you've pulled the gemma2:2b model as follows:

docker pull ollama/gemma2:2b

Edit ChatConfiguration.kt to use a different Ollama model if you prefer.

Running

Start the server, either in your IDE or with mvn spring-boot:run
Go to http://localhost:8080 to see the simple chat interface

Limitations

This is a demo to illustrate the power of Spring AI advisors, so it's simplistic.

In particular:

The SavePerformanceAdvisor works off the latest user message only (although this is extracted into a strategy function)
The CountMentionsAdvisor looks for a literal string. This could easily be improved to work with a local model and exhibit deeper understanding (e.g. "the user is talking about auto service"). It's also inefficient as it scans all Mention entities on every request.
The UI is very basic

Contributions welcome.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.idea		.idea
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spring AI Demo

Setup

Running

Limitations

About

Releases

Packages

Languages

License

johnsonr/springai-demo

Folders and files

Latest commit

History

Repository files navigation

Spring AI Demo

Setup

Running

Limitations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages