Spring AI Powered Local CLI Chat Bot with RAG

This project is a fully local AI chatbot built with Spring AI and RAG (Retrieval-Augmented Generation). It runs in a CLI (Command Line Interface) environment, ensuring privacy and offline functionality without external dependencies. By default, it uses Ollama for both LLM and embedding models, but it also supports other Spring AI-compatible providers. The chatbot processes various document types, including PDF, DOCX, TXT, and HTML, extracting and vectorizing text for domain-specific Q&A.

Demo

spring-ai-local-cli-chatbot

USER: Tell me about Hurricane Milton.

[ Search Results ]
===============================================
▶ 1 Document, Score: 0.74
-----------------------------------------------
Tropical Depression Fourteen Discussion Number 1 (https://
www.nhc.noaa.gov/archive/2024/al14/al142024.discus.001.shtml?) (Report). Miami, Florida:
National Hurricane Center. Archived (https://web.archive.org/web/20241005151826/https://ww
w.nhc.noaa.gov/archive/2024/al14/al142024.discus.001.shtml) from the original on October 5,
2024. Retrieved October 5, 2024.

===============================================
▶ 2 Document, Score: 0.74
-----------------------------------------------
Brown, Daniel; Blake, Eric (October 7, 2024). Hurricane Milton Update Statement (https://www.nhc.noaa.gov
...
===============================================

ASSISTANT: Hurricane Milton was a powerful Category 5 storm that formed in the Atlantic in 2023. It caused significant damage along the Gulf Coast, particularly in Florida, where it made landfall near Tampa. The storm surge and high winds led to widespread flooding and power outages. Emergency services were overwhelmed, and recovery efforts took several weeks.

Quick Start

Build and run the app:

./mvnw clean install
./mvnw spring-boot:run

Auto-configurations

Configuration Options

The following options have been added to application.properties:

# Sets the name of the Spring Boot application.
spring.application.name=spring-ai-local-cli-chatbot
# Specifies the location pattern for RAG (Retrieval-Augmented Generation) documents to be embedded.
spring.application.cli-chatbot.documents-location-pattern=classpath:wikipedia-hurricane-milton-page.pdf

RAG Document Path Pattern

The project uses PathMatchingResourcePatternResolver to generate the path pattern for RAG documents. This Spring utility allows flexible specification of document locations, enabling the application to dynamically load and embed documents for RAG-based conversations. The patterns can target resources in the classpath or the file system.

Pattern Examples

Classpath Patterns: classpath:wikipedia-hurricane-milton-page.pdf: Loads a specific PDF file from the classpath. classpath*:*.pdf: Loads all PDF files from the classpath.
File System Patterns: file:/path/to/docs/*.pdf: Loads all PDF files from the specified directory on the file system.

These patterns provide flexibility in defining which documents are used for embedding and retrieval, making the chatbot adaptable to various document sources.

AI Model

Spring AI works with Ollama for local LLM and embedding models. No API keys are required. To enable this, simply add the following dependency:

<dependency>
  <groupId>org.springframework.ai</groupId>
  <artifactId>spring-ai-starter-model-ollama</artifactId>
</dependency>

Before using Ollama with Spring AI, ensure that Ollama is installed and running on your system, refer to the Spring AI Ollama Chat Prerequisites.

To use different models for local LLM and embedding in your Spring Boot application with Ollama, you can specify them using command-line arguments. Here’s how to do it:

./mvnw spring-boot:run -Dspring-boot.run.arguments="--spring.ai.ollama.chat.options.model=llama3.2:1b --spring.ai.ollama.embedding.options.model=bge-m3"

Tips

For additional configuration options in Spring AI with Ollama:
- LLM Settings: LLM Properties Documentation
- Embedding Settings: Embedding Properties Documentation
For a list of models supported by Ollama:
- LLM Models
- Embedding Models

Implementation Details

Default System Prompt

A default system prompt can be configured via application.properties to define the assistant’s behavior and expertise. In this case, the assistant is specialized in hurricanes:

spring.application.cli-chatbot.system-prompt=You are useful assistant, expert in hurricanes.

Vector Store

The project uses Spring AI’s SimpleVectorStore, eliminating the need for external vector databases. This store is automatically configured unless a custom VectorStore bean is provided.

@Bean
@ConditionalOnMissingBean(VectorStore.class)
public VectorStore vectorStore(EmbeddingModel embeddingModel) {
  return SimpleVectorStore.builder(embeddingModel).build();
}

Document Readers

Documents are read using Apache Tika-based TikaDocumentReader, which supports multiple file formats. The documents are loaded from the locations specified in your application.properties:

spring.application.cli-chatbot.documents-location-pattern=classpath:wikipedia-hurricane-milton-page.pdf

@Bean
@ConditionalOnMissingBean(DocumentReader.class)
public DocumentReader[] documentReaders(
        @Value("${spring.application.cli-chatbot.documents-location-pattern}") String documentsLocationPattern) throws IOException {
  return Arrays.stream(new PathMatchingResourcePatternResolver().getResources(documentsLocationPattern))
          .map(TikaDocumentReader::new).toArray(DocumentReader[]::new);
}

Text Splitter

Text data is split into token-based chunks using TokenTextSplitter, which is useful for chunking documents before embedding.

@Bean
@ConditionalOnMissingBean(TextSplitter.class)
public TextSplitter textSplitter() {
  return new TokenTextSplitter();
}

Retrieval-Augmented Generation (RAG)

The RetrievalAugmentationAdvisor uses a VectorStoreDocumentRetriever to retrieve similar documents based on vector similarity. A similarity threshold of 0.50 is used to filter relevant context.

@Bean
@ConditionalOnMissingBean(RetrievalAugmentationAdvisor.class)
public RetrievalAugmentationAdvisor retrievalAugmentationAdvisor(VectorStore vectorStore) {
  return RetrievalAugmentationAdvisor.builder()
          .documentRetriever(VectorStoreDocumentRetriever.builder()
                  .similarityThreshold(0.50)
                  .vectorStore(vectorStore)
                  .build()
          ).build();
}

Chat Loop

// Start the chat loop
System.out.println("\n" + applicationName);
try (Scanner scanner = new Scanner(System.in)) {
    while (true) {
        System.out.print("\nUSER: ");
        String userMessage = scanner.nextLine();
        chatClient.prompt(userMessage).advisors(AssistantPrefixStreamAdvisor)
            .stream().content().toStream().forEach(System.out::print);
        System.out.println();
    } 
}

Creates an interactive loop for RAG chatbot interaction, streaming the assistant’s response in real-time.

Key Features

Fully Local Execution: All components run on your local machine, ensuring privacy and offline capability.
Ollama Integration: Uses Ollama for both LLM and embedding models, offering high-performance local AI processing.
Spring AI's SimpleVectorStore: Implements a lightweight vector store for efficient information retrieval without external dependencies.
Interactive Console Interface: Provides a streamlined console-based interface for real-time chatbot interaction.
Dynamic RAG Document Loading: Uses PathMatchingResourcePatternResolver for flexible document retrieval.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.mvn/wrapper		.mvn/wrapper
src/main		src/main
.gitignore		.gitignore
README.md		README.md
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Spring AI Powered Local CLI Chat Bot with RAG

Demo

Quick Start

Auto-configurations

Configuration Options

RAG Document Path Pattern

Pattern Examples

AI Model

Tips

Implementation Details

Default System Prompt

Vector Store

Document Readers

Text Splitter

Retrieval-Augmented Generation (RAG)

Chat Loop

Key Features

About

Uh oh!

Releases

Packages

Uh oh!

Languages

JM-Lab/spring-ai-local-cli-chatbot

Folders and files

Latest commit

History

Repository files navigation

Spring AI Powered Local CLI Chat Bot with RAG

Demo

Quick Start

Auto-configurations

Configuration Options

RAG Document Path Pattern

Pattern Examples

AI Model

Tips

Implementation Details

Default System Prompt

Vector Store

Document Readers

Text Splitter

Retrieval-Augmented Generation (RAG)

Chat Loop

Key Features

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages