
Added Ollama and OpenAI FastAPI services #482

Merged
merged 18 commits into HKUDS:main on Dec 20, 2024

Conversation

ParisNeo
Contributor

Add FastAPI Services for LightRAG Integration

Overview

Added two FastAPI services that provide REST API endpoints for utilizing LightRAG in distributed applications:

  1. Ollama-based FastAPI service
  2. OpenAI-based FastAPI service

These services enable easy integration of LightRAG capabilities into existing applications through HTTP endpoints.

Key Features

Common Features

  • Multiple search modes (naive, local, global, hybrid)
  • Streaming and non-streaming query responses
  • Document management (insert, upload, batch processing)
  • Health monitoring endpoints
  • Automatic API documentation (Swagger/ReDoc)
  • Configurable working directories
  • Asynchronous operation support

Ollama Service Specific

  • Configurable Ollama host
  • Support for various Ollama models
  • Default integration with mistral-nemo and bge-m3
  • Adjustable async operation limits
  • Custom embedding dimensions

OpenAI Service Specific

  • OpenAI API integration
  • Support for latest GPT models
  • text-embedding-3-large integration
  • Automatic embedding dimension detection
  • nest-asyncio implementation for better async handling

Configuration Options

Ollama Service

--host: Server host (default: 0.0.0.0)
--port: Server port (default: 9621)
--model: LLM model name (default: mistral-nemo:latest)
--embedding-model: Embedding model (default: bge-m3:latest)
--ollama-host: Ollama host URL
--working-dir: RAG storage location
--max-async: Maximum concurrent operations
--max-tokens: Token limit
--embedding-dim: Embedding dimensions
--max-embed-tokens: Embedding token limit
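
For example, the Ollama service could be launched with these options as follows (a sketch only: the script name is a placeholder, and the actual entry point depends on where the service lives in the repository; http://localhost:11434 is Ollama's standard default host):

python lightrag_ollama_server.py \
    --host 0.0.0.0 \
    --port 9621 \
    --model mistral-nemo:latest \
    --embedding-model bge-m3:latest \
    --ollama-host http://localhost:11434 \
    --working-dir ./rag_storage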

OpenAI Service

--host: Server host (default: 0.0.0.0)
--port: Server port (default: 9621)
--model: OpenAI model (default: gpt-4)
--embedding-model: Embedding model (default: text-embedding-3-large)
--working-dir: RAG storage location
--max-tokens: Token limit
--max-embed-tokens: Embedding token limit
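
The OpenAI service can be launched similarly (again a sketch with a placeholder script name; it is assumed the service picks up the key from the standard OPENAI_API_KEY environment variable, as the OpenAI client library does):

export OPENAI_API_KEY="sk-..."
python lightrag_openai_server.py \
    --port 9621 \
    --model gpt-4 \
    --embedding-model text-embedding-3-large \
    --working-dir ./rag_storage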

API Endpoints

Both services provide:

  • /query: Document querying
  • /query/stream: Streaming responses
  • /documents/text: Text insertion
  • /documents/file: File upload
  • /documents/batch: Batch file processing
  • /documents/scan: Directory scanning
  • /health: System status
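
For example, /health can serve as a simple liveness probe (the response schema is not specified here, so this only confirms the service is reachable):

curl "http://localhost:9621/health"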

Usage Examples

Query Example

curl -X POST "http://localhost:9621/query" \
    -H "Content-Type: application/json" \
    -d '{"query": "Your question", "mode": "hybrid"}'

Document Upload

curl -X POST "http://localhost:9621/documents/file" \
    -F "file=@document.txt"

Testing

  • Tested with various document sizes
  • Verified streaming functionality
  • Confirmed batch processing capabilities
  • Validated error handling
  • Checked memory management

Documentation

  • Included detailed README for both services
  • Added API documentation
  • Provided configuration guides
  • Included usage examples

Future Improvements

  • Add support for more model providers
  • Implement caching mechanisms
  • Add authentication/authorization
  • Enhance error handling
  • Add monitoring metrics

Dependencies

  • FastAPI
  • Uvicorn
  • LightRAG
  • Pydantic
  • OpenAI/Ollama clients
  • Python 3.8+

This PR significantly enhances LightRAG's usability in distributed environments and makes it easier to integrate with existing applications.

@LarFii
Collaborator

LarFii commented Dec 19, 2024

Thanks for your contribution! But there are some linting errors. Please make sure to run pre-commit run --all-files before submitting to ensure all linting checks pass.

@ParisNeo
Contributor Author

Hi there. Thanks a lot for answering me.
I just ran the linting fix.

Best regards

@ParisNeo
Contributor Author

By the way, why don't you set this up as a GitHub Action so linting is applied to the project automatically?

@LarFii
Collaborator

LarFii commented Dec 20, 2024

Thank you again for your incredible contribution! It seems that automation can't fix all the linting errors, so some manual adjustments are still needed.

@LarFii LarFii merged commit e5dc186 into HKUDS:main Dec 20, 2024
1 check passed
@ParisNeo
Contributor Author

You are welcome.
I even added LightRAG to my tool lollms as a service, so a user can set up a server with LightRAG and then use lollms as a front end to chat with their AI over their vectorized data.

For now there are Ollama and OpenAI backends, but I'll add more services for other backends.

Thanks for accepting my contribution.
Best regards
