Releases: MLidstrom/DNFileRAG
Releases · MLidstrom/DNFileRAG
v1.2.0 - Vision/Image Support
What's New
Vision/Image Support
DNFileRAG now supports PNG, JPG, JPEG, and WEBP image files. When enabled, images are processed using Ollama's llava vision model to:
- Extract visible text (OCR-like functionality)
- Generate image descriptions
- Index the content for RAG queries
New Features
IVisionTextExtractorinterface for image text extractionOllamaVisionTextExtractorimplementation using Ollama llava modelImageParserfor processing image files during ingestionVisionOptionsconfiguration section:Vision:Enabled- Enable/disable vision processingVision:Provider- Vision provider (currently: Ollama)Vision:Ollama:Model- Vision model (default: llava)
Configuration
Enable in appsettings.json:
{
"Vision": {
"Enabled": true,
"Provider": "Ollama",
"Ollama": {
"BaseUrl": "http://localhost:11434",
"Model": "llava"
}
}
}Getting Started
# Pull the llava model
ollama pull llava
# Add images to your documents folder
cp image.png src/DNFileRAG/data/documents/Documentation
- Created
CHANGELOG.mdwith version history - Updated README with image format support
- Updated architecture documentation to v1.5
Full Changelog
v1.1.0
What's New in v1.1.0
Features
- API Key Authentication - Secure your API with configurable API keys via
X-API-Keyheader - Dev Mode Bypass - Set
RequireApiKey: falsefor easy local development - Role-Based Access Control -
readerandadminroles for endpoint authorization
Improvements
- Enhanced HelpChat example with new logo and assets
- Updated documentation with correct API endpoints
- Added sample Northwind documents for testing
Bug Fixes
- Fixed API endpoint documentation inconsistencies