GitHub - elizaos-plugins/plugin-image: Processes and analyzes images to generate descriptions. Supports multiple providers

ImageDescriptionService

Processes and analyzes images to generate descriptions. Supports multiple providers:

Configuration:

# For OpenAI Vision
OPENAI_API_KEY=your_openai_api_key

# For Google Gemini
GOOGLE_GENERATIVE_AI_API_KEY=your_google_api_key

Provider selection:

The service automatically handles different image formats, including GIFs (first frame extraction).

Features by provider:

Local (Florence):

OpenAI Vision:

Google Gemini 1.5:

The provider can be configured through the runtime settings, allowing easy switching between providers based on your needs.

// ... existing code ...

Analyzes and generates descriptions for images.

// Example usage
const result = await runtime.executeAction("DESCRIBE_IMAGE", {
    imageUrl: "path/to/image.jpg",
});

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
dist		dist
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
.npmignore		.npmignore
README.md		README.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json
tsup.config.ts		tsup.config.ts