Claude Code skill for importing web content, YouTube videos, and PDFs into local RAG system.
๐ฎ๐น Italiano | ๐จ๐ณ ไธญๆ
Brainery enables Claude Code to import and query web content using AnythingLLM as a local RAG (Retrieval-Augmented Generation) database. Import articles, YouTube transcripts, and PDFs, then query them using natural language.
Key Features:
- ๐ Web scraping with clean markdown extraction (Crawl4AI)
- ๐บ YouTube transcripts with automatic fallback to audio transcription (yt-dlp + Whisper)
- ๐ PDF import with text extraction
- ๐ Local document parsing (PDF, DOCX, TXT, logs) via unstructured-mcp-server
- ๐ง Local RAG database for private, offline content querying (AnythingLLM)
- ๐ฐ Free LLM provider compatible with AnythingLLM (iFlow Platform - 200K context tokens free tier)
- ๐ณ Docker-based infrastructure with pre-built images
- ๐ Multilingual documentation (EN/IT/ZH)
git clone https://github.com/Tapiocapioca/brainery-containers.git
cd brainery-containers
docker-compose up -dContainer Stack:
- crawl4ai (port 9100) - Web scraping
- yt-dlp-server (port 9101) - YouTube transcripts
- whisper-server (port 9102) - Audio transcription
- anythingllm (port 9103) - RAG database
Open http://localhost:9103 and:
- Create admin account
- Configure LLM provider (recommended: iFlow Platform - free tier with 200K context tokens)
- Provider:
OpenAI Compatible - Base URL:
https://vibex.iflow.cn/v1 - Model:
glm-4.6 - Get API key from: https://iflow.cn
- Provider:
- Create workspace (e.g., "brainery")
- Generate API key in Settings โ API Keys
๐ก Why iFlow? Free tier with excellent multilingual support (EN/IT/ZH), 200K context window, and fast response times. See Installation Guide for detailed setup.
Install required MCP servers for Claude Code integration:
# Install AnythingLLM MCP server (use our fork)
npx -y @smithery/cli install @tapiocapioca/anythingllm-mcp-server --client claude
# Install other MCP servers
npx -y @smithery/cli install crawl4ai --client claude
npx -y @smithery/cli install yt-dlp --client claude
\n# Install local document parser
pip install unstructured-mcp-server๐ Note: We use a custom fork of AnythingLLM MCP server: https://github.com/Tapiocapioca/anythingllm-mcp-server
cd ~/.claude/skills
git clone https://github.com/Tapiocapioca/brainery.gitRestart Claude Code to load the skill.
In Claude Code:
Import this article into Brainery: https://example.com/article
Then query:
What are the main points in the article I just imported?
Import this technical guide: https://example.com/docker-guide
Import the transcript from: https://www.youtube.com/watch?v=VIDEO_ID
Import these articles:
1. https://example.com/article1
2. https://example.com/article2
3. https://example.com/article3
Then tell me what the common themes are.
What are the key concepts discussed in all the articles I imported today?
- Installation Guide - Complete setup instructions
- Usage Examples - Practical workflows and common scenarios
- BRAINERY_CONTEXT.md - Technical implementation details
- Italiano: Installazione | Esempi
- ไธญๆ: ๅฎ่ฃ | ไฝฟ็จ็คบไพ
Brainery uses a modular architecture with 5 Docker containers:
โโโโโโโโโโโโโโโ
โ Claude Code โ
โโโโโโโโฌโโโโโโโ
โ MCP Tools
โ
โโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโ
โผ โผ โผ โผ
โโโโโโโโโโโ โโโโโโโโโโโโ โโโโโโโโโโโ โโโโโโโโโโโโโโโโ
โCrawl4AI โ โ yt-dlp โ โ Whisper โ โ AnythingLLM โ
โ :9100 โ โ :9101 โ โ :9102 โ โ :9103 โ
โโโโโโโโโโโ โโโโโโโโโโโโ โโโโโโโโโโโ โโโโโโโโโโโโโโโโ
Web scraping YouTube Audio RAG Database
transcripts transcription
- Docker Desktop 20.10+
- Docker Compose 2.0+
- 12GB RAM minimum (16GB recommended)
- ~20GB disk space for containers and models
Default ports (9100-9104) work out-of-box. To customize, create .env file:
cd brainery-containers
cp .env.example .env
# Edit ports in .env
docker-compose up -ddocker ps --filter "name=brainery-"
docker-compose restart <service-name>Verify API key in .env file and reinitialize MCP client.
- Check container health:
curl http://localhost:9100/health - Verify network connectivity
- Check logs:
docker-compose logs <service-name>
See Installation Guide for detailed troubleshooting.
- brainery (this repo) - Claude Code skill with multilingual docs
- brainery-containers - Docker infrastructure with pre-built images on Docker Hub
Contributions welcome! Please:
- Fork the repository
- Create feature branch
- Test thoroughly
- Submit pull request
MIT License - see LICENSE file.
- Issues: GitHub Issues
- Containers: brainery-containers issues
- Documentation: See docs/ for detailed guides
Current version: 1.0.0
See CHANGELOG.md for version history.