An model context protocol (MCP) server that extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure. More about MCP.
- Removes ads, navigation, footers and other non-essential content
- Converts clean HTML into well-formatted Markdown (also uses Turndown)
- Returns article metadata (title, excerpt, byline, site name)
- Handles errors gracefully
Unlike simple fetch requests, this server:
- Extracts only relevant content using Mozilla's Readability algorithm
- Eliminates noise like ads, popups, and navigation menus
- Reduces token usage by removing unnecessary HTML/CSS
- Provides consistent Markdown formatting for better LLM processing
- Includes useful metadata about the content
To install Mozilla Readability Parser for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install server-moz-readability --client claude
npm install server-moz-readability
Fetches and transforms webpage content into clean Markdown.
Arguments:
{
"url": {
"type": "string",
"description": "The website URL to parse",
"required": true
}
}
Returns:
{
"title": "Article title",
"content": "Markdown content...",
"metadata": {
"excerpt": "Brief summary",
"byline": "Author information",
"siteName": "Source website name"
}
}
Add to your claude_desktop_config.json
:
{
"mcpServers": {
"readability": {
"command": "npx",
"args": ["-y", "server-moz-readability"]
}
}
}
- @mozilla/readability - Content extraction
- turndown - HTML to Markdown conversion
- jsdom - DOM parsing
- axios - HTTP requests
MIT