MCP Directory

webclaw

by 0xMassi · Rust · ★ 865

Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.

#ai#ai-agents#ai-scraping#cli#crawler#data-extraction#firecrawl-alternative#html-to-markdown#llm#markdown#mcp#mcp-server#rag#rust#self-hosted#tls-fingerprinting#web-crawler#web-extraction#web-scraper#web-scraping

Install

cargo install --git https://github.com/0xMassi/webclaw.git

Claude Desktop config

Add this to your claude_desktop_config.json:

{
  "mcpServers": {
    "webclaw": {
      "command": "npx",
      "args": [
        "-y",
        "github:0xMassi/webclaw"
      ]
    }
  }
}

From the README

webclaw The fastest web scraper for AI agents. 67% fewer tokens. Sub-millisecond extraction. Zero browser overhead. Your AI agent calls and gets a 403. Or 142KB of raw HTML that burns through your token budget. **webclaw fixes both.** It extracts clean, structured content from any URL using Chrome-level TLS fingerprinting — no headless browser, no Selenium, no Puppeteer. Output is optimized for LLMs: **67% fewer tokens** than raw HTML, with metadata, links, and images preserved. --- **Self-host.** Free, AGPL-3.0, runs locally. Get the CLI, MCP server, or RES…
Read full README on GitHub →

💡 Need a managed MCP host?

Try Claude Pro for the smoothest MCP experience, or browse our cloud-hosted servers.

Related developer tools servers