MCP Directory

claude-video-vision

by jordanrendric · TypeScript · ★ 407

Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction and multimodal audio analysis

#claude-code #claude-code-plugin #ffmpeg #gemini #mcp #mcp-server #multimodal #video-analysis #whisper

Install

npx -y github:jordanrendric/claude-video-vision

Claude Desktop config

Add this to your claude_desktop_config.json:

{
  "mcpServers": {
    "claude-video-vision": {
      "command": "npx",
      "args": [
        "-y",
        "github:jordanrendric/claude-video-vision"
      ]
    }
  }
}

From the README

Give Claude the ability to **watch and understand videos**. A Claude Code plugin that extracts frames via ffmpeg and processes audio via multiple backends (Gemini API, local Whisper, or OpenAI API). Claude receives frames as images and audio transcription with timestamps; the plugin is a **perception layer**, not an interpretation layer.

- **Multimodal perception**: Claude sees video frames directly and reads audio transcriptions with timestamps
- **Flexible backends**: Choose between cloud APIs or fully local processing
- **Adaptive extraction**: Claude adjusts fps, time range, and reso…
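To make the frame-extraction idea concrete, here is a minimal TypeScript sketch of how an ffmpeg argument list could be assembled from a frame rate, a time range, and a target width. It is not the plugin's actual code: the `FrameExtractionOptions` interface, the `buildFfmpegArgs` function, and every option name are hypothetical, chosen only for illustration. The ffmpeg flags themselves (`-ss`, `-t`, `-i`, `-vf` with the `fps` and `scale` filters) are standard.

```typescript
// Hypothetical option names; the plugin's real parameters are not shown in the excerpt.
interface FrameExtractionOptions {
  input: string;            // path to the source video
  fps: number;              // frames to sample per second of video
  startSeconds?: number;    // optional start of the time range
  durationSeconds?: number; // optional length of the time range
  width?: number;           // optional output width in pixels
  outputPattern: string;    // e.g. "frame_%04d.jpg"
}

// Builds the argument list for an ffmpeg call that samples still frames.
function buildFfmpegArgs(opts: FrameExtractionOptions): string[] {
  const args: string[] = ["-hide_banner"];

  // Placed before -i, -ss and -t act as input options: seek first, then limit duration.
  if (opts.startSeconds !== undefined) {
    args.push("-ss", String(opts.startSeconds));
  }
  if (opts.durationSeconds !== undefined) {
    args.push("-t", String(opts.durationSeconds));
  }
  args.push("-i", opts.input);

  // The fps filter sets the sampling rate; scale=W:-2 keeps the aspect ratio
  // and rounds the computed height to an even number.
  const filters: string[] = [`fps=${opts.fps}`];
  if (opts.width !== undefined) {
    filters.push(`scale=${opts.width}:-2`);
  }
  args.push("-vf", filters.join(","), opts.outputPattern);

  return args;
}

const exampleArgs = buildFfmpegArgs({
  input: "clip.mp4",
  fps: 1,
  startSeconds: 30,
  durationSeconds: 10,
  width: 640,
  outputPattern: "frame_%04d.jpg",
});
console.log(["ffmpeg", ...exampleArgs].join(" "));
```

The returned array could be handed to Node's `child_process.spawn("ffmpeg", args)`. Lowering `fps` or `width` reduces the number and size of images sent to the model, which is presumably the trade-off behind the "adaptive extraction" described above.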

