claude-video-vision

jordanrendric/claude-video-vision
★ 357 stars TypeScript 🤖 AI/LLM Updated 1d ago
Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction and multimodal audio analysis
View on GitHub →

Quick Install

Copy the config for your editor. Some servers may need additional setup — check the README.

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "claude-video-vision": {
      "command": "npx",
      "args": [
        "-y",
        "jordanrendric/claude-video-vision"
      ]
    }
  }
}

README Excerpt

<p align="center"> <img src="./assets/hero.avif" alt="claude-video-vision" width="100%" /> </p> Give Claude the ability to **watch and understand videos**. A Claude Code plugin that extracts frames via ffmpeg and processes audio via multiple backends (Gemini API, local Whisper, or OpenAI API). Claude receives frames as images and audio transcription with timestamps — the plugin is a **perception layer**, not an interpretation layer.

Topics

claude-codeclaude-code-pluginffmpeggeminimcpmcp-servermultimodalvideo-analysiswhisper