ollama-handoff

Michael-WhiteCapData/ollama-handoff
★ 1 stars Python AI/LLM Updated today
MCP server that offloads cheap work from your cloud LLM agent to a local Ollama model — summaries, drafts, extractions, first-pass reviews — at zero cloud cost.
View on GitHub → Try with Claude — $10 free →

Quick Install

Copy the config for your editor. Some servers may need additional setup — check the README.

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "ollama-handoff": {
      "command": "uvx",
      "args": [
        "ollama-handoff"
      ]
    }
  }
}

Or install with pip: pip install ollama-handoff

README Excerpt

<!-- mcp-name: io.github.Michael-WhiteCapData/ollama-handoff --> **An MCP server that offloads cheap work from your cloud LLM agent to a local Ollama model.** Your frontier model (Claude, GPT, etc.) is brilliant and metered. A lot of the work it gets handed — summarizing a log, drafting a commit message, pulling every URL out of a file, a quick first-pass code review — **doesn't need frontier reasoning at all.** `ollama-handoff` exposes your local [Ollama](https://ollama.com/) instance as a hand

Tools (13)

OLLAMA_DEFAULT_MODELOLLAMA_KEEP_ALIVEOLLAMA_NUM_CTXOLLAMA_TIMEOUT_SOLLAMA_URLask_localchat_localcode_review_localdraft_commit_message_localextract_locallist_modelsserver_infosummarize_local

Topics

ai-agentsclaudellmlocal-llmmcpmodel-context-protocolollamapython