llmtrim

fkiene/llmtrim

★ 27 stars Rust AI/LLM Updated 1mo ago ✗ Risk

Local proxy that compresses your LLM API requests so you pay less, with no change to the answers. Trims wasted tokens from prompts, history, tool output, and code before they're sent: -31% input / -74% output, measured live. Any provider, no extra model calls. Also an MCP server and embeddable library (Rust, Python, Ruby, Kotlin, Swift).

View on GitHub → 🔍 Audit Wallet Slippage →

Quick Install

Copy the config for your editor. Some servers may need additional setup — check the README.

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "llmtrim": {
      "command": "cargo",
      "args": [
        "run",
        "--",
        "llmtrim"
      ]
    }
  }
}

Topics

agentic-codingaianthropicclaude-codecost-reductiondeveloper-toolsllmllmopsmcpmitm-proxyopenaipromptprompt-compressionprompt-engineeringproxy

llmtrim

Quick Install

Topics

Related AI/LLM Servers