llmtrim

fkiene/llmtrim
★ 27 stars · Rust · AI/LLM · Updated today
Local proxy that compresses your LLM API requests so you pay less, with no change to the answers. It trims wasted tokens from prompts, history, tool output, and code before they are sent: -31% input / -74% output, measured live. It works with any provider and makes no extra model calls. Also available as an MCP server and as an embeddable library (Rust, Python, Ruby, Kotlin, Swift).
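To get a feel for what the quoted reductions could mean for a bill, here is a rough back-of-the-envelope sketch. It assumes the -31% and -74% figures are token reductions and that your provider bills per million tokens; the function name `estimate_cost` and the prices in the usage lines are hypothetical and are not part of llmtrim.

```python
def estimate_cost(input_tokens, output_tokens, input_price, output_price,
                  input_cut=0.31, output_cut=0.74):
    """Return (cost_before, cost_after) in the currency of the prices.

    Prices are per million tokens. The default cuts are the figures quoted
    in the description, read here as token reductions (an assumption).
    """
    before = (input_tokens * input_price
              + output_tokens * output_price) / 1_000_000
    after = (input_tokens * (1 - input_cut) * input_price
             + output_tokens * (1 - output_cut) * output_price) / 1_000_000
    return before, after


# Hypothetical workload and prices, for illustration only.
before, after = estimate_cost(
    input_tokens=1_000_000,
    output_tokens=1_000_000,
    input_price=1.0,   # assumed price per million input tokens
    output_price=2.0,  # assumed price per million output tokens
)
print(f"before: {before:.2f}, after: {after:.2f}")
```

Actual savings depend on your traffic mix and your provider's pricing, so treat the result as an estimate rather than a guarantee.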

Quick Install

Copy the config below into your MCP client's settings. Some servers need additional setup, so check the README before relying on it.

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "llmtrim": {
      "command": "cargo",
      "args": [
        "run",
        "--",
        "llmtrim"
      ]
    }
  }
}
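If claude_desktop_config.json already lists other servers, pasting the snippet over the file would drop them. The sketch below merges the entry instead. It is a minimal example, not an official installer: the helper names `merge_server` and `update_config_file` are hypothetical, the config file location varies by platform and must be supplied by you, and the command and args are copied as-is from this page, so confirm them against the README.

```python
import json
from pathlib import Path


def merge_server(config: dict, name: str, command: str, args: list[str]) -> dict:
    """Return a copy of `config` with one MCP server entry added or replaced."""
    merged = dict(config)
    # Copy the existing server table so other entries are preserved.
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = {"command": command, "args": list(args)}
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path) -> None:
    """Merge the llmtrim entry shown on this page into the file at `path`."""
    existing = json.loads(path.read_text()) if path.exists() else {}
    # Command and args are taken verbatim from the listing (unverified).
    updated = merge_server(existing, "llmtrim", "cargo", ["run", "--", "llmtrim"])
    path.write_text(json.dumps(updated, indent=2) + "\n")
```

Call `update_config_file(Path("/path/to/claude_desktop_config.json"))` with the real location on your machine, then restart the client so it rereads the file.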

Topics

agentic-coding, ai, anthropic, claude-code, cost-reduction, developer-tools, llm, llmops, mcp, mitm-proxy, openai, prompt, prompt-compression, prompt-engineering, proxy