MCP server that reduces LLM token usage by 80%+ through intelligent file caching, semantic diffs, and content-defined chunking.
Quick Install
Copy the config for your editor. Some servers may need additional setup — check the README.
Add to claude_desktop_config.json:
{
"mcpServers": {
"semantic-cache-mcp": {
"command": "uvx",
"args": [
"semantic-cache-mcp"
]
}
}
}
Run in terminal:
claude mcp add semantic-cache-mcp uvx semantic-cache-mcp
Add to .cursor/mcp.json:
{
"mcpServers": {
"semantic-cache-mcp": {
"command": "uvx",
"args": [
"semantic-cache-mcp"
]
}
}
}
Or install with pip: pip install semantic-cache-mcp
README Excerpt
<p align="center"> <img src="https://cdn.jsdelivr.net/gh/CoderDayton/semantic-cache-mcp@f8af5804ddc7c3fed62d6901c0c7df098a76164e/assets/logo.svg" width="128" height="128" alt="Semantic Cache MCP Logo" /> </p> <h1 align="center">Semantic Cache MCP</h1> <p align="center"> <a href="https://ko-fi.com/U7U01WTJF9">
Tools (20)
LOG_LEVELMAX_CACHE_ENTRIESMAX_CONTENT_SIZEMAX_EDIT_SIZEMAX_MATCHESMAX_WRITE_SIZESEMANTIC_CACHE_DIRTOOL_MAX_RESPONSE_TOKENSTOOL_OUTPUT_MODETOOL_TIMEOUTbatch_editbatch_readcleardeleteeditedit_previewglobgrepreadread_image
Topics
cachingclaudeembeddingsllmmcppythonsemantic-searchtoken-optimization