Compress every MCP tool response before it hits your agent.
One MCP endpoint. Every tool response is ranked and trimmed before it reaches your agent. 100+ MCP tools for Claude Code, Cursor, Codex, and Gemini CLI. How we measure →
Compatible with
How a response gets compressed.
Same input, same output, every run. Four steps, no model in the loop. The output is a re-ranking of your own sentences — every token in the compressed response appears in the original.
Why the output is auditable.
Not a summary. A re-ranking of your own sentences. Documents are chunked, embedded, and scored on a semantic graph; only the highest-ranked nodes survive into the output. Typically 46% smaller on production traffic. Methodology & benchmark peak in the measurement section. measurement section.
- Three compression modes: fast / balanced / SBERT
- AST-aware code compression for 7+ languages
- Per-workspace key scoping. Keys cannot read across workspaces.
- Command Palette — Cmd+K navigation, G+D shortcuts, full-text search
- GitHub Integration — auto-compress on push with PR token summaries
- Real-Time Queue Monitor — live SSE streaming for batch jobs
- Roles: Owner, Admin, Member, Viewer. Shared projects, activity feed.
- Stacks with native prompt caching (Anthropic / OpenAI / Gemini). When both apply, total input-cost reduction can reach 95% — see methodology.
100+ MCP tools behind one endpoint.
Works with any MCP-compatible client. Claude Code, Cursor, Windsurf, VS Code — they all get the full compression layer, GitHub webhook integration, and real-time SSE monitoring. Configuration is one JSON block, shown below.
- CWE-22 path traversal prevention on all file I/O
- Async batch ingest — 4× throughput
- Prometheus metrics, OpenTelemetry tracing, health checks
gc_compress_manifestshrinks MCP tool-description bloat
{
"mcpServers": {
"gotcontext": {
"url": "https://api.gotcontext.ai/mcp",
"headers": {
"Authorization": "Bearer gc_your_key_here"
}
}
}
}Try it now
Paste any text and see how much you can save. No signup required.
Text is processed in-memory and is not stored, logged with PII, or used for training. Do not paste secrets or production credentials. Privacy details →
Pay for tool calls. Compression is included.
Every MCP tool response is compressed before it returns to your agent — so each call delivers significantly more context value per token budget. The multiplier scales with the live compression ratio (see hero). From solo developers to enterprise teams.
Free
- 1,000 compressions/month
- 100KB max document
- Standard compression
- Command Palette & shortcuts
- Activity Feed
- Dark/Light theme
- Community support
Pro
Most Popular- 50,000 compressions/month
- All 100+ MCP tools (incl. ACE, knowledge mgmt, multimodal)
- Priority queue: 2 concurrent compression slots
- 1MB max document
- Accelerated compression (3-5x faster)
- Queue Monitor (real-time SSE)
- Usage analytics
- Webhook Notifications
- Priority support
Business
- 500,000+ compressions / month
- All 100+ MCP tools
- Priority queue: 8 concurrent compression slots
- Self-hosted Docker (run in your VPC)
- OIDC federation (Okta, Auth0, Azure AD)
- Audit-log export (NDJSON/CSV) for SOC2
- SBERT embeddings (higher fidelity than the default MiniLM tier)
- SSO / SAML
- Email support · SLA on request (custom MSA)
- DPA / IP indemnity / custom MSA
Start free.
1,000 compressions/month, all 100+ tools, no credit card.