remember() here. recall() everywhere. One memory for every agent.
A two-call surface, sub-200ms p95 recall, capability-token identity, and a governed multi-agent memory underneath. Wrap your client, add the MCP plugin, or call the SDK/REST — your subagents inherit memory automatically.
$ npx ultramemory setupfrom ultramemory import Memory
mem = Memory()
mem.remember("Acme ships on Fridays") # 202 -> { trace_id }
ctx = mem.recall("when does Acme ship?") # ranked, token-budgeted
mem.trace(ctx.trace_id) # provenance: where each fact came fromTwo calls. Three altitudes. Works with Claude Code, Cursor, Codex, Cline, and any MCP client.
remember returns a trace_id; recall returns ranked, token-budgeted context; trace shows the lineage of every fact.
Pick a door.
Route by intent — get to the right page in one click.
Pick your altitude — three ways in.
The integration story you decide on first. Start with the Proxy — it is the lowest-effort path.
Proxy ★
Wrap your model client; we auto-remember and auto-recall behind the scenes. Zero new tools on the agent.
Learn more →MCP plugin
Add to Claude Code / Cursor / Codex / Cline. Exactly two tools — recall + remember, ~200 tokens. Admin stays server-side.
Learn more →SDK + REST
Full control from TS, Python, or plain REST — buckets, promotion, identity, audit, all off the agent's context.
Learn more →
Yes, your stack.
Any MCP client works; the SDK is model-agnostic.
Plugs into
- Claude Code
- Cursor
- Codex
- Cline
- MCP
- OpenAI
- Anthropic
- Gemini
- Vercel AI SDK
The numbers devs care about.
Fast enough to sit in the hot path; clean enough that the model never gets slower or dumber.
The speed SLO we hold to — fast enough to sit in the hot path.
Our own numbers, on our own eval, until independent benchmarks land.
Connect once, subagents inherit.
Wire up the parent once. Every helper it spawns gets scoped memory automatically — no re-wiring, no human in the loop.
- parent
- subagent
- peer
- tool
Under the hood.
The plain Product pages send curious readers here.
Retrieval
Hybrid recall, RRF fusion, and the governance rerank that ranks what's current.
Learn more →Governance
Private / shared buckets, the promotion gate, and cross-agent conflict resolution.
Learn more →
A maintained, dependable platform.
What's new, what's live, and where the code lives.
Start with two calls.
remember → recall, then trace every fact back to its source.