FAQ
Frequently asked questions
Everything about persistent memory for AI agents — how it works, what it connects to, what it costs, and how your data is handled. Still stuck? Talk to us.
Basics
General
Mnemoverse is a persistent memory API for AI agents. You store preferences, decisions, lessons, and context once, and recall them from any tool — Claude Code, Cursor, VS Code, Windsurf, ChatGPT, Python, or any HTTP client — with a single API key.
A vector database does similarity search over stored embeddings — nothing learns or changes. Mnemoverse is a memory: it forms associations between concepts (Hebbian), tunes recall from outcome feedback (Rescorla-Wagner), consolidates and forgets over time (HDBSCAN clustering with Von Restorff protection for distinctive items), and expands queries automatically. It improves with use.
They are complementary. RAG answers "what do the documents say?" by retrieving chunks of a fixed corpus. Mnemoverse answers "what did we discuss, decide, or learn last time?" — it accumulates user- and project-specific experience across sessions. Many teams use both.
The client libraries (Python SDK, MCP server) are open source under MIT. The memory engine itself is a hosted service. You can self-host on Enterprise plans.
Integrations
Tools & integrations
One API key works across Claude Code, Claude Desktop, Cursor, VS Code, Windsurf, ChatGPT, the Python SDK, and any HTTP client. The same memories are available everywhere.
One command: claude mcp add mnemoverse -e MNEMOVERSE_API_KEY=mk_live_YOUR_KEY -- npx -y @mnemoverse/mcp-memory-server@latest. The MCP server exposes six memory tools to the agent.
Yes — via Custom GPT Actions against the REST API, or any client that can make HTTPS calls. See the docs for the OpenAPI spec.
Yes: pip install mnemoverse (live on PyPI). It wraps write, read, feedback, stats, and consolidation with typed errors.
@mnemoverse/mcp-memory-server is an open-source Model Context Protocol server (listed on the MCP Registry) that gives any MCP-capable agent six memory tools over a standard stdio transport.
Pricing
Pricing & plans
Yes. Free is $0: 1,000 queries/day, 10,000 atoms, 60 requests/minute — no credit card required.
Any read or write call to the memory API. Daily limits reset every 24 hours; rate limits are per minute. Paid plans raise all three limits.
Yes. Plans are month-to-month (Pro $29/mo, Team $149/mo) and you can change or cancel at any time from the console.
Custom limits, dedicated infrastructure, SSO/SAML, audit logs, data-residency options, self-hosting, and a custom SLA. Contact sales to scope it.
Security
Privacy & security
Every tenant's memories live under a separate org identifier; cross-tenant reads return empty. Requests are authenticated per API key.
No. We do not train models on your stored memories.
Keys are hashed with SHA-256 before storage and compared in constant time to prevent timing attacks. The raw key is shown once at creation.
On managed PostgreSQL 17 with pgvector (US region today). Enterprise plans can choose data residency and self-hosting.
Under the hood
How the memory works
Concepts that are recalled together form and strengthen Hebbian associations, so related memories surface together over time. Outcome feedback (a +1/-1 signal when a recalled memory was useful) tunes future recall via a Rescorla-Wagner update.
Yes — deliberately. A consolidation step (HDBSCAN clustering) merges redundant memories, while Von Restorff protection keeps distinctive, singleton items from being lost. This keeps recall sharp instead of letting noise accumulate.
Yes. The underlying SLoD approach is published (arXiv:2603.08965) and was presented at the GRAAI workshop at IEEE WCCI 2026. Mnemoverse is benchmarked on public memory benchmarks including LoCoMo and LongMemEval.
Didn't find your answer? Send us a note or read the documentation.