Secure · Auditable · Programmable Memory for AI Agents
See It In Action · Quick Start · Modifying Config · Architecture · API Reference · Why Memoria?
Memoria is a persistent memory layer for AI agents with Git-level version control. Every memory change is tracked, auditable, and reversible — snapshots, branches, merges, and time-travel rollback, all powered by MatrixOne's native Copy-on-Write engine.
```mermaid
%%{init: {'theme': 'base', 'themeVariables': {
  'primaryColor': '#0A2540',
  'primaryTextColor': '#E0F7FF',
  'primaryBorderColor': '#00D4FF',
  'lineColor': '#00A3CC',
  'secondaryColor': '#1E3A5F',
  'tertiaryColor': '#00D4FF'
}}}%%
graph TD
    A[AI Agent] -->|MCP Protocol| B[Memoria Core]
    B --> C[Canonical Storage<br/>Single Source of Truth]
    B --> D[Retrieval Strategy<br/>Pluggable Search]
    C --> E[Git-for-Data Engine]
    E --> F[MatrixOne]
    subgraph "Security Layer"
        G[Snapshot & Branch<br/>Zero-Copy Isolation]
        H[Audit & Provenance<br/>Full Traceability]
        I[Self-Governance<br/>Contradiction Detection]
    end
    B --> G
    B --> H
    B --> I
    classDef core fill:#0A2540,stroke:#00D4FF,stroke-width:3px,color:#E0F7FF,rx:15,ry:15;
    classDef storage fill:#1E3A5F,stroke:#00A3CC,stroke-width:2px,color:#E0F7FF;
    classDef strategy fill:#1E3A5F,stroke:#00D4FF,stroke-width:2px,color:#E0F7FF;
    classDef engine fill:#00A3CC,stroke:#00D4FF,color:#0A2540;
    class A,B core;
    class C,D storage;
    class E engine;
    class G,H,I strategy;
```
Core Capabilities:
- Cross-conversation memory — preferences, facts, and decisions persist across sessions
- Semantic search — retrieves memories by meaning, not just keywords
- Git for Data — zero-copy branching, instant snapshots, point-in-time rollback
- Audit trail — every memory mutation has a snapshot + provenance chain
- Self-maintaining — built-in governance detects contradictions, quarantines low-confidence memories
- Private by default — local embedding model option, no data leaves your machine
Supported Agents: Kiro · Cursor · Claude Code · Any MCP-compatible agent
Storage Backend: MatrixOne — Distributed database with native vector indexing
| Capability | Memoria | Letta / Mem0 / Traditional RAG |
|---|---|---|
| Git-level version control | Native zero-copy snapshots & branches | File-level or none |
| Isolated experimentation | One-click branch, merge after validation | Manual data duplication |
| Audit trail | Full snapshot + provenance on every mutation | Limited logging |
| Semantic retrieval | Vector + full-text hybrid search | Vector only |
| Multi-agent sharing | Shared trusted memory pool per user | Siloed per agent |
| Migration cost | Zero — all state in MatrixOne | Export/import required |
Here's a real setup conversation in Cursor — just tell the agent what you want, and it handles the rest:
You: I want to set up Memoria for persistent memory.
Cursor: Before I start, a few quick questions:
1. Which AI tool are you using? (Cursor, Kiro, Claude Code, or multiple?)
2. Do you have a MatrixOne database running? If not, I can start one with
Docker, or you can use MatrixOne Cloud (free tier, no Docker needed).
3. Do you have an embedding service? (OpenAI, SiliconFlow, Ollama, etc.)
If not, I'll use a local model — but it downloads ~900MB on first use.
You: Cursor. No database yet. I have a SiliconFlow API key.
Cursor: Got it. I'll start MatrixOne with Docker first — running:
docker compose up -d
You: [output] Container memoria-matrixone started
Cursor: MatrixOne is initializing, takes about 30 seconds on first start.
While we wait — what's your SiliconFlow API key and which model do you use?
(e.g. BAAI/bge-m3)
You: key is sk-xxxx, model BAAI/bge-m3
Cursor: Database is ready. Installing Memoria now:
pip install mo-memoria
You: [output] Successfully installed memoria-0.1.0
Cursor: Now I'll create the MCP config for Cursor. Edit .cursor/mcp.json:
{
"mcpServers": {
"memoria": {
"command": "memoria-mcp",
"args": ["--db-url", "mysql+pymysql://root:111@localhost:6001/memoria",
"--user", "alice"],
"env": {
"EMBEDDING_PROVIDER": "openai",
"EMBEDDING_BASE_URL": "https://api.siliconflow.cn/v1",
"EMBEDDING_API_KEY": "sk-xxxx",
"EMBEDDING_MODEL": "BAAI/bge-m3",
"EMBEDDING_DIM": "1024"
}
}
}
}
Please restart Cursor to load the MCP server, then come back.
You: Restarted. Does it work?
Cursor: Let me check by calling memory_retrieve("test"):
← No relevant memories found.
Everything looks good. Try telling me a preference — like your test framework
or code style — and I'll remember it across all future conversations.
`memoria-mcp` runs in two modes — pick one before starting:

| | Embedded mode | Remote mode |
|---|---|---|
| Flag | `--db-url` | `--api-url` + `--token` |
| You run | MatrixOne DB (Docker or cloud) | Nothing — connect to an existing server |
| Embedding | Configure in `env` block | Server handles it |
| When | Personal setup, local dev, self-hosted | Team/SaaS — admin gives you a URL + token |
```shell
git clone https://github.com/matrixorigin/Memoria.git
cd Memoria
docker compose up -d
# Wait ~30-60s for first-time initialization
```

Or use `docker run` directly:

```shell
docker run -d --name matrixone -p 6001:6001 -v ./data/matrixone:/mo-data --memory=2g matrixorigin/matrixone:latest
```

See `docker-compose.yml` for configuration options. Don't want Docker? Use MatrixOne Cloud (free tier).
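If you script the wait instead of sleeping, a minimal readiness probe might look like the sketch below. It uses only Python's standard library and is not part of Memoria; 6001 is MatrixOne's default port from the compose setup above.

```python
import socket
import time

def wait_for_port(host: str, port: int, timeout: float = 60.0) -> bool:
    """Poll until the TCP port accepts connections (MatrixOne listens on 6001)."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A completed TCP handshake means the server is accepting connections.
            with socket.create_connection((host, port), timeout=2.0):
                return True
        except OSError:
            time.sleep(1.0)  # not up yet, retry until the deadline
    return False

if __name__ == "__main__":
    print(wait_for_port("localhost", 6001, timeout=60.0))
```

Note that "port open" only means the server process is listening; first-time schema initialization may still take a few extra seconds.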
```shell
pip install mo-memoria

# Only needed if using the local embedding model (no external API):
pip install "mo-memoria[local-embedding]"  # local sentence-transformers (~900MB download)

# If no NVIDIA GPU is available, install CPU-only PyTorch first to avoid large CUDA dependencies:
pip install torch --index-url https://download.pytorch.org/whl/cpu
pip install "mo-memoria[local-embedding]"
```

Run `memoria init` in your project directory — it auto-detects Kiro / Cursor / Claude and writes the MCP config + steering rules:
```shell
cd your-project

# Local DB (default)
memoria init

# With custom DB URL
memoria init --db-url "mysql+pymysql://root:111@localhost:6001/memoria"

# Remote Memoria server (SaaS / team deployment)
memoria init --api-url "https://your-server:8100" --token "sk-your-key..."

# With OpenAI-compatible embedding
memoria init --embedding-provider openai \
  --embedding-base-url https://api.siliconflow.cn/v1 \
  --embedding-api-key sk-... \
  --embedding-model BAAI/bge-m3 \
  --embedding-dim 1024
```

This creates:
- Kiro: `.kiro/settings/mcp.json` + `.kiro/steering/memory.md`
- Cursor: `.cursor/mcp.json` + `.cursor/rules/memory.mdc`
- Claude: `.mcp.json` + `CLAUDE.md`
Then restart your AI tool — database tables are created automatically when the MCP server starts.
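The `--db-url` value is a standard SQLAlchemy-style URL. A quick sanity check can be done with Python's standard library alone; this helper is illustrative, not part of Memoria:

```python
from urllib.parse import urlsplit

def check_db_url(url: str) -> dict:
    """Split a SQLAlchemy-style DB URL into parts for a quick sanity check."""
    parts = urlsplit(url)
    if not parts.scheme.startswith("mysql"):
        # MatrixOne speaks the MySQL wire protocol, so the driver must be a mysql variant.
        raise ValueError("expected a mysql+<driver>:// URL")
    return {
        "driver": parts.scheme,               # e.g. mysql+pymysql
        "host": parts.hostname,               # e.g. localhost
        "port": parts.port,                   # MatrixOne's default is 6001
        "database": parts.path.lstrip("/"),   # e.g. memoria
    }

print(check_db_url("mysql+pymysql://root:111@localhost:6001/memoria"))
# → {'driver': 'mysql+pymysql', 'host': 'localhost', 'port': 6001, 'database': 'memoria'}
```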
```shell
memoria status   # check config files and rule versions
```

Or ask your AI tool: "Do you have memory tools available?" — it should list `memory_store`, `memory_retrieve`, etc.
`memoria init` generates the config once. To change settings afterwards, edit the config file directly:

- Kiro: `.kiro/settings/mcp.json`
- Cursor: `.cursor/mcp.json`
- Claude: `.mcp.json`
Switch from local DB to remote server:

```json
{
  "mcpServers": {
    "memoria": {
      "command": "memoria-mcp",
      "args": ["--api-url", "https://your-server:8100", "--token", "sk-your-key..."]
    }
  }
}
```

Change embedding provider (edit the `env` block):
```json
{
  "mcpServers": {
    "memoria": {
      "command": "memoria-mcp",
      "args": ["--db-url", "mysql+pymysql://root:111@localhost:6001/memoria", "--user", "alice"],
      "env": {
        "EMBEDDING_PROVIDER": "openai",
        "EMBEDDING_BASE_URL": "https://api.siliconflow.cn/v1",
        "EMBEDDING_API_KEY": "sk-...",
        "EMBEDDING_MODEL": "BAAI/bge-m3",
        "EMBEDDING_DIM": "1024"
      }
    }
  }
}
```

Re-run `init` to overwrite (use `--force` to also overwrite customized steering rules):

```shell
memoria init --api-url "https://new-server:8100" --token "sk-new-key..."
# steering rules are preserved unless --force is passed
```

Update steering rules only (after upgrading Memoria):

```shell
pip install --upgrade mo-memoria
memoria update-rules
# restart your AI tool
```

Restart your AI tool after any config change.
```shell
cd your-project
memoria init --tool kiro
```

Or manually create `.kiro/settings/mcp.json`:

```json
{
  "mcpServers": {
    "memoria": {
      "command": "memoria-mcp",
      "args": ["--db-url", "mysql+pymysql://root:111@localhost:6001/memoria", "--user", "alice"]
    }
  }
}
```

Then copy `docs/steering/memory.md` into your project's `.kiro/steering/` directory as `memory.md`. Restart Kiro.
```shell
cd your-project
memoria init --tool cursor
```

Or manually create `.cursor/mcp.json` (same structure as above). Restart Cursor.
```shell
cd your-project
memoria init --tool claude
```

Or manually edit `claude_desktop_config.json` (same structure). Restart Claude Desktop.
Memoria needs an embedding model to vectorize memories for semantic search.
| Provider | Quality | Privacy | Cost | First-use latency | Ongoing latency |
|---|---|---|---|---|---|
| Local (default) | Good | ✅ Data never leaves machine | Free | ~900MB download + a few seconds to load on first query | Fast (in-process) |
| OpenAI / SiliconFlow | Better | API key required | Pay-per-use | None | Network round-trip |
| Custom service | Varies | Depends on host | Self-hosted | None | Network round-trip |
Configure via environment variables in the MCP config env block:
```json
"env": {
  "EMBEDDING_PROVIDER": "openai",
  "EMBEDDING_BASE_URL": "https://api.siliconflow.cn/v1",
  "EMBEDDING_API_KEY": "sk-...",
  "EMBEDDING_MODEL": "BAAI/bge-m3",
  "EMBEDDING_DIM": "1024"
}
```

Leave all empty to use local embedding (`all-MiniLM-L6-v2`, dim=384).
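`EMBEDDING_DIM` must match what the model actually returns, since the dimension gets locked into the database schema once the MCP server starts. A sketch of checking an OpenAI-style `/embeddings` response before committing to a config (the helper name is illustrative, not a Memoria API):

```python
def embedding_dim_matches(response: dict, expected_dim: int) -> bool:
    """Compare the vector length in an OpenAI-style /embeddings response to EMBEDDING_DIM."""
    vector = response["data"][0]["embedding"]
    return len(vector) == expected_dim

# A trimmed-down response shape; real BAAI/bge-m3 vectors have 1024 floats.
sample = {"data": [{"embedding": [0.0] * 1024}]}
print(embedding_dim_matches(sample, 1024))
# → True
```

Run one test request against your provider and feed the parsed JSON through a check like this before writing the config.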
💡 Local Embedding Tips:

If you are using the local provider (default), Memoria will download the model from Hugging Face on the first run.

- Mirroring: If `huggingface.co` is slow or blocked, set `HF_ENDPOINT=https://hf-mirror.com`.
- Offline Mode: To run completely offline, first run Memoria once with internet access to cache the model, then set `HF_HUB_OFFLINE=1` and `TRANSFORMERS_OFFLINE=1`. Do not set these before the model is cached, or it will fail.
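For example, the two tips above as shell exports, set in the environment that launches the MCP server (values taken directly from the tips; no other settings are implied):

```shell
# Use a mirror when huggingface.co is slow or blocked:
export HF_ENDPOINT=https://hf-mirror.com

# Only after the model has been cached once, lock Memoria offline:
export HF_HUB_OFFLINE=1
export TRANSFORMERS_OFFLINE=1
```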
```
┌─────────────┐    MCP (stdio)      ┌──────────────────────────────────────┐     SQL      ┌────────────┐
│ Kiro /      │ ◄─────────────────► │ Memoria MCP Server                   │ ◄──────────► │ MatrixOne  │
│ Cursor /    │  store / retrieve   │  ├── Canonical Storage               │   vector +   │ Database   │
│ Claude Code │                     │  ├── Retrieval (vector / semantic)   │   fulltext   │            │
│ Any Agent   │                     │  └── Git-for-Data (snap/branch/merge)│              │            │
└─────────────┘                     └──────────────────────────────────────┘              └────────────┘
```
Memoria exposes MCP tools that your AI tool calls automatically based on steering rules. You can also invoke them directly.
| Tool | Description |
|---|---|
| `memory_store` | Store a new memory |
| `memory_retrieve` | Retrieve relevant memories for a query (call at conversation start) |
| `memory_correct` | Update an existing memory with new content (by ID or semantic search) |
| `memory_purge` | Delete by ID, comma-separated batch IDs, or bulk-delete by topic keyword |
| `memory_search` | Semantic search across all memories |
| `memory_profile` | Get the user's memory-derived profile summary |
| Tool | Description |
|---|---|
| `memory_snapshot` | Create a named snapshot of the current memory state |
| `memory_snapshots` | List snapshots with pagination (`limit`, `offset`); shows total count |
| `memory_snapshot_delete` | Delete snapshots by name(s), prefix, or age; supports batch deletion |
| `memory_rollback` | Restore memories to a previous snapshot |
| Tool | Description |
|---|---|
| `memory_branch` | Create a new branch for isolated experimentation (optionally from a snapshot or point in time) |
| `memory_branches` | List all branches |
| `memory_checkout` | Switch to a different branch (shows up to `top_k` memories after switching) |
| `memory_merge` | Merge a branch back into main |
| `memory_diff` | Preview what would change on merge (LCA-based diff with semantic classification) |
| `memory_branch_delete` | Delete a branch |
| Tool | Description |
|---|---|
| `memory_governance` | Quarantine low-confidence memories, clean stale data (1h cooldown) |
| `memory_consolidate` | Detect contradictions, fix orphaned graph nodes (30min cooldown) |
| `memory_reflect` | Synthesize high-level insights from memory clusters via LLM (2h cooldown) |
| `memory_extract_entities` | Extract named entities and build the entity graph (proactive) |
| `memory_link_entities` | Write entity links from your own extraction results |
| `memory_rebuild_index` | Rebuild the IVF vector index for a table |
| Type | What it stores | Example |
|---|---|---|
| `semantic` | Project facts, technical decisions | "This project uses Go 1.22 with modules" |
| `profile` | User/agent preferences | "Always use pytest, never unittest" |
| `procedural` | How-to knowledge, workflows | "To deploy: run `make build` then `kubectl apply`" |
| `working` | Temporary context for the current task | "Currently refactoring the auth module" |
| `tool_result` | Results from tool executions | Cached command outputs |
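The `type` argument must be one of the five names above. A tiny pre-flight check one might run before calling `memory_store` (the helper and its name are illustrative, not part of Memoria's API):

```python
# The five memory types from the table above.
VALID_TYPES = {"semantic", "profile", "procedural", "working", "tool_result"}

def validate_memory(content: str, mem_type: str) -> tuple:
    """Reject empty content or an unknown memory type before storing."""
    if not content.strip():
        raise ValueError("memory content must be non-empty")
    if mem_type not in VALID_TYPES:
        raise ValueError(f"unknown memory type: {mem_type!r}")
    return (content, mem_type)

print(validate_memory("Always use pytest, never unittest", "profile"))
# → ('Always use pytest, never unittest', 'profile')
```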
You: "I prefer tabs over spaces, and always use black for formatting"
AI: → calls memory_store("User prefers tabs over spaces, uses black for formatting", type="profile")
... next conversation ...
You: "Format this Python file"
AI: → calls memory_retrieve("format python file")
← gets: [profile] User prefers tabs over spaces, uses black for formatting
→ formats with black, uses tabs
You: "Actually, I switched to ruff instead of black"
AI: → calls memory_correct(query="formatting tool", new_content="User uses ruff for formatting", reason="switched from black")
(finds the memory about black via semantic search, corrects it — no memory_id needed)
You: "Take a snapshot before we refactor the database layer"
AI: → calls memory_snapshot(name="before_db_refactor", description="pre-refactor state")
← "Snapshot 'before_db_refactor' created."
... refactoring goes wrong ...
You: "Roll back to before the refactor"
AI: → calls memory_rollback(name="before_db_refactor")
← "Rolled back to snapshot 'before_db_refactor'."
You: "Create a memory branch to evaluate switching from PostgreSQL to SQLite"
AI: → calls memory_branch(name="eval_sqlite")
→ calls memory_checkout(name="eval_sqlite")
← "Switched to branch 'eval_sqlite'. 42 memories on this branch."
You: "We're now using SQLite instead of PostgreSQL"
AI: → calls memory_store("Project uses SQLite for persistence", type="semantic")
(stored on eval_sqlite only — main is untouched)
You: "Merge it"
AI: → calls memory_diff(source="eval_sqlite") ← preview first
→ calls memory_merge(source="eval_sqlite", strategy="replace")
← "Merged 3 memories from 'eval_sqlite' (skipped 0)."
memory_branch also supports branching from a past point in time:
AI: → calls memory_branch(name="debug", from_timestamp="2026-03-11 10:00:00")
(must be within the last 30 minutes)
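To build a valid `from_timestamp` string, something like the sketch below works; the helper is illustrative, and only the timestamp format is taken from the example above. The 30-minute window is enforced server-side, so the check here is just a courtesy.

```python
from datetime import datetime, timedelta

def recent_timestamp(minutes_ago: int = 10) -> str:
    """Format a point-in-time string for memory_branch's from_timestamp argument."""
    if not 0 <= minutes_ago <= 30:
        # from_timestamp must be within the last 30 minutes (see note above).
        raise ValueError("point-in-time must be within the last 30 minutes")
    ts = datetime.now() - timedelta(minutes=minutes_ago)
    return ts.strftime("%Y-%m-%d %H:%M:%S")

print(recent_timestamp(10))  # e.g. "2026-03-11 10:00:00"
```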
| Command | Description |
|---|---|
| `memoria-mcp --db-url <url> --user <id>` | Start MCP server in embedded mode (direct DB) |
| `memoria-mcp --api-url <url> --token <key>` | Start MCP server in remote mode (proxy to REST API) |
| `memoria-mcp --transport sse` | Start with SSE transport instead of stdio |
Integration quality depends on your AI agent's reasoning ability and steering rules. Out-of-the-box behavior may not be optimal.
If memory usage feels suboptimal, edit the steering rules in .kiro/steering/memory.md, .cursor/rules/memory.mdc, or CLAUDE.md to be more explicit. For example, if your agent forgets to retrieve memories at conversation start:
```
CRITICAL: At the start of EVERY conversation, call memory_retrieve with the user's first message.
```

Memoria uses the Model Context Protocol (MCP) standard. Any MCP-compatible agent can integrate by pointing to the server:
```json
{
  "mcpServers": {
    "memoria": {
      "command": "memoria-mcp",
      "args": ["--db-url", "mysql+pymysql://root:111@localhost:6001/memoria", "--user", "alice"],
      "env": {
        "EMBEDDING_PROVIDER": "openai",
        "EMBEDDING_API_KEY": "sk-...",
        "EMBEDDING_MODEL": "BAAI/bge-m3",
        "EMBEDDING_DIM": "1024"
      }
    }
  }
}
```

Or in remote mode (proxy to a deployed Memoria REST API):
```json
{
  "mcpServers": {
    "memoria": {
      "command": "memoria-mcp",
      "args": ["--api-url", "https://memoria-host:8100", "--token", "sk-your-key..."]
    }
  }
}
```

If the database connection fails, check that MatrixOne is running:

```shell
docker ps | grep matrixone
# If not running:
docker start matrixone
```

If the local embedding model is missing, install the extra:

```shell
pip install "mo-memoria[local-embedding]"
```

A slow first query is expected with local embedding — the model loads into memory on first query (~3-5s). Use an embedding service to avoid this by setting `EMBEDDING_PROVIDER=openai` in the MCP config `env` block.
- Verify `memoria-mcp` is in PATH: `which memoria-mcp`
- Restart the AI tool after editing the MCP config
- Test the server directly:

```shell
memoria-mcp --db-url "mysql+pymysql://root:111@localhost:6001/memoria"
```
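The PATH check above can also be done from a setup script; a generic sketch using the standard library, not a Memoria utility:

```python
import shutil

def on_path(executable: str) -> bool:
    """Return True if the executable can be found on PATH (what `which` does)."""
    return shutil.which(executable) is not None

# After `pip install mo-memoria`, this should print True in the same environment:
print(on_path("memoria-mcp"))
```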
```shell
# Start MatrixOne + API
make start

# In another terminal, configure your AI tool for remote mode:
cd your-project
memoria init --api-url "http://localhost:8100" --token "test-master-key-for-docker-compose"
# Restart your AI tool
```

Or use embedded mode (direct DB, no API):

```shell
cd your-project
memoria init --db-url "mysql+pymysql://root:111@localhost:6001/memoria"
```

```shell
make test-unit     # Unit tests (no DB)
make test          # All tests (needs DB)
make test-mcp      # MCP server tests
make test-all-cov  # All tests with coverage
```

```shell
make bump-version BUMP=patch   # 0.1.1 → 0.1.2
git add -A && git commit -m "chore: release v0.1.2"
git tag v0.1.2
git push && git push --tags    # Triggers PyPI publish
```

To test the published package from TestPyPI before the official release:

```shell
pip install --index-url https://mirrors.aliyun.com/pypi/simple/ \
  --extra-index-url https://test.pypi.org/simple/ \
  -U mo-memoria
```

Note: `--index-url` points to a mirror for dependencies (TestPyPI doesn't host all deps); `--extra-index-url` pulls `mo-memoria` itself from TestPyPI.
If you're an AI agent helping a user set up Memoria, ask before acting:

- Which AI tool? Kiro / Cursor / Claude Desktop?
- MatrixOne database? Already have one → get URL. Local Docker → `docker compose up -d`. MatrixOne Cloud → no Docker needed.
- Embedding? ⚠️ Ask BEFORE installation. Once the MCP server starts, the embedding dimension is locked into the database schema. Already have a service (OpenAI/SiliconFlow/Ollama)? Use it. Otherwise local is fine.
- Install: `pip install mo-memoria` (add `[local-embedding]` if no external embedding service)
- Configure: Run `memoria init` in the user's project directory (see Quick Start for flags). It auto-detects the tool and writes MCP config + steering rules.
- Tell the user to restart their AI tool
- Verify: Ask the AI tool to call `memory_retrieve("test")` — it should return "No relevant memories found"
Apache-2.0 © MatrixOrigin