Open Source · Local-First · MIT License

Persistent Memory
for Claude Code

Give Claude Code long-term memory that survives across sessions. Hybrid BM25 + vector search, auto-clustering, and confidence scoring — all stored locally.

$ git clone https://github.com/MIMI180306/claude-persistent-memory.git

AI memory that
actually works

A complete memory lifecycle — from structured storage to intelligent retrieval, clustering, and time decay.

01

Hybrid Search

BM25 full-text (FTS5) combined with vector semantic similarity (sqlite-vec). Fused ranking for precise memory retrieval.
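The fused ranking can be sketched with reciprocal rank fusion over the two result lists; the constant `k` and the equal weighting are illustrative assumptions, not the project's exact formula:

```javascript
// Fuse a BM25 ranking and a vector-similarity ranking into one list.
// Each list is an array of memory ids, best match first.
function fuseRanks(bm25Results, vectorResults, k = 60) {
  const scores = new Map();
  const accumulate = (results) => {
    results.forEach((id, rank) => {
      // Reciprocal rank fusion: earlier ranks contribute larger scores.
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + rank + 1));
    });
  };
  accumulate(bm25Results);
  accumulate(vectorResults);
  return [...scores.entries()]
    .sort((a, b) => b[1] - a[1])
    .map(([id]) => id);
}

// Memory 42 ranks first in both lists, so it wins the fused ranking.
console.log(fuseRanks([42, 7, 13], [42, 13, 99])[0]); // 42
```

A memory that appears in both lists accumulates score from each, which is what lets a mediocre keyword match plus a strong semantic match beat a top hit in only one channel.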

02

4-Channel Retrieval

Pull via MCP tools on demand, plus auto-push through hooks on user prompt, pre-tool, and post-tool events.

03

LLM Structuring

Memories are auto-structured by an LLM into a what / when / do / warn XML format for consistent retrieval quality.
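A structured memory might look like the following; the four tags come from the format above, while the attribute and field contents are invented for illustration:

```xml
<!-- Illustrative example; the project's actual nesting and
     attributes may differ. -->
<memory type="bug">
  <what>npm install fails on a fresh machine due to a peer-dependency conflict</what>
  <when>When setting up the project for the first time</when>
  <do>Pin the Node version noted in the README before installing</do>
  <warn>Do not commit a lockfile generated under the wrong Node version</warn>
</memory>
```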

04

Auto Clustering

Similar memories grouped automatically. Mature clusters promoted to reusable skill memories that never decay.
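Grouping by similarity can be sketched as greedy threshold clustering over the embedding vectors; the 0.85 threshold and the single-pass strategy are assumptions for illustration, not the project's actual algorithm:

```javascript
// Cosine similarity between two equal-length embedding vectors.
function cosine(a, b) {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

// Greedy clustering: join the first cluster whose seed vector is
// similar enough, otherwise start a new cluster.
function cluster(vectors, threshold = 0.85) {
  const clusters = [];
  for (const v of vectors) {
    const home = clusters.find((c) => cosine(c[0], v) >= threshold);
    if (home) home.push(v);
    else clusters.push([v]);
  }
  return clusters;
}
```

A cluster that accumulates enough members would then be a candidate for promotion to a skill memory.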

05

Confidence Scoring

Each memory has a confidence score adjusted through validation feedback and usage, with configurable time decay.
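The feedback loop can be sketched as a bounded score update; the step sizes and the [0, 1] clamp are illustrative assumptions, not the project's tuned values:

```javascript
// Nudge a memory's confidence based on validation feedback.
// Penalize wrong memories harder than we reward helpful ones
// (an assumed asymmetry, not necessarily the project's).
function updateConfidence(confidence, feedback) {
  const delta =
    feedback === 'helpful' ? 0.1 :
    feedback === 'wrong' ? -0.2 : 0;
  // Clamp to [0, 1] so repeated feedback cannot overflow the score.
  return Math.min(1, Math.max(0, confidence + delta));
}

console.log(updateConfidence(0.5, 'helpful')); // 0.6
```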

06

Local-First Storage

All data in local SQLite. Your memories never leave your machine. Zero cloud dependency for storage.

Pull + Push dual channels

On-demand MCP tools paired with automatic hook injection across the full Claude Code lifecycle.

Pull — MCP Server
memory_search
memory_save
memory_validate
memory_stats
Push — Hooks
UserPromptSubmit
PreToolUse
PostToolUse
PreCompact / SessionEnd
SQLite + FTS5 + sqlite-vec
memory.db — hybrid BM25 + cosine similarity
Embedding Server
TCP :23811
bge-m3 · 1024-dim
LLM Server
TCP :23812
Azure OpenAI GPT-4.1

Memory lifecycle

From save to structure, embed, search, validate, cluster, promote, and decay.

1
Save
MCP tool or auto-extract
2
Structure
LLM → XML format
3
Embed
bge-m3 1024-dim vector
4
Search
BM25 + vector fused
5
Validate
Confidence ± feedback
6
Cluster
Auto-group similar
7
Promote
Cluster → skill
8
Decay
Low confidence fades

Up and running
in 3 steps

Clone, configure, and start. Then add MCP + hooks config to your Claude Code project.

01
Install
git clone https://github.com/MIMI180306/claude-persistent-memory.git
cd claude-persistent-memory
npm install
02
Configure
cp config.default.js config.js

# Set Azure OpenAI credentials
export AZURE_OPENAI_ENDPOINT=...
export AZURE_OPENAI_KEY=...
export AZURE_OPENAI_DEPLOYMENT=...
03
Start Services
# Terminal 1: Embeddings (~2GB RAM)
npm run embedding-server

# Terminal 2: LLM proxy
npm run llm-server
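The MCP + hooks wiring for your Claude Code project might look like the following; the entry-point file names and paths are assumptions, so check the repo for the actual commands:

```json
// .mcp.json — registers the Pull channel (MCP tools).
{
  "mcpServers": {
    "persistent-memory": {
      "command": "node",
      "args": ["/path/to/claude-persistent-memory/mcp-server.js"]
    }
  }
}
```

```json
// .claude/settings.json — registers a Push channel hook.
{
  "hooks": {
    "UserPromptSubmit": [
      {
        "hooks": [
          {
            "type": "command",
            "command": "node /path/to/claude-persistent-memory/hooks/user-prompt.js"
          }
        ]
      }
    ]
  }
}
```

The same settings structure covers the remaining events (PreToolUse, PostToolUse, PreCompact, SessionEnd), each pointing at its own hook script.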

7 types with
configurable decay

Each type has its own half-life. Skills promoted from mature clusters never expire.

fact
90 days
Stable facts about the codebase
decision
90 days
Architectural decisions and rationale
bug
60 days
Bug fixes and root causes
pattern
90 days
Recurring code patterns
context
30 days
Session-specific context
preference
60 days
User workflow preferences
skill
never
Promoted from mature clusters
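The half-lives above plug into a standard exponential-decay formula; the formula itself is assumed for illustration rather than taken from the project's code:

```javascript
// Confidence after ageDays, given a per-type half-life in days.
// Skills use Infinity as their half-life and never decay.
function decayedConfidence(confidence, ageDays, halfLifeDays) {
  if (halfLifeDays === Infinity) return confidence;
  return confidence * Math.pow(0.5, ageDays / halfLifeDays);
}

// A 90-day-old "fact" memory (half-life 90 days) keeps half its score.
console.log(decayedConfidence(0.8, 90, 90)); // 0.4
```

Under this model a "context" memory (30-day half-life) fades about three times faster than a "fact", matching the intent of the table: session-specific details expire quickly while stable knowledge lingers.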