Kleos | Persistent Memory for AI Agents

Features

Everything agents need
to remember.

Modular capabilities that compose into a full cognitive memory layer. Built in Rust for speed, safety, and zero runtime dependencies.

📈

FSRS-6 Decay

Power-law forgetting that models human memory. Memories fade unless reinforced.

🔗

Knowledge Graph

10 typed edge relations, Louvain community detection, PageRank with 180-day temporal decay. 2-hop graph traversal augments search.

🔍

Hybrid Search

4-channel RRF: vector similarity, FTS5 BM25, graph traversal, personality signals. IBM Granite cross-encoder reranks top-K.

🔐

At-Rest Encryption

SQLCipher database encryption. Keyfile, environment variable, or YubiKey HMAC-SHA1. Argon2id KDF.

👤

Personality

Learned preferences, style guides, and user-specific behaviors that persist.

🕐

Time Travel

View memory state at any point in time. See what the agent knew and when.

🤖

Multi-Agent

Shard-per-tenant isolation. Deadpool reader/writer pools. Lazy loading with LRU eviction. Quota enforcement.

📌

Decomposition

Break complex memories into atomic facts. Each scored and decays independently.

🎯

Budget Context

Token-aware assembly. Fill context windows optimally for any model.

📡

Eidolon

Single POST /activity fans out to 6 services: Chiasm tasks, Axon events, Broca logs, Thymus metrics, Skills, Memory.

🔑

credd

Secure credential daemon. Agent keys, secret resolution, proxy injection. Never hardcode again.

🧠

Hopfield Network

Modern associative memory (Ramsauer 2020). Pattern completion with exponential capacity.

🌙

Dream Consolidation

6-stage autonomous consolidation: replay, merge, prune, discover, decorrelate, resolve. Runs on schedule like biological sleep.

🔗

Causal Chains

Track cause/effect relationships. Build reasoning chains across memories with confidence scoring.

🎯

Cross-Encoder Rerank

IBM Granite ONNX model. Semantic precision on top-K results after initial retrieval.

🛠

Tool Grounding

Secure tool execution. MCP, Shell, Web, GUI backends with sandboxing and security policies.

📈

Million+ Scale

LanceDB vector index handles 1M+ memories per user. Single-user or production multi-tenant.

🛠

Agent-Forge

Structured reasoning protocol. spec_task, log_hypothesis, verify, challenge_code. Quality workflows baked in.

💭

Sidecar

Session-scoped Guardian. Batched observation flushing with retry, file watcher, Prometheus metrics, Syntheos integration.

🔒

Per-Agent Keys

Every agent and model gets its own encrypted API key. Full audit trail of who stored what.

⚙

Claude Hooks

Drop-in hooks for Claude Code. Auto context loading, session tracking, memory storage. Zero manual prompting.

Smart Memory

FSRS-6 Spaced Repetition -- 21-parameter algorithm with power-law forgetting curves
Dual-Strength Model -- Bjork & Bjork: storage strength never decays, retrieval resets on access
Versioning -- Update memories without losing history. Full version chain preserved
Auto-Deduplication -- SimHash 64-bit locality-sensitive hashing detects near-identical memories
Auto-Forget / TTL -- Set memories to expire. Background sweep every 5 minutes
Contradiction Detection -- LLM-verified conflict scanning with resolution options
Hopfield Network -- Modern associative memory with exponential capacity
Dream Consolidation -- 6-stage cycle: replay, merge, prune, discover, decorrelate, resolve

Intelligence Layer

4-Channel Hybrid Search -- RRF across vector, FTS5, personality, and graph signals
Fact Extraction & Auto-Tagging -- Structured facts with temporal validity windows
Atomic Fact Decomposition -- Long memories split into independently searchable atomic facts
Conversation Extraction -- Feed raw chat logs, get structured memories
Reflections & Consolidation -- Meta-analysis and cluster compression
Abstention -- Search knows what it doesn't know. No false positives on unrelated queries
Personality Engine -- Six signal types shape future recall scoring
Guardrails -- Pre-action safety checks: allow/warn/block
Causal Chains -- Track cause/effect relationships with confidence
Emotional Valence -- Sentiment analysis with valence/arousal scoring
Cross-Encoder Rerank -- IBM Granite ONNX for semantic precision
Search Cache -- 32-shard LRU with generation-counter invalidation
Faceted Filtering -- Category, temporal window, tags, source expressions

Developer Platform

MCP Server -- 57+ tools for Claude Desktop, Cursor, Windsurf, and other MCP clients
TypeScript SDK -- First-class client with store, search, context, guard, inbox
Python SDK -- Pydantic v2 + httpx async client
Go SDK -- Stdlib-only client, zero dependencies
CLI -- Full access to every feature from your terminal
REST API -- 80+ endpoints with OpenAPI 3.1 spec
Multi-Tenant + RBAC -- Isolated memory per user with role-based access
Webhooks & Digests -- Event hooks with HMAC signing and scheduled digests
Audit Trail -- Every mutation logged with who, what, when, from where
Transcript Ingest -- Dedicated kleos-ingest daemon for real-time observation streaming
Cleanup Utility -- One-shot growth/activity deduplication and migration
Scratchpad -- Ephemeral working memory with TTL auto-purge
LanceDB Scale -- 1M+ memories per user. Production-ready vector index
Tool Grounding -- Secure execution: MCP, Shell, Web, GUI with sandboxing
Skill Learning -- Evolving skills with trust scoring and versioning
Loom Workflows -- Multi-step orchestration with parallel/decision/wait
Thymus Evaluations -- Rubric-based quality scoring and metrics
Agent-Forge -- Structured reasoning: spec_task, log_hypothesis, verify, challenge_code
Sidecar -- Session Guardian: batched observation flushing, retry, Prometheus metrics, Syntheos wiring
Per-Agent Keys -- Encrypted API keys per agent/model with full audit trail
Claude Code Hooks -- Drop-in hooks for auto context, session tracking, memory storage
credd Daemon -- Secure credential resolution, proxy injection, secret management
PIV-Backed Signing -- Hardware-secured request signing for ingest and store operations
Ingestion Pipeline -- Markdown, JSON/JSONL, CSV, PDF, ZIP with recursive extraction
Chunked Upload -- Binary upload protocol with SHA-256 verification and 24h expiry
Search Cache -- 32-shard LRU with per-user generation counters and 15s TTL
Faceted Search -- Complex filter expressions: category, temporal, tags, source
Gate/Guard System -- Pre-action policy engine: allow/warn/block with approval queue
PCA Compression -- Weekly batching of 7+ memories into summary with generalizes edges

Visualization & Organization

WebGL Galaxy Graph -- Interactive memory space visualization at /gui
Knowledge Graph -- Auto-linking, PageRank, community detection
Episodic Memory -- Conversation episodes as embedded, searchable narratives
Entities & Projects -- First-class people, servers, tools, and projects
Review Queue / Inbox -- Approve, reject, or edit before memories enter recall
Community Detection -- Louvain clustering surfaces memory clusters
PageRank -- Structural importance scoring boosts search results
Time-Travel Queries -- Query memory state at any past moment

A day with Kleos

What it actually looks like to use persistent memory.

Session Starts

Pick up where you left off

Your agent pulls context from Kleos. It knows the project state, your preferences, what you decided last time, and what's still unresolved.

While Working

Decisions and discoveries stored automatically

New architecture choices, deployment configs, bug resolutions -- all stored with importance scoring and auto-linked to related memories.

Conflict Detected

"You said X last week but now Y -- which is correct?"

Kleos catches contradictions between new information and existing knowledge. Your agent surfaces them instead of silently overwriting history.

Session Ends

Memories consolidate, weak ones fade

Important memories grow stronger. Irrelevant details lose retrieval strength. The knowledge graph reorganizes around what actually matters.

Next Session

Your agent remembers everything that mattered

Not a transcript dump. Not a vector search over flat files. Weighted, prioritized, personality-aware context -- assembled from a living knowledge graph.

How memory retrieval strength works

Click "Recall" to see how spaced repetition builds lasting memory.

100% 50% 0%

0 Days 12mo

Retrievability: 85% Stability: 1.2d Reviews: 2

See your memory space

Kleos includes a built-in WebGL graph visualization. Explore connections between memories, search, create, edit, all from the browser.

demo-gui.syntheos.dev

🧠

Interactive Memory Galaxy

Explore a live demo with sample memories and auto-generated links. Click nodes to inspect. Search, filter by category, create new memories. Keyboard shortcuts for power users.

WebGL Rendering Force-Directed Layout Real-time Search Category Filters

Open Live Demo

Observability

Eidolon
Agent activity tracking

Single activity endpoint that fans out to six downstream services. One POST triggers Chiasm task updates, Axon event publishing, Broca action logging, Thymus metrics, skill matching, and memory storage.

✓ Chiasm task lifecycle (active/paused/blocked/completed)
✓ Axon event publishing with cursor-based replay
✓ Broca permanent action logging with causal links
✓ Soma agent registry with heartbeat and drift detection
✓ Thymus quality scoring and reinforcement loop

# Single POST fans out to all services

curl -X POST $EIDOLON_URL/activity \

-H "Authorization: Bearer $EIDOLON_KEY" \

-d '{

"agent": "claude-code",

"action": "task.completed",

"summary": "Deployed v2.1 to prod"

}'

Coordination

Seven services,
one process.

Neuroscience-named coordination primitives that share the same substrate. No external message broker. No separate databases. Everything in-process, zero-latency.

Axon

Event Bus

Pub/sub with channels, retention windows, cursor-based consumption. Webhook push or pull polling. At-least-once delivery.

Broca

Action Log

Permanent structured log of agent actions. Complex filtering by agent, service, action type. Causal links to Axon events.

Chiasm

Task Tracker

Long-running task lifecycle management. State machine: active, paused, blocked, completed. Project-scoped for multi-agent work.

Soma

Agent Registry

Directory of active agents with heartbeat monitoring. Capability declarations, quality scores, drift flags.

Loom

Workflow Engine

DAG-based execution engine. Actions, decisions, LLM calls as nodes. State persistence across restarts.

Thymus

Quality Evaluation

Aggregates feedback signals into quality scores. Rubric-based evaluation. Reinforcement learning loop stored in Soma.

Brain

Cognitive Backend

Hopfield network or external subprocess. Init, Query, Absorb, DreamCycle commands. JSON over STDIN/STDOUT protocol.

# One POST triggers all services

curl -X POST localhost:4200/activity \

-H "Authorization: Bearer $KEY" \

-d '{"agent": "claude", "action": "task.completed"}'

Fans out to: Chiasm + Axon + Broca + Thymus + Skills + Memory

Zero Manual Prompting

Drop-in hooks.
Automatic memory.

No more "search Kleos for..." in every prompt. Install hooks once, memory works automatically. Session context loads at start, memories store on end, relevant context surfaces each turn.

⚙

Claude Code Hooks

Four hooks that wire up the entire system:

SessionStart -- Load project context
UserPrompt -- Search relevant memories
PostToolUse -- Report to sidecar
Stop -- Store session summary

🛠

Agent-Forge Protocol

Structured reasoning workflow enforced via hooks:

spec_task -- Define before coding
log_hypothesis -- Record debugging guesses
verify -- Check code compiles/runs
challenge_code -- Quality self-review

💭

Sidecar

Session-scoped Guardian:

Batched flush -- Observations buffered with exponential backoff retry
File watcher -- Auto-extracts from Claude Code JSONL with checkpoint persistence
Syntheos -- Axon events, Chiasm tasks, Broca logs, Soma heartbeat
Prometheus -- Flush latency, pending depth, active sessions, compress stats

# Install hooks and forget about manual prompting

cp hooks/simple/*.sh ~/.claude/hooks/

# Or full version with Eidolon, Agent-Forge, credd

cp hooks/full/*.sh ~/.claude/hooks/

Compatibility

Works with
any AI platform.

Connect via MCP protocol, HTTP API, or CLI. Drop Kleos into your existing workflow in minutes.

🤖

Claude

🚀

Cursor

🌊

Windsurf

⚡

Zed

💬

ChatGPT

💻

Any MCP

MCP Native | HTTP REST API | CLI Tool

Security

Hardware-backed
security.

Not just authentication. Non-repudiable request signing, pre-action policy gates, layered middleware, and optional YubiKey integration. Security that does not depend on trust.

KLEOSv1 Signing

Non-repudiable request envelopes. ECDSA-P256 (YubiKey PIV) or Ed25519 (software). 60-second replay window with nonce protection.

Gate/Guard System

Pre-action decision engine. Allow, warn, or block policies. Human-in-the-loop approval queue for sensitive operations.

Middleware Stack

6 layers: Tower-HTTP, safe mode (crash-loop detection), pre-auth rate limit, authentication, audit logging, tenant quotas.

YubiKey Integration

PIV for remote API signing with public PEM registry. HMAC-SHA1 challenge-response for local credential unlocking.

SQLCipher Encryption

Per-tenant database encryption at rest. Key resolution from keyfile, environment variable, or YubiKey HMAC-SHA1.

credd Vault

AES-256-GCM credential daemon on port 4400. Keyless agent bootstrap. Secret resolution without plaintext exposure.

Performance

Built for
production.

Rust-native performance with zero garbage collection pauses. Local ONNX inference means no API latency.

<15ms

Search Latency

1024

Embedding Dims

0

Cloud Dependencies

16

Rust Crates

Memory Recall Accuracy 94%

FSRS Stability After 7 Days 87%

Search Relevance (MRR@10) 0.91

Graph Entity Resolution 89%

Internal benchmarks on synthetic workloads. Results vary with dataset size, embedding model, and usage patterns.

Integration

Drop in
anywhere.

CLI Bash

# Store a memory
kleos-cli store "Auth migrated to JWT" \
--category decision \
--importance 9

# Search
kleos-cli search "auth setup"

✔ 3 results in 12ms

HTTP API curl

# Store via API
curl -X POST localhost:4200/store \
  -H "Authorization: Bearer eg_..." \
  -d '{"content":"Auth migrated"}'

# Contextual recall
curl -X POST localhost:4200/context \
  -d '{"query":"auth setup"}'

Quick Start 30 seconds

# Clone and build
git clone github.com/Ghost-Frame/Kleos
cargo build --release

# Run server
./target/release/kleos-server

✔ Ready on :4200

Interactive Demo

Try it yourself.

CLI playground, live memory graph, and store demo. All running in your browser.

CLI Playground

kleos

# Type a command and press Enter

# Try: store, search, list, help

$

Store a Memory

Get started in
30 seconds.

One command. Full memory. Your hardware.

$ curl -fsSL kleos.sh | sh

View on GitHub Read the Docs

127.0.0.1:4200 · 16 crates · rusqlite + ONNX + LanceDB · Zero cloud dependencies

Your agents have amnesia.

Everything agents need to remember.

FSRS-6 Decay

Knowledge Graph

Hybrid Search

At-Rest Encryption

Personality

Time Travel

Multi-Agent

Decomposition

Budget Context

Eidolon

credd

Hopfield Network

Dream Consolidation

Causal Chains

Cross-Encoder Rerank

Tool Grounding

Million+ Scale

Agent-Forge

Sidecar

Per-Agent Keys

Claude Hooks

Smart Memory

Intelligence Layer

Developer Platform

Visualization & Organization

A day with Kleos

Pick up where you left off

Decisions and discoveries stored automatically

"You said X last week but now Y -- which is correct?"

Memories consolidate, weak ones fade

Your agent remembers everything that mattered

How memory retrieval strength works

See your memory space

Interactive Memory Galaxy

Eidolon Agent activity tracking

Seven services, one process.

Axon

Broca

Chiasm

Soma

Loom

Thymus

Brain

Drop-in hooks. Automatic memory.

Claude Code Hooks

Agent-Forge Protocol

Sidecar

Works with any AI platform.

Hardware-backed security.

KLEOSv1 Signing

Gate/Guard System

Middleware Stack

YubiKey Integration

SQLCipher Encryption

credd Vault

Built for production.

Drop in anywhere.

Try it yourself.

CLI Playground

Store a Memory

Get started in 30 seconds.

Everything agents need
to remember.

Eidolon
Agent activity tracking

Seven services,
one process.

Drop-in hooks.
Automatic memory.

Works with
any AI platform.

Hardware-backed
security.

Built for
production.

Drop in
anywhere.

Get started in
30 seconds.