GIR MODE ACTIVATED
Open Source · Self-Hosted · Rust Native

Your agents have amnesia.

Kleos is cognitive infrastructure for AI agents. 16 Rust crates: persistent memory with FSRS-6 decay, 4-channel hybrid search, knowledge graphs, 7 coordination services, and hardware-backed security. One binary. Zero cloud dependencies.

RUST_NATIVE// ONNX_EMBEDDINGS// FSRS_6_DECAY// KNOWLEDGE_GRAPHS// HYBRID_SEARCH// AT_REST_ENCRYPTION// MULTI_TENANT// AGENT_FORGE// KLEOS_SIDECAR// TOOL_COMPRESSION// PER_AGENT_KEYS// CLAUDE_HOOKS// EIDOLON// CREDD// MILLION_PLUS_SCALE// HOPFIELD_NETWORK// DREAM_CONSOLIDATION// CROSS_ENCODER_RERANK// ACTIVITY_FANOUT// GATE_GUARD// KLEOSV1_SIGNING// LOUVAIN_CLUSTERING// SEARCH_CACHE// INGESTION_PIPELINE// RUST_NATIVE// ONNX_EMBEDDINGS// FSRS_6_DECAY// KNOWLEDGE_GRAPHS// HYBRID_SEARCH// AT_REST_ENCRYPTION// MULTI_TENANT// AGENT_FORGE// KLEOS_SIDECAR// TOOL_COMPRESSION// PER_AGENT_KEYS// CLAUDE_HOOKS// EIDOLON// CREDD// MILLION_PLUS_SCALE// HOPFIELD_NETWORK// DREAM_CONSOLIDATION// CROSS_ENCODER_RERANK// ACTIVITY_FANOUT// GATE_GUARD// KLEOSV1_SIGNING// LOUVAIN_CLUSTERING// SEARCH_CACHE// INGESTION_PIPELINE//

Everything agents need
to remember.

Modular capabilities that compose into a full cognitive memory layer. Built in Rust for speed, safety, and zero runtime dependencies.

📈

FSRS-6 Decay

Power-law forgetting that models human memory. Memories fade unless reinforced.

🔗

Knowledge Graph

10 typed edge relations, Louvain community detection, PageRank with 180-day temporal decay. 2-hop graph traversal augments search.

🔍

Hybrid Search

4-channel RRF: vector similarity, FTS5 BM25, graph traversal, personality signals. IBM Granite cross-encoder reranks top-K.

🔐

At-Rest Encryption

SQLCipher database encryption. Keyfile, environment variable, or YubiKey HMAC-SHA1. Argon2id KDF.

👤

Personality

Learned preferences, style guides, and user-specific behaviors that persist.

🕐

Time Travel

View memory state at any point in time. See what the agent knew and when.

🤖

Multi-Agent

Shard-per-tenant isolation. Deadpool reader/writer pools. Lazy loading with LRU eviction. Quota enforcement.

📌

Decomposition

Break complex memories into atomic facts. Each scored and decays independently.

🎯

Budget Context

Token-aware assembly. Fill context windows optimally for any model.

📡

Eidolon

Single POST /activity fans out to 6 services: Chiasm tasks, Axon events, Broca logs, Thymus metrics, Skills, Memory.

🔑

credd

Secure credential daemon. Agent keys, secret resolution, proxy injection. Never hardcode again.

🧠

Hopfield Network

Modern associative memory (Ramsauer 2020). Pattern completion with exponential capacity.

🌙

Dream Consolidation

6-stage autonomous consolidation: replay, merge, prune, discover, decorrelate, resolve. Runs on schedule like biological sleep.

🔗

Causal Chains

Track cause/effect relationships. Build reasoning chains across memories with confidence scoring.

🎯

Cross-Encoder Rerank

IBM Granite ONNX model. Semantic precision on top-K results after initial retrieval.

🛠

Tool Grounding

Secure tool execution. MCP, Shell, Web, GUI backends with sandboxing and security policies.

📈

Million+ Scale

LanceDB vector index handles 1M+ memories per user. Single-user or production multi-tenant.

🛠

Agent-Forge

Structured reasoning protocol. spec_task, log_hypothesis, verify, challenge_code. Quality workflows baked in.

💭

Sidecar

Session-scoped Guardian. Batched observation flushing with retry, file watcher, Prometheus metrics, Syntheos integration.

🔒

Per-Agent Keys

Every agent and model gets its own encrypted API key. Full audit trail of who stored what.

Claude Hooks

Drop-in hooks for Claude Code. Auto context loading, session tracking, memory storage. Zero manual prompting.

A day with Kleos

What it actually looks like to use persistent memory.

Session Starts

Pick up where you left off

Your agent pulls context from Kleos. It knows the project state, your preferences, what you decided last time, and what's still unresolved.

While Working

Decisions and discoveries stored automatically

New architecture choices, deployment configs, bug resolutions -- all stored with importance scoring and auto-linked to related memories.

Conflict Detected

"You said X last week but now Y -- which is correct?"

Kleos catches contradictions between new information and existing knowledge. Your agent surfaces them instead of silently overwriting history.

Session Ends

Memories consolidate, weak ones fade

Important memories grow stronger. Irrelevant details lose retrieval strength. The knowledge graph reorganizes around what actually matters.

Next Session

Your agent remembers everything that mattered

Not a transcript dump. Not a vector search over flat files. Weighted, prioritized, personality-aware context -- assembled from a living knowledge graph.

How memory retrieval strength works

Click "Recall" to see how spaced repetition builds lasting memory.

100% 50% 0%
0 Days 12mo
Retrievability: 85% Stability: 1.2d Reviews: 2

See your memory space

Kleos includes a built-in WebGL graph visualization. Explore connections between memories, search, create, edit, all from the browser.

demo-gui.syntheos.dev
🧠

Interactive Memory Galaxy

Explore a live demo with sample memories and auto-generated links. Click nodes to inspect. Search, filter by category, create new memories. Keyboard shortcuts for power users.

WebGL Rendering Force-Directed Layout Real-time Search Category Filters
Open Live Demo

Eidolon
Agent activity tracking

Single activity endpoint that fans out to six downstream services. One POST triggers Chiasm task updates, Axon event publishing, Broca action logging, Thymus metrics, skill matching, and memory storage.

  • Chiasm task lifecycle (active/paused/blocked/completed)
  • Axon event publishing with cursor-based replay
  • Broca permanent action logging with causal links
  • Soma agent registry with heartbeat and drift detection
  • Thymus quality scoring and reinforcement loop
# Single POST fans out to all services
curl -X POST $EIDOLON_URL/activity \
-H "Authorization: Bearer $EIDOLON_KEY" \
-d '{
"agent": "claude-code",
"action": "task.completed",
"summary": "Deployed v2.1 to prod"
}'

Seven services,
one process.

Neuroscience-named coordination primitives that share the same substrate. No external message broker. No separate databases. Everything in-process, zero-latency.

Axon

Event Bus

Pub/sub with channels, retention windows, cursor-based consumption. Webhook push or pull polling. At-least-once delivery.

Broca

Action Log

Permanent structured log of agent actions. Complex filtering by agent, service, action type. Causal links to Axon events.

Chiasm

Task Tracker

Long-running task lifecycle management. State machine: active, paused, blocked, completed. Project-scoped for multi-agent work.

Soma

Agent Registry

Directory of active agents with heartbeat monitoring. Capability declarations, quality scores, drift flags.

Loom

Workflow Engine

DAG-based execution engine. Actions, decisions, LLM calls as nodes. State persistence across restarts.

Thymus

Quality Evaluation

Aggregates feedback signals into quality scores. Rubric-based evaluation. Reinforcement learning loop stored in Soma.

Brain

Cognitive Backend

Hopfield network or external subprocess. Init, Query, Absorb, DreamCycle commands. JSON over STDIN/STDOUT protocol.

# One POST triggers all services
curl -X POST localhost:4200/activity \
-H "Authorization: Bearer $KEY" \
-d '{"agent": "claude", "action": "task.completed"}'
Fans out to: Chiasm + Axon + Broca + Thymus + Skills + Memory

Drop-in hooks.
Automatic memory.

No more "search Kleos for..." in every prompt. Install hooks once, memory works automatically. Session context loads at start, memories store on end, relevant context surfaces each turn.

Claude Code Hooks

Four hooks that wire up the entire system:

  • SessionStart -- Load project context
  • UserPrompt -- Search relevant memories
  • PostToolUse -- Report to sidecar
  • Stop -- Store session summary
🛠

Agent-Forge Protocol

Structured reasoning workflow enforced via hooks:

  • spec_task -- Define before coding
  • log_hypothesis -- Record debugging guesses
  • verify -- Check code compiles/runs
  • challenge_code -- Quality self-review
💭

Sidecar

Session-scoped Guardian:

  • Batched flush -- Observations buffered with exponential backoff retry
  • File watcher -- Auto-extracts from Claude Code JSONL with checkpoint persistence
  • Syntheos -- Axon events, Chiasm tasks, Broca logs, Soma heartbeat
  • Prometheus -- Flush latency, pending depth, active sessions, compress stats
# Install hooks and forget about manual prompting
cp hooks/simple/*.sh ~/.claude/hooks/
# Or full version with Eidolon, Agent-Forge, credd
cp hooks/full/*.sh ~/.claude/hooks/

Works with
any AI platform.

Connect via MCP protocol, HTTP API, or CLI. Drop Kleos into your existing workflow in minutes.

🤖
Claude
🚀
Cursor
🌊
Windsurf
Zed
💬
ChatGPT
💻
Any MCP
MCP Native | HTTP REST API | CLI Tool

Hardware-backed
security.

Not just authentication. Non-repudiable request signing, pre-action policy gates, layered middleware, and optional YubiKey integration. Security that does not depend on trust.

KLEOSv1 Signing

Non-repudiable request envelopes. ECDSA-P256 (YubiKey PIV) or Ed25519 (software). 60-second replay window with nonce protection.

Gate/Guard System

Pre-action decision engine. Allow, warn, or block policies. Human-in-the-loop approval queue for sensitive operations.

Middleware Stack

6 layers: Tower-HTTP, safe mode (crash-loop detection), pre-auth rate limit, authentication, audit logging, tenant quotas.

YubiKey Integration

PIV for remote API signing with public PEM registry. HMAC-SHA1 challenge-response for local credential unlocking.

SQLCipher Encryption

Per-tenant database encryption at rest. Key resolution from keyfile, environment variable, or YubiKey HMAC-SHA1.

credd Vault

AES-256-GCM credential daemon on port 4400. Keyless agent bootstrap. Secret resolution without plaintext exposure.

Built for
production.

Rust-native performance with zero garbage collection pauses. Local ONNX inference means no API latency.

<15ms
Search Latency
1024
Embedding Dims
0
Cloud Dependencies
16
Rust Crates
Memory Recall Accuracy 94%
FSRS Stability After 7 Days 87%
Search Relevance (MRR@10) 0.91
Graph Entity Resolution 89%

Internal benchmarks on synthetic workloads. Results vary with dataset size, embedding model, and usage patterns.

Drop in
anywhere.

CLI Bash
# Store a memory
kleos-cli store "Auth migrated to JWT" \
  --category decision \
  --importance 9

# Search
kleos-cli search "auth setup"

✔ 3 results in 12ms
HTTP API curl
# Store via API
curl -X POST localhost:4200/store \
  -H "Authorization: Bearer eg_..." \
  -d '{"content":"Auth migrated"}'

# Contextual recall
curl -X POST localhost:4200/context \
  -d '{"query":"auth setup"}'
Quick Start 30 seconds
# Clone and build
git clone github.com/Ghost-Frame/Kleos
cargo build --release

# Run server
./target/release/kleos-server

✔ Ready on :4200

Try it yourself.

CLI playground, live memory graph, and store demo. All running in your browser.

CLI Playground

kleos
# Type a command and press Enter
# Try: store, search, list, help
$

Store a Memory

Get started in
30 seconds.

One command. Full memory. Your hardware.

$ curl -fsSL kleos.sh | sh

127.0.0.1:4200 · 16 crates · rusqlite + ONNX + LanceDB · Zero cloud dependencies