clawlter/hermes-lcm: Forgejo fork of stephenschoettler/hermes-lcm for preparing LCM bugfix branches.

Watch

Forgejo fork of stephenschoettler/hermes-lcm for preparing LCM bugfix branches.

Python 99.9%
Shell 0.1%

Find a file

Repository files (latest commit first)
Filename	Latest commit message	Latest commit date
Stephen Schoettler b8d934b58f All checks were successful CI / test (3.11) (push) Successful in 2m37s Details CI / test (3.12) (push) Successful in 1m53s Details CI / test (3.13) (push) Successful in 2m49s Details chore: release v0.9.2 (#130 ) * chore: release v0.9.2 * docs: refresh issue template release example		2026-05-08 22:26:59 +02:00
.github	chore: release v0.9.2 (#130 )	2026-05-08 22:26:59 +02:00
docs	docs: update architecture diagram	2026-04-15 22:29:43 -07:00
scripts	fix: encode importer SQLite URI paths (#115 )	2026-05-06 07:21:28 -07:00
tests	chore: release v0.9.2 (#130 )	2026-05-08 22:26:59 +02:00
.gitignore	Initial commit: hermes-lcm plugin — Lossless Context Management for Hermes Agent	2026-04-06 18:57:59 -07:00
__init__.py	Harden /lcm slash registration behind explicit opt-in (#56 )	2026-04-19 22:34:39 -07:00
banner.png	Replace ASCII banner with pixel-art PNG for consistent rendering	2026-04-14 10:22:34 -07:00
command.py	fix: prevent ignored sessions from rebinding current LCM session (#127 )	2026-05-06 14:51:42 -07:00
config.py	fix: fall back to Hermes compression threshold (#114 )	2026-05-06 07:17:31 -07:00
CONTRIBUTING.md	docs: add issue forms	2026-05-07 20:22:30 -07:00
dag.py	fix: treat empty session ids as scoped search filters	2026-05-05 03:20:02 +02:00
db_bootstrap.py	fix: report FTS integrity failures separately (#98 )	2026-05-02 09:59:37 -07:00
engine.py	fix: require replay evidence for restart cursor reconciliation (#113 )	2026-05-08 01:52:07 -07:00
escalation.py	fix: route provider-prefixed LCM models (#72 )	2026-04-26 12:18:11 +02:00
externalize.py	fix: continue LCM state across compression boundaries (#76 )	2026-04-27 10:12:12 +02:00
extraction.py	fix: route provider-prefixed LCM models (#72 )	2026-04-26 12:18:11 +02:00
lifecycle_state.py	feat: report lifecycle fragmentation diagnostics (#90 )	2026-04-29 16:15:13 -07:00
message_content.py	fix: match structured message text parts	2026-05-07 00:57:39 +02:00
message_patterns.py	fix: guard message ignore regex matching	2026-05-07 01:04:39 +02:00
model_routing.py	fix: route configured custom provider model overrides (#99 )	2026-05-02 17:21:01 -07:00
plugin.yaml	chore: release v0.9.2 (#130 )	2026-05-08 22:26:59 +02:00
README.md	chore: release v0.9.2 (#130 )	2026-05-08 22:26:59 +02:00
schemas.py	fix: surface filter config in lcm_status (#121 )	2026-05-06 11:01:50 -07:00
search_query.py	fix: bound LIKE fallback candidate scans (#61 )	2026-04-19 22:42:42 -07:00
session_patterns.py	feat: add session filtering and stateless session handling	2026-04-13 00:12:24 +02:00
store.py	feat: add explicit LCM session loader (#116 )	2026-05-06 07:34:32 -07:00
tokens.py	fix: harden SQLite message persistence (#77 )	2026-04-28 22:24:42 +02:00
tools.py	fix: prevent ignored sessions from rebinding current LCM session (#127 )	2026-05-06 14:51:42 -07:00

README.md

Lossless Context Management plugin for Hermes Agent

Bounded context, unbounded memory. Nothing is ever lost.

Based on the LCM paper by Ehrlich & Blackman (Voltropy PBC, Feb 2026). Inspired by lossless-claw for OpenClaw.

The Problem

When active context fills up, agents usually replace older turns with a flat summary. Details can fall out of the prompt, and recovery depends on a separate history path the model may not use.

Standard compression

The Fix

Persist the conversation, compact old context into a hierarchical summary DAG, and give the agent tools to drill back into the exact material that was compacted.

LCM compression

Architecture

What It Does

SQLite message store - preserves raw messages by default before compaction
Summary DAG - compacts older context into depth-aware summary nodes
Bounded recovery - pages raw messages, child summaries, and externalized payloads without flooding the main context
Agent tools - lcm_grep, lcm_describe, lcm_expand, and lcm_expand_query
Source-aware retrieval - filters raw rows and summaries by descendant source lineage
Session controls - ignore noisy sessions or keep sessions read-only with glob patterns
Large output controls - optional externalization and transcript GC for oversized tool results
Diagnostics - lcm_status, lcm_doctor, and optional /lcm slash commands

LCM vs built-in compression

Hermes core may persist original conversation history in state.db before built-in compression rewrites the active prompt. Built-in compression can still be lossy in the active context, but previous content may be recoverable later through host-level history tools such as session_search.

hermes-lcm is different because recall is part of the active context engine:

plugin-local store and DAG built specifically for drill-down
current-session retrieval through LCM tools, not an auxiliary cross-session search step
explicit source-lineage and session-boundary rules

Position LCM around retrieval quality, autonomy, and drill-down behavior. Do not claim that Hermes core has no persisted record of pre-compression history.

Requirements

Hermes Agent with the pluggable context engine slot (PR #7464)
Python 3.11+
No required third-party runtime dependencies. tiktoken is used if available; otherwise LCM falls back to character-based token estimates. regex is used if available to apply timeouts to message ignore patterns; if it is not installed, message-level regex filtering is disabled with a warning rather than running unbounded stdlib re matches.

Install

Canonical install path: clone hermes-lcm as a general user plugin.

git clone https://github.com/stephenschoettler/hermes-lcm \
  ~/.hermes/plugins/hermes-lcm

For a profile-specific install:

git clone https://github.com/stephenschoettler/hermes-lcm \
  ~/.hermes/profiles/myprofile/plugins/hermes-lcm

From an existing checkout, install a symlink:

./scripts/install.sh
# Optional profile-aware install:
HERMES_PROFILE=myprofile ./scripts/install.sh

Activate

The plugin has two names:

plugin manifest name: hermes-lcm
runtime context engine name: lcm

Both must be configured:

plugins:
  enabled:
    - hermes-lcm

context:
  engine: lcm

Restart Hermes after changing plugin or context-engine config.

Update

If you cloned directly into the plugin directory:

cd ~/.hermes/plugins/hermes-lcm && git pull --ff-only

For a profile-specific install:

cd ~/.hermes/profiles/myprofile/plugins/hermes-lcm && git pull --ff-only

If you installed a symlink from a separate checkout:

./scripts/update.sh

Restart Hermes after updating.

Verify

Run:

hermes plugins

Expected signals:

plugin list includes hermes-lcm
selected context engine is lcm
tool list includes lcm_grep, lcm_load_session, lcm_describe, lcm_expand, lcm_expand_query, lcm_status, and lcm_doctor

Typical output:

Plugins (1):
  ✓ hermes-lcm v0.9.2 (7 tools)

Provider Plugins:
  Context Engine: lcm

For source checkouts, lcm_status, /lcm status, lcm_doctor, and /lcm doctor also report the loaded plugin path and best-effort git identity: plugin_git_commit, plugin_git_branch, and plugin_git_dirty.

Troubleshooting

`hermes plugins` shows `lcm (not found)` but LCM tools exist

If plugins.enabled contains hermes-lcm, context.engine: lcm is set, and the runtime exposes LCM tools, LCM is loaded. The lcm (not found) line is a Hermes host discovery/status mismatch, not an LCM storage or compaction failure.

`/lcm status` looks unbound after restart

After a fresh Hermes restart, /lcm status may show session_id: (unbound) or threshold_tokens: (uninitialized). Send one normal Hermes message first, then run lcm_status or /lcm status again for live per-session fields.

Configuration

Most installs only need plugins.enabled and context.engine: lcm. Useful environment variables:

Variable	Default	Use
`LCM_CONTEXT_THRESHOLD`	`0.75`	Fraction of the context window that triggers LCM compaction
`LCM_FRESH_TAIL_COUNT`	`64`	Recent messages protected from compaction
`LCM_LEAF_CHUNK_TOKENS`	`20000`	Token floor for leaf compaction chunks
`LCM_NEW_SESSION_RETAIN_DEPTH`	`2`	DAG depth retained after manual `/new` (`-1` all, `0` none)
`LCM_IGNORE_SESSION_PATTERNS`	empty	Comma-separated session globs excluded from LCM storage
`LCM_STATELESS_SESSION_PATTERNS`	empty	Comma-separated session globs kept read-only
`LCM_IGNORE_MESSAGE_PATTERNS`	empty	Comma-separated regex patterns; matching message content (plain text, extracted text parts for structured/multimodal content, or normalized JSON fallback when no text parts exist) is excluded from LCM storage
`LCM_LARGE_OUTPUT_EXTERNALIZATION_ENABLED`	`false`	Store oversized tool outputs in plugin-managed JSON files
`LCM_LARGE_OUTPUT_EXTERNALIZATION_THRESHOLD_CHARS`	`12000`	Externalization threshold for tool output text
`LCM_LARGE_OUTPUT_TRANSCRIPT_GC_ENABLED`	`false`	Rewrite already-externalized summarized tool rows to compact placeholders
`LCM_SUMMARY_MODEL`	auxiliary	Override summarization model
`LCM_EXPANSION_MODEL`	summary model / auxiliary	Override `lcm_expand_query` synthesis model
`LCM_EXPANSION_CONTEXT_TOKENS`	`32000`	Context budget used by the auxiliary LLM for `lcm_expand_query`
`LCM_SUMMARY_TIMEOUT_MS`	`60000`	Timeout for one summarization call
`LCM_EXPANSION_TIMEOUT_MS`	`120000`	Timeout for one `lcm_expand_query` synthesis call
`LCM_DATABASE_PATH`	auto	SQLite database path, profile-scoped by default
`LCM_ENABLE_SLASH_COMMAND`	`false`	Enable the optional `/lcm` operator command surface
`LCM_DOCTOR_CLEAN_APPLY_ENABLED`	`false`	Permit destructive `/lcm doctor clean apply` in trusted operator contexts

Advanced compaction, assembly, and extraction knobs are defined in config.py.

Threshold ownership

When context.engine: lcm is active, LCM_CONTEXT_THRESHOLD is the compaction threshold LCM uses. Hermes core compression.threshold belongs to the built-in compressor. Hermes core compression.enabled is still the global gate that allows compaction, so leave it enabled when using LCM.

If startup/status output shows a host-side compression percentage that disagrees with LCM, trust live LCM status after a normal message has initialized the session.

Session pattern syntax

Pattern matching checks multiple keys: raw session_id, platform, and platform:session_id.

* matches within one colon-delimited segment
** can span across colons

Example: cron:* can match Hermes cron sessions, while exact raw session IDs still work.

Noise suppression

LCM offers two layers of noise filtering, sized to two different shapes of noise:

Session-level filters (LCM_IGNORE_SESSION_PATTERNS, LCM_STATELESS_SESSION_PATTERNS) catch the case where the noisy traffic arrives as its own session or platform, for example a dedicated cron:* session. Match keys cover the session id, the platform, and platform:session_id.
Message-level patterns (LCM_IGNORE_MESSAGE_PATTERNS) catch the case where cron alerts or other noise are injected into a normal Telegram or WhatsApp conversation as ordinary user-visible messages. From LCM's perspective the session/platform is telegram or whatsapp, not cron, so only the message content is distinctive.

Message-level patterns are Python regex strings, comma-separated, compiled once at engine start. They run against plain message text. For structured multimodal payloads, LCM matches against concatenated text parts first, so anchored patterns bind to the text an operator sees. If a structured payload contains no text parts, matching falls back to the normalized JSON form that LCM would have written to the store. Matching messages are skipped before storage, so new matching rows do not enter the messages table or FTS index. Filtering is role-agnostic by default, since cron alerts can be re-emitted under any role depending on the gateway.

Example operator config:

LCM_IGNORE_MESSAGE_PATTERNS=^Cronjob Response:,^>>>Cronjob Response<<<:

Invalid regex entries are logged at warning level and dropped; the surviving patterns in the same list still take effect, so a misconfigured entry never crashes ingest. Pattern matching uses a 50 ms per-pattern timeout when the optional regex package is installed. If regex is not installed, LCM logs a warning and disables message-level regex filtering rather than running unbounded stdlib re matches in the ingest path.

One operator-facing limitation to know about:

Compaction-window edge. The filter runs at ingest time. When a matching message is part of the chunk being summarized in the same turn it arrived, the message's text may appear inside the resulting summary node text. In long-running sessions where compaction triggers every several dozen turns, this can affect multiple summary nodes per day rather than only happening rarely. The summary node's source_ids will not reference the filtered message (it was never written to the store), so DAG lineage stays clean; only the serialized summary text can carry it. Closing this window is tracked as follow-up work.

lcm_status surfaces the full filter contract under session_filters, including ignore_session_patterns, stateless_session_patterns, ignore_message_patterns, their *_source fields (default or env), the current session's ignored and stateless booleans, and a process-lifetime ignored_message_count so operators can confirm their patterns are loaded and watch how often message filters fire. The counter resets on engine restart.

Large tool-output handling

Externalization is opt-in. When enabled, oversized tool results are written to plugin-managed JSON files and referenced from summaries. They remain inspectable later through lcm_describe(externalized_ref=...) and lcm_expand(externalized_ref=...).

Transcript GC is separate and also opt-in. It only rewrites already-externalized, already-summarized tool-role rows to compact placeholders. It keeps the same store_id, keeps payload files, skips pinned messages, and preserves lossless recovery through externalized_ref. After GC, lcm_grep will not match the original giant tool blob text directly; search summaries or refs instead.

Agent Tools

Use these tools for current-session recall after compaction. Use session_search for earlier separate sessions or broad cross-session history.

Tool	Use
`lcm_grep`	Search current-session raw messages and summaries. Opt into `session_scope='all'` or `session_scope='session'` (with `session_id`) for bounded archive recovery over rows already present in `lcm.db`, including externally backfilled rows that may carry source strings such as `openclaw-lcm:*`; broader scopes return raw-message hits only. Use `session_search` for earlier separate sessions or broad cross-session recall.
`lcm_load_session`	Load one ordered raw-message transcript page for an explicit `session_id`. This is not search: it returns raw rows in `store_id` order, bounded by `limit`, with per-message content bounded by `max_content_chars`, and continues with `after_store_id` from `next_cursor`.
`lcm_describe`	Inspect the current-session DAG or preview an `externalized_ref` without loading full content.
`lcm_expand`	Recover source messages, child summaries, or externalized payloads with pagination. Use `store_id` to fetch a single raw message regardless of session, suitable for drilling into a cross-session `lcm_grep` result.
`lcm_expand_query`	Answer a question using expanded current-session LCM context while returning a bounded answer.
`lcm_status`	Show runtime health, context pressure, config, source lineage, and lifecycle stats.
`lcm_doctor`	Run database, FTS, lifecycle, config, and context-pressure diagnostics.

Retrieval contract

LCM retrieval tools default to current-session scope. lcm_grep accepts session_scope='all' or session_scope='session' as an explicit opt-in for bounded archive search over rows already present in lcm.db (raw-message hits only). Once a session id is known, lcm_load_session can enumerate that session's raw transcript in chronological store_id pages without a search query. Use Hermes session_search for broad cross-session history outside the LCM database.

Within the current session, source filters raw rows directly and filters summary nodes by descendant raw-message source lineage. unknown is a real source value, not a wildcard. Legacy blank-source rows are treated as unknown.

Carried-over summary nodes can become current-session content after /new, but their source eligibility still comes from the descendant raw messages.

Lossless raw recovery contract

Tool responses are bounded so one retrieval call cannot flood the main context. Lossless recovery means raw content is stored with stable source lineage and can be recovered in deterministic pages.

lcm_expand(node_id=...) pages immediate sources with source_offset and source_limit
lcm_load_session(session_id=...) pages ordered raw session rows with after_store_id and next_cursor; each row includes bounded content plus truncation metadata, and large individual rows can be recovered with lcm_expand(store_id=...) using content_offset
oversized raw messages continue with content_offset
lcm_expand(externalized_ref=...) pages payload content with content_offset
lcm_expand_query uses context_max_tokens for auxiliary context and reports truncation/pagination hints when needed

lossless-claw/OpenClaw import utility

hermes-lcm includes an opt-in operator script for backfilling raw message rows from a lossless-claw/OpenClaw LCM SQLite database into the local hermes-lcm SQLite store:

python scripts/import_lossless_claw.py \
  --source-db ~/.openclaw/path/to/lcm.db \
  --target-db ~/.hermes/lcm.db \
  --agent sammy

The script is intentionally conservative:

dry-run is the default; pass --apply to write
run it against an explicit target DB path, preferably while Hermes is stopped for that profile
writes create a timestamped target DB backup first when the target already exists
only raw messages are imported; summary DAG import is out of scope
imported rows keep explicit provenance in session_id and source, for example openclaw-lcm:agent:sammy:<source-session>
the default provenance identity is the concrete source conversations.session_id, preserving source session boundaries even when many conversations share one session_key
pass --session-identity session_key only when you intentionally want conversations with the same source session key grouped into one imported LCM session
reruns are idempotent for the same --import-id; the default import_id is path-derived, so pass a stable --import-id if you may import the same copied DB from different paths
changing --agent, --namespace, or --session-identity under the same --import-id is treated as the same import and will skip already-tracked source messages; use a new --import-id for a different mapping
no OpenClaw config or separate secret tables are imported, but raw transcripts and tool payloads are imported and may contain sensitive user data

This is a local archive migration path. It does not make LCM a general memory provider, and it does not change the current-session retrieval contract for agent tools.

Slash Commands

Slash commands are disabled by default. Enable them only in trusted operator contexts:

export LCM_ENABLE_SLASH_COMMAND=1

Available commands:

/lcm or /lcm status - current runtime/session status
/lcm doctor - read-only health checks
/lcm doctor clean - read-only scan for obvious junk/noise session candidates
/lcm doctor clean apply - backup-first cleanup for safe pattern-matched candidates; requires LCM_DOCTOR_CLEAN_APPLY_ENABLED=true
/lcm doctor repair - read-only SQLite/FTS repair diagnostics
/lcm doctor repair apply - backup-first SQLite/FTS repair
/lcm doctor source - read-only scan for legacy blank-source rows
/lcm doctor source apply - backup-first normalization of legacy blank-source rows to unknown
/lcm doctor retention - read-only retention analysis
/lcm backup - timestamped SQLite backup
/lcm help - command help

Apply paths are intentionally narrow and backup-first. Start with diagnostics before cleanup or repair.

How It Works

Ingest - persist each message in SQLite with FTS metadata
Compact - summarize older messages outside the fresh tail into D0 leaf nodes
Condense - merge same-depth nodes into higher-depth summaries
Escalate - shrink oversize summaries from detailed to bullets to deterministic truncate
Assemble - combine system prompt, highest-depth summaries, and fresh tail
Retrieve - use LCM tools to drill into compacted history or synthesize from expanded context

Development

Important files:

plugin.yaml      manifest
__init__.py      plugin registration and optional slash-command registration
engine.py        LCMEngine main orchestrator
store.py         SQLite message store and FTS
dag.py           summary DAG and FTS
config.py        env var defaults and overrides
command.py       /lcm command handlers
tools.py         lcm_grep, lcm_load_session, lcm_describe, lcm_expand, lcm_expand_query
schemas.py       tool schemas shown to the model
tests/           standalone pytest coverage

Run tests:

pip install pytest
python -m pytest tests/ -v

No Hermes Agent checkout is required for the test suite; tests include a lightweight ABC stub.

Contributing

Issues and PRs welcome. Bug fixes and correctness improvements are highest priority. New features should be scoped, backwards-compatible, and tested.

See CONTRIBUTING.md for branch, validation, and PR guidance. See the releases page for changelogs.

License

MIT