Skip to content

ClawVault LoCoMo evaluation#161

Draft
G9Pedro wants to merge 1 commit intomasterfrom
cursor/clawvault-locomo-evaluation-cce7
Draft

ClawVault LoCoMo evaluation#161
G9Pedro wants to merge 1 commit intomasterfrom
cursor/clawvault-locomo-evaluation-cce7

Conversation

@G9Pedro
Copy link
Copy Markdown
Contributor

@G9Pedro G9Pedro commented Mar 11, 2026

Adds a LoCoMo QA benchmark suite to evaluate ClawVault's memory and retrieval performance against published baselines.

This benchmark suite provides an end-to-end pipeline to ingest LoCoMo conversation histories into ClawVault, perform retrieval and QA, and score answers against gold standards using LoCoMo's official metrics. It enables comparison of ClawVault's filesystem-native approach with competitors and RAG baselines.

Open in Web Open in Cursor 

Co-authored-by: G9Pedro <G9Pedro@users.noreply.github.com>
@cursor
Copy link
Copy Markdown

cursor bot commented Mar 11, 2026

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
Learn more about Cursor Agents

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants