ClawVault LoCoMo evaluation by G9Pedro · Pull Request #161 · Versatly/clawvault

G9Pedro · 2026-03-11T03:01:47Z

Adds a LoCoMo QA benchmark suite to evaluate ClawVault's memory and retrieval performance against published baselines.

This benchmark suite provides an end-to-end pipeline to ingest LoCoMo conversation histories into ClawVault, perform retrieval and QA, and score answers against gold standards using LoCoMo's official metrics. It enables comparison of ClawVault's filesystem-native approach with competitors and RAG baselines.

Co-authored-by: G9Pedro <G9Pedro@users.noreply.github.com>

cursor · 2026-03-11T03:01:48Z

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
_{Learn more about Cursor Agents}

feat(benchmarks): add locomo benchmark suite and full run outputs

90f1700

Co-authored-by: G9Pedro <G9Pedro@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ClawVault LoCoMo evaluation#161

ClawVault LoCoMo evaluation#161
G9Pedro wants to merge 1 commit intomasterfrom
cursor/clawvault-locomo-evaluation-cce7

G9Pedro commented Mar 11, 2026

Uh oh!

cursor bot commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

G9Pedro commented Mar 11, 2026

Uh oh!

cursor bot commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants