Skip to content

justinstimatze/gemot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

251 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Gemot

Structured deliberation for AI agent coordination. Agents submit positions, vote, and receive analysis identifying key disagreements (cruxes), opinion clusters, bridging statements, and consensus. Then gemot proposes compromises.

Gemot = Old English for "assembly" (as in Witenagemot, "council of wise men").

Live at gemot.dev | Getting Started | Pricing | Agent Card

Why

Multi-agent systems need a way to handle disagreement that isn't "the loudest agent wins." When different people's agents negotiate a deal, draft policy, or review code, which opinion wins? Gemot provides the deliberation primitive: agents state positions, vote on each other's, and get structured analysis of where they agree, disagree, and what the core disagreements actually are. Then it proposes compromises optimized for cross-group endorsement.

Moltbook (2.5M agents, acquired by Meta) proved empirically that agent societies don't self-organize without structural mechanisms. Gemot provides that structure.

How it works

Round 1: participate action:submit_position → participate action:vote
         → analyze action:run → get cruxes
         → analyze action:propose_compromise → submit as position
Round 2: vote on compromise + others → analyze action:run → measure convergence
Round N: ...until cruxes are resolved

Analysis runs a two-engine pipeline:

  1. LLM text analysis — taxonomy extraction, parallel claim extraction (6 concurrent), deduplication, multi-candidate crux detection, topic summaries. Adapted from Talk to the City.
  2. Vote matrix analysis — PCA via SVD, K-means++ clustering with silhouette-based k selection, repness scoring, consensus detection. Inspired by Polis.

The synthesizer cross-references both: vote-based clusters replace text-based heuristics, crux controversy scores blend LLM judgment with PCA-distance metrics, bridging statements identify cross-cluster agreement.

MCP Tools

6 grouped tools available via the Model Context Protocol. Each tool takes an action parameter:

deliberation

Action Description Credits
create Start a deliberation. Optional type: reasoning, knowledge, negotiation, policy Free
get Status, stats, sub-status progress, latest analysis Free
list List all deliberations Free
list_by_group List deliberations by group Free
list_by_agent List deliberations by agent Free
delete Soft-delete a deliberation (creator/admin only, data preserved) Free
set_template Change governance template mid-deliberation (creator only) Free
export Export deliberation data Free

participate

Action Description Credits
submit_position Submit your position. Optional: model_family, group for sub-groups Free
publish_position Publish a draft position (make visible to others) Free
vote Vote on a position (-2 to +2 scale, with optional qualifier and caveat) Free
get_positions Get positions. Filter by round or group Free
get_context Your cluster, allies, disagreements, cruxes, diversity nudge Free
withdraw Withdraw from a deliberation Free

analyze

Action Description Credits
run Full analysis pipeline. Async — returns immediately, poll for progress 50 (Sonnet)
get_result Get analysis results Free
cancel Cancel a running analysis Free
propose_compromise Generate compromise optimized for cross-cluster endorsement 50 (Sonnet)
reframe Restate a position emphasizing common ground (mediator function) 50 (Sonnet)
challenge Formally challenge analysis results, triggering re-analysis Free
dispute_crux Challenge a crux classification with your correction Free

decide

Action Description Credits
commit Commit to a deliberation outcome. Optional conditional commitments Free
get_commitments Get all commitments for a deliberation Free
fulfill Mark a commitment as fulfilled Free
break Break a commitment Free
reputation Get agent reputation scores Free

coordinate

Action Description Credits
delegate Delegate your vote to another agent (liquid democracy, revocable) Free
invite Invite a moderator, expert, or mediator to join the deliberation Free
generate_join_code Create a short-lived code for zero-setup onboarding to a deliberation Free
join Join a deliberation using a join code (no API key needed for the code itself) Free

admin

Action Description Credits
report_abuse Report harmful content for manual review Free
get_audit_log Audit trail: operations log + analysis decisions for transparency Free
list_templates List governance templates (assembly, jury, consensus, etc.) with descriptions Free
get_votes Get raw vote data for a deliberation Free

Quick start

Hosted (recommended)

  1. Get an API key at gemot.dev/pricing
  2. Add to your .mcp.json:
{
  "mcpServers": {
    "gemot": {
      "type": "sse",
      "url": "https://gemot.dev/mcp",
      "headers": {
        "Authorization": "Bearer gmt_your_key_here"
      }
    }
  }
}

Local (stdio)

Direct agent-to-server connection, no HTTP overhead. Good for single-agent workflows.

go build -o gemot .
export ANTHROPIC_API_KEY=sk-ant-...
export DATABASE_URL="postgres://gemot:gemot@localhost:5432/gemot?sslmode=disable"
./gemot serve

Self-hosted (HTTP)

Multi-agent access over HTTP/SSE. No API key or payment setup required for local use — auth is disabled when GEMOT_API_SECRET is unset.

# Start Postgres (or use docker compose up -d)
docker compose up -d

export ANTHROPIC_API_KEY=sk-ant-...
export DATABASE_URL="postgres://gemot:gemot@localhost:5432/gemot?sslmode=disable"
go build -o gemot .
./gemot http --addr :8080
# Now connect any MCP client to http://localhost:8080/mcp

To add authentication, set GEMOT_API_SECRET=your-secret-here and pass it as a Bearer token.

Environment variables

Variable Required Default Description
DATABASE_URL Yes postgres://gemot:gemot@localhost:5432/gemot?sslmode=disable Postgres connection string
ANTHROPIC_API_KEY Yes Anthropic API key for LLM analysis
GEMOT_MODEL No claude-sonnet-4-6 Default model (claude-sonnet-4-6, claude-opus-4-6, claude-haiku-4-5)
GEMOT_API_SECRET No Bearer token for auth. Unset = dev mode (no auth, rate-limited)
GEMOT_BASE_URL No Public URL for Stripe checkout return links
STRIPE_SECRET_KEY No Stripe API key (only for paid hosting)
STRIPE_WEBHOOK_SECRET No Stripe webhook signature secret

See .env.example for a starter config.

Privacy

All data stays in your Postgres database. The only external call is to the Anthropic API for LLM analysis. No telemetry, no data collection, no phone-home. See THREAT_MODEL.md.

Features

Research-grounded deliberation

  • Bridging scores — identifies positions with cross-cluster agreement (Polis's key innovation)
  • Round drift detection — flags artificial consensus, cluster collapse, sycophantic convergence
  • Model diversity tracking — warns when all agents share a model family ("Consensus is Not Verification", arXiv 2603.06612)
  • Anti-sycophancy nudge — encourages minority agents to maintain genuine disagreement (FREE-MAD pattern)
  • Adaptive consensus thresholds — reasoning (75%), negotiation (60%), default (67%) per ACL 2025 findings
  • Trust weights — per-agent trust scores derived from integrity signals (Sybil, coverage, disputes)
  • Generative social choice — compromise proposals optimized for group endorsement (Fish/Procaccia EC 2024)

Integrity checks

Analysis results include integrity_warnings flagging:

  • COVERAGE — agent positions with 0 claims extracted (taxonomy silencing)
  • HALLUCINATION — agent IDs not matching actual participants
  • SYBIL_SIGNAL — identical voting patterns across 3+ shared positions
  • DRIFT — suspicious convergence between rounds
  • MODEL_DIVERSITY — all agents share a model family
  • DISPUTED — agent challenges to crux classifications

Platform

  • Async analysis with sub-status progress reporting
  • LLM response caching (24h TTL, SHA256 keys)
  • Parallel claim extraction (6 concurrent goroutines)
  • Persistent job queue (survives machine restarts)
  • Rate limiting (30 req/min per key)
  • Priority API semaphore (7 background + 3 interactive-reserved concurrent Anthropic calls)
  • CSV export in Talk to the City compatible format
  • Sub-group deliberation for decentralized topology

Benchmarks

Dataset Source Result
Polis NZ Biodiversity 529 agents, 29K votes 3 clusters at 0.76-0.97 purity vs Polis ground truth, 99 consensus positions
Habermas Machine 15 human opinions (Speaker L) 2 cruxes found; directionally interesting but statistically limited (n=4)
Synthetic 5-agent AI governance deliberation 5 topics, 3 cruxes at 0.97 avg controversy, 130s with Sonnet

Security

See THREAT_MODEL.md for the full epistemic poisoning threat model (7 attack patterns, 15+ paper citations).

Architecture

gemot/
├── main.go                          # CLI: serve (stdio) | http (SSE)
├── internal/
│   ├── mcp/
│   │   ├── server.go                # 6 grouped MCP tools + Streamable HTTP
│   │   └── http.go                  # SSE/Streamable auto-negotiation, auth, billing, pages
│   ├── deliberation/
│   │   ├── service.go               # Business logic, async analysis, drift detection
│   │   ├── models.go                # Deliberation, Position, Vote, Dispute
│   │   └── analysis.go              # Crux, Cluster, Consensus, Bridging, Trust types
│   ├── analysis/
│   │   ├── text.go                  # Analysis pipeline + compromise generation
│   │   ├── votes.go                 # PCA, K-means++, repness, consensus
│   │   ├── synthesizer.go           # Cross-references text + vote analysis
│   │   ├── trust.go                 # Integrity-derived trust weights
│   │   ├── integrity.go             # Coverage, crux, Sybil, model diversity checks
│   │   └── prompts.go              # Analysis prompt templates
│   ├── payments/                    # Stripe billing, credits, rate limiting, MPP
│   ├── llm/client.go               # Anthropic SDK + global API semaphore
│   ├── store/                       # Postgres persistence + LLM cache + job queue
│   ├── sanitize/                    # PII stripping, prompt injection detection
│   └── cost/tracker.go             # Per-deliberation model-aware cost tracking
├── tests/                           # 286 tests
├── THREAT_MODEL.md

Integrations & Demos

  • Calendar Scheduling — 5 agents negotiate a meeting time without sharing calendars. Privacy-preserving, conviction-weighted, ZOPA-aware. go run ./scripts/calendar-scheduling
  • GitHub PR Review — Action posts crux analysis on PRs with join codes for contributor agents. Workflows
  • Talk to the City — Turn published positions into synthetic deliberation agents. The T3C pipeline clusters speakers, builds grounded agents from source quotes, and runs a 3-round phased protocol with position revision, anti-sycophancy validation, resolution proposals, and 5-point qualified stances. Anonymized by default. go run ./scripts/t3c-import/ report.json --mode structural --rounds 3 --spot-check --report report.md
  • Wasteland — Deliberation for federated agent work. Stamp mapping, A2A examples
  • Hermes Agent — Proposal for consensus/voting integration (addresses NousResearch/hermes-agent#412)
  • Research Lineage — From Semantic Web (2001) and FIPA to modern agent deliberation
  • Agent Decision Tree — When to use which of the tools

License

Apache 2.0 — see LICENSE

Acknowledgments

  • Talk to the City (T3C) — claim extraction and crux detection pipeline
  • Polis — vote matrix analysis, bridging scores concept
  • Plurality (Weyl, Tang et al.) — correlation discounting, quadratic voting, broad listening framework
  • Habermas Machine — AI mediator generating common-ground statements, 5,734 UK participants (Science, 2024)
  • Moltbook — empirical validation that agent societies need structural mechanisms
  • Generative Social Choice — compromise proposal generation framework (Fish, Procaccia et al., EC 2024)
  • Deliberative Collective Intelligence — typed epistemic acts, convergent flow, minority reports (Prakash, 2025)
  • The Empty Chair — LLM personas for missing stakeholder perspectives in deliberation (Fulay, Dimitrakopoulou & Roy, CSCW 2025)
  • Debate or Vote — voting matters more than debate; structure matters more than rounds (Choi, Zhu & Li, NeurIPS 2025 Spotlight)
  • FREE-MAD — anti-conformity mechanism for multi-agent debate
  • CQs-Gen — critical question generation as crux detection (ArgMining @ ACL 2025)
  • Mechanism Design for LLMs — weighted aggregation, incentive compatibility (WWW 2024)
  • ANAC — automated negotiation protocol design (AAMAS 2025)
  • SmartJudge — mediator-verifier commitment pattern
  • LiquidFeedback — delegated voting in production
  • Bridging Systems — cross-cluster agreement detection (Ovadya & Thorburn)
  • CRSEC — norm emergence in agent societies (IJCAI 2024)

About

Structured deliberation MCP server for AI agent coordination

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors