GitHub - sliamh11/Deus: A token-efficient open-source AI assistant that remembers, adapts, and improves — secured, self-hosted, and entirely yours.

A personal AI that understands you - not just recalls things you've said. It learns what you care about, how you think, and what you'll actually find useful. The longer you use it, the more it feels like it gets you. Everything runs on your computer. Your data stays yours.

What it does

Understands you - It doesn't just store memories - it breaks conversations into facts, indexes by meaning, and builds a model of what you care about. Ask about something from three weeks ago and it recalls the details, even if you don't remember what you called it. Works in any language — Hebrew, Arabic, and other non-Latin scripts included. (95% recall on the LongMemEval benchmark; multilingual reranker active.)
Adapts to how you think - Scores its own responses, generates self-critiques, and rewrites its system prompt based on what worked. Tone, judgment, the kind of suggestions it surfaces - all of it improves at the personality level.
Picks up where you left off - Context carries over between sessions. Start a project Monday, come back Thursday, and it knows where you left off.
Lives where you already are - WhatsApp, Telegram, Slack, Discord, Gmail. Add only the ones you need. Your memory follows you across all of them.
Private by default - Runs on your machine in isolated containers. No cloud sync, no tracking, no data leaving your computer.
Works on your code too - Run deus in any project directory for a coding assistant that already knows your preferences and past work.

And more

Voice - Send a voice message and it transcribes and responds. Runs locally on Apple Silicon.
Vision - Send a photo or screenshot and it sees and responds to it.
Calendar - Reads and manages your Google Calendar. Ask what's coming up, or tell it to book something.
Scheduled tasks - Daily summaries, weekly recaps, reminders - set it and forget it.
Web & video - Summarize YouTube videos, fetch web pages, or research a topic, all from a chat message.
Self-maintaining docs - A weekly background agent scans for stale documentation and opens fix PRs automatically.
Reliable long sessions - Detects infinite tool-call loops and auto-summarizes large tool outputs so long sessions stay coherent.

Quick Start

What you need

macOS (Apple Silicon recommended), Linux, or Windows
Claude Code or Codex CLI installed and authenticated
- Codex with an API key is recommended — subscription-only auth disables hooks (see CLI)
Docker Desktop (handles WSL 2 on Windows automatically)
Node.js 20+, Python 3.11+
A Gemini API key (free tier is enough)
Ollama for local embeddings and scoring (not an agent backend) - /setup pulls the right models automatically based on your hardware
Optional: llama.cpp for a fully local, API-free agent backend — no per-turn cost, works offline. Run /add-llama-cpp to install.
Optional: free-claude-code proxy for using Claude Code CLI with alternative models (Ollama, llama-server, Gemini). Launch with deus fcc after configuring via deus provider and deus model.

Install

git clone https://github.com/sliamh11/Deus.git
cd Deus
claude            # or: codex

Then inside the CLI:

/setup

Setup installs dependencies, builds the container, and walks you through configuration. At the end it offers a Personality Kickstarter - choose a behavioral bundle (developer, student, universal) or pick individual behaviors, and optionally give it example conversations so it's useful from day one.

Connect a channel

A fresh install has zero channels. Add only what you need:

/add-whatsapp           # Scan QR code to connect WhatsApp
/add-telegram           # Paste bot token to connect Telegram

See AGENTS.md for all available skills.

Start talking

@Deus what's on my calendar tomorrow?
@Deus summarize the YouTube video at <url>
@Deus remind me every Monday morning what I worked on last week

Switching from another AI? Paste this into your current AI (ChatGPT, Gemini, etc.) and send the output to Deus in your first conversation:

I'm switching to a new AI assistant called Deus. Generate a structured summary
about me that I can give it so it knows me from day one. Include:

1. About me - name, role, location, languages
2. What I use AI for - main topics and tasks
3. Communication style - how I like responses
4. Preferences - things I've corrected you on
5. Key context - ongoing projects, goals, background

Be specific and factual. Skip anything generic. Format as plain text.

CLI

Command	What it does
`deus`	Launch in the current directory (project mode if outside `~/deus`)
`deus home`	Launch in home mode regardless of current directory
`deus codex`	Use OpenAI/Codex backend for this session
`deus fcc`	Launch with a local proxy model (Ollama, llama-server, Gemini)
`deus provider <name>`	Switch proxy provider (`ollama`, `llamacpp`, `gemini`)
`deus model <name>`	Switch proxy model (auto-prefixes active provider)
`deus model dashboard`	Open proxy admin UI in browser
`deus auth`	Rebuild and restart background services
`deus gcal`	Google Calendar token management (`status`, `auth`, `ping`)
`deus listen`	Record from mic, transcribe locally, copy to clipboard
`deus tui`	Full-screen terminal UI for chat, wardens, services, and channels
`deus pipeline`	Live pipeline monitor (default), or one-shot audit (`PROJ-123`, `--failed`, `--active`)
`deus usage`	Token-efficiency + cost report across all projects (`--since`, `--project`, `--pricing`, `--json`)
`deus backend`	Show active agent backend (`claude`, `codex`, `llama-cpp`)
`deus backend set <name>`	Switch backend for all future sessions

For direct Codex CLI sessions outside the deus launcher, register Deus memory recall as an MCP tool through the repo launcher:

codex mcp add deus-memory -- /path/to/deus/scripts/deus-memory-mcp

To mirror the repo's Warden gates in direct Codex CLI sessions, install the local Codex hooks:

python3 scripts/codex_warden_hooks.py install --dry-run
python3 scripts/codex_warden_hooks.py install
python3 scripts/codex_warden_hooks.py check

Codex auth modes and hooks: Codex supports two authentication modes: API key (OPENAI_API_KEY) and subscription/OAuth (codex login). Warden hooks require an API key — subscription-only auth cannot enable the [features].codex_hooks flag, so no quality gates, memory retrieval, or safety checks will fire. For the full Deus experience with Codex, use an API key. See Multi-backend for setup and Hook Dispatch System for the architectural rationale.

Linear Automation

Use Linear as a Kanban command center for autonomous agent work. Move an issue to Ready for Agent and Deus picks it up, implements it in a container, opens a PR, and optionally merges it -- without waiting for you.

How it works

Issues move through five stages: Todo → Ready for Agent → Agent Working → In Review → Done. Three quality checks fire automatically as issues move through the board:

Check	Fires on	What it does
agent-readiness-gate	Todo → Ready for Agent	Scopes the issue: implementation plan, acceptance criteria, effort/complexity ratings
output-quality-gate	Agent Working → In Review	Verifies the agent produced a real deliverable (PR, document, etc.)
completion-gate	In Review → Done	Checks all acceptance criteria are met and PR is merged

When LINEAR_AUTO_MERGE=1, Deus automatically merges the agent's PR once CI passes.

Each issue gets a single rolling Pipeline Log comment that tracks every event (gate verdicts, agent dispatch, PR creation, merge) -- no comment spam.

Under the hood: event-driven orchestrator

The pipeline is event-driven. Instead of each step writing directly to Linear and the event log, the orchestrator emits typed events onto an in-process event bus, and independent listeners react:

one listener acts on Linear (e.g. moves an issue to In Review when an agent finishes),
another records every event to the pipeline log that backs deus pipeline and the rolling comment.

This pub/sub seam keeps the dispatcher, webhook gates, and message path decoupled -- new behavior plugs in as a listener without touching the producers. See Event hub architecture for the bus contract, the strangler migration, and the phase roadmap.

Setup

/add-linear    # Gives Deus read/write access to your Linear workspace

Then configure the automation layer in .env:

LINEAR_API_TOKEN=lin_api_...    # Linear personal API key
LINEAR_WEBHOOK_SECRET=...       # For webhook signature verification
# LINEAR_AUTO_MERGE=1           # Optional: auto-merge agent PRs after CI

Quality checks require a public URL to receive Linear webhook events. For local dev, use ngrok (ngrok http 3005). Register the URL in Linear Settings → API → Webhooks. Dispatch (polling) works without a webhook URL.

See Linear automation architecture for the full setup, gate spec format, and configuration reference.

Pipeline monitor

Run deus pipeline to open a live dashboard backed by a webhook-fed SQLite cache (2s refresh when cached, 10s fallback when polling API):

deus pipeline                        # Live monitor (default)
deus pipeline PROJ-123               # Full timeline for an issue
deus pipeline --failed --since 24h   # Failures in the last 24 hours
deus pipeline --active               # One-shot active view

Usage report

Run deus usage for a per-model token-efficiency and cost report across all your Claude Code projects (deduped by API response):

deus usage                           # Full report, all projects, all time
deus usage --since 2026-05-01        # Scope to a date window
deus usage --project myrepo          # Filter to one project (by dir substring)
deus usage --pricing none            # Efficiency ratios only, no cost column
deus usage --rates ~/rates.json      # Override per-model rates (API-direct pricing)
deus usage --json                    # Machine-readable output

The report breaks down per project first (heaviest first; worktrees fold into their repo and ephemeral temp runs bucket together), then gives the all-projects per-model totals. The efficiency layer is pricing-independent: per-model cacheRead:output (context drag per generated token), cacheRead:cacheCreation (cache amortization), and output-share. The cost layer prices tokens via a built-in per-model table (notional for subscription users; the real bill for API-direct users) and degrades gracefully to "—" for models it has no rate for. Absolute token totals may differ slightly from ccusage (which counts duplicated subagent messages differently); the efficiency ratios are unaffected. Direct background/proxy model calls that do not flow through Claude Code are not yet captured (tracked separately).

Vault sync

Linear is the source of truth for pending tasks. The vault's CLAUDE.md pending: block stays in sync automatically:

Webhook push: when issues change in Linear, the webhook handler updates the vault file within ~2 seconds (debounced).
Session-start pull: a SessionStart hook queries Linear on every new Claude Code session, ensuring freshness at the moment it matters most.
/compress sync: the /compress skill pulls the full active issue list from Linear and rebuilds the pending block.

Adding or customizing gates

Gates are plain markdown files in .claude/agents/wardens/. Adding a gate is one file with YAML frontmatter -- no code change:

---
name: my-custom-gate
gate_to: "In Review"
allowed_from: ["Agent Working"]
mode: advise          # or strict (reverts on non-SHIP)
cooldown_minutes: 60
---
Your gate prompt here...

Comparison

	Deus	OpenClaw	NemoClaw	Hermes Agent	Plain Claude
Memory	Understands you - indexes facts by meaning, recalls in context	Markdown files	Via OpenClaw	Full-text search + preference profiling	Conversation only
Learning	Adapts at the personality level - tone, judgment, suggestions	No	No	Auto-creates & refines skills	No
Channels	5 (WhatsApp, Telegram, Slack, Discord, Gmail)	10+	Via OpenClaw	15+ (WhatsApp, Telegram, Signal, Matrix...)	None
Isolation	Container per conversation	Opt-in Docker	Landlock + seccomp	Per-session	None
LLM support	Claude default, OpenAI/llama.cpp opt-in	Any provider	Any (via OpenClaw)	Any (10+ providers)	Claude only
Setup	~5 min	~15 min	~20 min	~10 min	N/A
Repo size	~13 MB	~592 MB	~22 MB	~147 MB	N/A

Deus goes deep on understanding you and adapting over time. Hermes goes wide on channels and LLM flexibility. See docs/benchmarks.md for detailed numbers.

Docs

Topic
How it works	Architecture
Memory system	Architecture - Memory
Self-improvement loop	Architecture - Evolution
Developer tools (CodeGraph, code_search, etc.)	Architecture - Developer Tools
Security model	Security
Benchmarks & token costs	Benchmarks
Environment variables	Environment
Using different AI backends	Multi-backend
Local backend (llama.cpp)	Multi-backend — llama.cpp
Backend quality benchmark	Claude vs Codex parity report
Development setup	Development
Contributing	Contributing
Known limitations	Limitations
Linear automation	Setup, gates, and pipeline
Event hub architecture	Orchestrator Event Hub
Hook dispatch architecture	Hook Dispatch System

Contributing

PRs welcome. Every change goes through a pull request - no direct pushes to main. See CONTRIBUTING.md for the full guide.

Support

Built and maintained solo - no company, no funding. If Deus is useful to you, sponsoring helps keep it going.

Acknowledgments

Built on NanoClaw - thanks to the NanoClaw team for the foundation.

Multi-model proxy support powered by free-claude-code - a local reverse proxy that enables Claude Code CLI to work with alternative LLM providers.

License

MIT

Contributing

See CONTRIBUTING.md.

Name		Name	Last commit message	Last commit date
Latest commit History 759 Commits
.claude		.claude
.github		.github
.husky		.husky
.mex		.mex
assets/brand-production		assets/brand-production
config-examples		config-examples
container		container
demo		demo
docs		docs
eval		eval
evolution		evolution
groups		groups
integrations/gcal		integrations/gcal
launchd		launchd
migrations		migrations
packages		packages
patterns		patterns
scripts		scripts
seeds		seeds
setup		setup
src		src
tests		tests
tui		tui
.claudeignore		.claudeignore
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.nvmrc		.nvmrc
.prettierrc		.prettierrc
.release-please-manifest.json		.release-please-manifest.json
AGENTS.md		AGENTS.md
AI_AGENT_GUIDELINES.md		AI_AGENT_GUIDELINES.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
commitlint.config.mjs		commitlint.config.mjs
deus-cmd.ps1		deus-cmd.ps1
deus-cmd.sh		deus-cmd.sh
eslint.config.js		eslint.config.js
package-lock.json		package-lock.json
package.json		package.json
release-please-config.json		release-please-config.json
setup.sh		setup.sh
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts
vitest.skills.config.ts		vitest.skills.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

What it does

Quick Start

What you need

Install

Connect a channel

Start talking

CLI

Linear Automation

How it works

Under the hood: event-driven orchestrator

Setup

Pipeline monitor

Usage report

Vault sync

Adding or customizing gates

Comparison

Docs

Contributing

Support

Acknowledgments

License

Contributing

About

Uh oh!

Releases 18

Sponsor this project

Uh oh!

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

What it does

Quick Start

What you need

Install

Connect a channel

Start talking

CLI

Linear Automation

How it works

Under the hood: event-driven orchestrator

Setup

Pipeline monitor

Usage report

Vault sync

Adding or customizing gates

Comparison

Docs

Contributing

Support

Acknowledgments

License

Contributing

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 18

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages