AgentOS Workbench

An AgentOS product by Frame.dev

React + Vite dashboard for inspecting AgentOS sessions locally. The goal is to give builders a zero-config cockpit that mirrors how Frame.dev debugs adaptive agents.

GMIs, Agents, and Agency

GMIs (Generalised Mind Instances) package persona prompts, memory policies, tool permissions, language preferences, and guardrail hooks into reusable minds.
Agents wrap GMIs for product surfaces (labels, icons, availability) while preserving the GMI’s cognition and policy.
Agencies coordinate multiple GMIs (and humans) via workflows; the workbench visualises WORKFLOW_UPDATE and AGENCY_UPDATE events in the timeline.

Benefits:

Cohesive cognition: one unit to version, export, and reuse across apps
Guardrail-first: policy decisions are streamed and auditable
Portable: same GMI across cloud/desktop/mobile/browser (capability-aware)

Highlights

Sidebar session switcher backed by a lightweight zustand store
Timeline inspector that renders streaming @framers/agentos chunks with color-coded context
Request composer for prototyping turns or replaying transcripts (wire it to your backend when ready)
Adaptive execution dashboard (task-outcome KPI, fail-open overrides, tool-exposure recovery state)
Multi-tenant telemetry slices (scope + routing mode visibility for single-tenant and multi-tenant runs)
Discovery telemetry visibility (default tool-selection mode + recall profile from runtime config/stream payload)
Dark, neon-drenched UI that matches the Frame.dev production command centre

Scripts

pnpm dev       # launch Vite dev server on http://localhost:5175
pnpm build     # production build (emits dist/)
pnpm preview   # preview the built app
pnpm lint      # eslint
pnpm typecheck
pnpm e2e       # all Playwright suites (including smoke + screenshots)
pnpm e2e:workbench      # split workbench suites only
pnpm e2e:core           # tabs/composer/personas/agency/header
pnpm e2e:eval-planning  # evaluation + planning flows
pnpm e2e:quality        # responsive/a11y/console scans
pnpm e2e:screenshots    # screenshot matrix
pnpm e2e:smoke:pw       # smoke.spec.ts via Playwright
pnpm e2e:smoke          # legacy smoke script (tsx e2e-test.ts)

Storage, export, and import

Data is stored locally in your browser using IndexedDB (no server writes).
Stored: personas (remote + local), agencies, and sessions (timeline events).
Export per-session from the timeline header: "Export session", "Export agency", "Export workflow".
Export everything from Settings → Data → "Export all" (also available in the timeline).
Import from Settings → Data → "Import…" (schema: agentos-workbench-export-v1).
Clear local data from Settings → Data → "Clear storage" (export first if needed).

See docs/CLIENT_STORAGE_AND_EXPORTS.md for details.

Wiring it up

Copy .env.example → .env.local (or set env vars in your shell) and point the workbench at your backend:

# Option A: explicit API base URL
VITE_API_URL=http://localhost:3001

# Option B: same-origin `/api/*` with dev proxy target
VITE_BACKEND_PORT=3001
VITE_BACKEND_HOST=localhost
VITE_BACKEND_PROTOCOL=http

VITE_AGENTOS_* overrides are still supported for specialized stream/persona/workflow path tuning.

In the backend, ensure provider keys are set and configure runtime if needed:

AGENTOS_WORKBENCH_BACKEND_PORT=3001
AGENTOS_WORKBENCH_BACKEND_HOST=0.0.0.0
AGENTOS_WORKBENCH_EVALUATION_STORE_PATH=../.data/evaluation-store.json
AGENTOS_WORKBENCH_PLANNING_STORE_PATH=../.data/planning-store.json

Start the backend (pnpm --filter backend dev) and then run the workbench (pnpm --filter @framersai/agentos-workbench dev).
Use Compose for turns, Evaluation for benchmark runs, and Planning for plan lifecycle experiments.

The client mirrors the streaming contracts from @framers/agentos, so backend responses flow straight into the UI with no reshaping.

Onboarding

A first-run guided tour highlights tabs and controls. You can "Remind me later" or "Don't show again" (saved locally).

AgentOS HTTP endpoints (quick list)

POST /api/agentos/chat — send a turn (messages, mode, optional workflow)
GET /api/agentos/stream — SSE stream for incremental updates
GET /api/agentos/personas — list personas (filters: capability, tier, search)
GET /api/agentos/workflows/definitions — list workflow definitions
POST /api/agentos/workflows/start — start a workflow
GET /api/evaluation/runs — list persisted evaluation runs
POST /api/evaluation/run — start a new evaluation run
GET /api/planning/plans — list persisted plans
POST /api/planning/plans — create a new plan

See docs/BACKEND_API.md for complete request/response shapes and examples.

Licensing

AgentOS core (@framers/agentos) — Apache 2.0
Marketplace and site components — MIT (vca.chat is the public marketplace we operate)

Links

Website: https://agentos.sh
Frame: https://frame.dev
Marketplace: https://vca.chat
GitHub: https://github.com/framersai/agentos
NPM: https://www.npmjs.com/package/@framers/agentos, https://www.npmjs.com/package/@framers/sql-storage-adapter

_{AgentOS product by Frame.dev}

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.github		.github
backend		backend
demo-automation		demo-automation
dist		dist
public		public
src		src
tests/e2e		tests/e2e
.env.example		.env.example
.env.local		.env.local
.eslintignore		.eslintignore
.eslintrc.cjs		.eslintrc.cjs
.gitignore		.gitignore
ACCESSIBILITY.md		ACCESSIBILITY.md
CHANGELOG.md		CHANGELOG.md
DESIGN_IMPROVEMENTS.md		DESIGN_IMPROVEMENTS.md
LICENSE		LICENSE
README.md		README.md
SEARCH_SETUP.md		SEARCH_SETUP.md
e2e-test.ts		e2e-test.ts
index.html		index.html
package.json		package.json
playwright.config.ts		playwright.config.ts
postcss.config.js		postcss.config.js
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
typedoc.json		typedoc.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AgentOS Workbench

An AgentOS product by Frame.dev

GMIs, Agents, and Agency

Highlights

Scripts

Storage, export, and import

Wiring it up

Onboarding

AgentOS HTTP endpoints (quick list)

Licensing

Links

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AgentOS Workbench

An AgentOS product by Frame.dev

GMIs, Agents, and Agency

Highlights

Scripts

Storage, export, and import

Wiring it up

Onboarding

AgentOS HTTP endpoints (quick list)

Licensing

Links

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages