Audit Agent Experience Skill by jay-sahnan · Pull Request #79 · browserbase/skills

jay-sahnan · 2026-04-25T22:12:34Z

Spawns parallel Claude subagents against a target docs/SDK/SKILL.md from a one-sentence prompt, captures structured traces, and renders a graded HTML report scoring Setup Friction, Speed, Efficiency, Error Recovery, and Doc Quality. Includes narrative cross-agent review to surface convergent hallucinations and silent workarounds the JSON self-report misses.

Note

Low Risk
Primarily adds new skill documentation/templates and a static prospecting profile, with no executable code changes beyond what future skill runners may follow.

Overview
Introduces a new audit-agent-experience skill, including a detailed SKILL.md playbook for running parallel subagent onboarding audits (config prompts, credential-handling guidance, trace parsing, scoring rubric, and report generation flow).

Adds supporting assets for the skill: an HTML report template (assets/report-template.html) plus reference docs (references/*) defining prompt variants, subagent trace schema/brief, and scoring rules, and includes a new MIT LICENSE.txt. Also adds a Browserbase prospecting profile JSON under skills/event-prospecting/profiles/.

Reviewed by Cursor Bugbot for commit 8f64258. Bugbot is set up for automated code reviews on this repo.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

jay-sahnan · 2026-05-09T19:54:44Z

@cursor review

cursor

Cursor Bugbot has reviewed your changes and found 3 potential issues.

shrey150

Pre-approving, quick fixes to make

shrey150 · 2026-05-05T16:18:47Z

+description: "Audit the developer experience of a product, SDK, docs site, or SKILL.md by dropping multiple Claude subagents at it with only a tiny task prompt and real tools (WebFetch, Bash, Write). Agents must discover the docs themselves, install deps, ask for credentials if needed, and attempt real execution. The skill captures each agent's trace — tool calls, retries, wall time, errors — and scores on Setup Friction, Speed, Efficiency, Error Recovery, and Doc Quality, then emits an HTML report with an A–F grade and concrete fixes. Use when the user asks to audit agent experience, test a skill, audit docs for agents, check if a SDK is agent-friendly, validate a SKILL.md, measure agent DX, or benchmark how painful onboarding is for an AI agent. Triggers: 'audit agent experience', 'test this skill', 'audit docs for agents', 'is my SDK agent-friendly', 'run a DX audit', 'agent experience test', 'test my docs', 'how do agents do with my product'."
+license: MIT
+metadata:
+  author: jay


Want to make this your GH username?

shrey150 · 2026-05-13T18:41:06Z

+
+Write the value into per-agent workspace `.env` files using the same generic names (`API_KEY`, `PROJECT_ID`, `SECRET`) as the paste flow — see Step 2. The discovery layer is upstream of injection; downstream behavior (generic names, agent must read docs to map them) is unchanged.
+
+**Orchestrator-retained credentials.** After writing per-agent `.env` files, the orchestrator keeps the **original product-specific names → values** (e.g. `BROWSERBASE_API_KEY`, `BROWSERBASE_PROJECT_ID`) available to itself for downstream verification work in Steps 6 / 6.5 / 8 — for example, calling the product's API with `curl` to confirm that a session ID an agent reported actually resolves, or fetching session metadata to enrich the report. The orchestrator can read them with `printenv` (no need to store anywhere — the parent shell already has them since auto-discover sourced them from there).


remove BROWSERBASE_PROJECT_ID
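
For reference, the injection behavior the quoted hunk describes (the orchestrator reads product-specific credentials from its own environment and writes only generic names into each agent's workspace) could be sketched roughly as below. This is a minimal illustration, not the skill's actual code: the function name `write_agent_env`, the `GENERIC_NAMES` mapping, and the workspace layout are all assumptions, and only the generic name `API_KEY` and the product-specific name `BROWSERBASE_API_KEY` come from the hunk itself.

```python
import os
import tempfile
from pathlib import Path

# Hypothetical mapping from a product-specific variable name to the generic
# name that agents see. Only the names themselves appear in the PR.
GENERIC_NAMES = {
    "BROWSERBASE_API_KEY": "API_KEY",
}


def write_agent_env(workspace: Path, environ: dict[str, str]) -> Path:
    """Write a per-agent .env file that exposes only generic credential names."""
    workspace.mkdir(parents=True, exist_ok=True)
    lines = [
        f"{generic}={environ[specific]}"
        for specific, generic in GENERIC_NAMES.items()
        if specific in environ
    ]
    env_path = workspace / ".env"
    env_path.write_text("\n".join(lines) + "\n")
    env_path.chmod(0o600)  # restrict the secret to the file owner
    return env_path


if __name__ == "__main__":
    # The orchestrator keeps the product-specific values in its own
    # environment (here os.environ) and never writes those names to disk.
    with tempfile.TemporaryDirectory() as tmp:
        path = write_agent_env(Path(tmp) / "agent-1", dict(os.environ))
        print(path.read_text())
```

Because the agent only ever sees `API_KEY`, it still has to read the product's docs to learn which variable the SDK expects, which is the friction the audit is meant to measure.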

jay-sahnan and others added 2 commits April 25, 2026 09:07

fix: updated license.txt

55c6fa8

cursor Bot reviewed Apr 25, 2026

bugbot fixes

6c33140

cursor Bot reviewed Apr 25, 2026

Comment thread skills/audit-agent-experience/assets/report-template.html

bugbot fixes

485b946

cursor Bot reviewed Apr 25, 2026

shrey150 self-requested a review May 5, 2026 16:18

bugbot fixes

8f64258

cursor Bot reviewed May 9, 2026

Comment thread skills/event-prospecting/profiles/browserbase.json Outdated

Comment thread skills/audit-agent-experience/assets/report-template.html Outdated

Comment thread skills/audit-agent-experience/SKILL.md Outdated

bugbot fixes

f1385a1

shrey150 approved these changes May 13, 2026

jay-sahnan wants to merge 6 commits into main from audit-agent-experience
