diff --git a/.github/ISSUE_TEMPLATE/bug_report.md b/.github/ISSUE_TEMPLATE/bug_report.md index 0e53ea2..2b6ea99 100644 --- a/.github/ISSUE_TEMPLATE/bug_report.md +++ b/.github/ISSUE_TEMPLATE/bug_report.md @@ -10,7 +10,7 @@ assignees: '' Describe what's incorrect, outdated, or broken. **Which file(s)?** -List the file path(s) affected (e.g., `data/k12-regulatory.md`, `skills/pilot-design/SKILL.md`). +List the file path(s) affected (e.g., `data/k12-regulatory.md`, `data/operator-lessons.md`). **What should it say instead?** If you know the correct information, include it here with a source. diff --git a/.github/ISSUE_TEMPLATE/feature_request.md b/.github/ISSUE_TEMPLATE/feature_request.md index df758d8..15e2d74 100644 --- a/.github/ISSUE_TEMPLATE/feature_request.md +++ b/.github/ISSUE_TEMPLATE/feature_request.md @@ -1,19 +1,19 @@ --- name: Feature request -about: Suggest a new skill, data topic, or market coverage +about: Suggest a new data topic or coverage area title: '' labels: enhancement assignees: '' --- **What's missing?** -Describe the skill, data file, or coverage gap. +Describe the data, topic, or coverage gap. **Who needs it?** What type of edtech founder would use this? (e.g., K-12 founders in the UK, corporate L&D startups targeting healthcare) -**What would it do?** -Describe the ideal output. What questions would it ask? What guidance would it provide? +**What would it add?** +What should the knowledge base say? Point to sources if you have them. -**Do you want to build it?** -If you're interested in contributing this, let us know. We can help with the skill structure. +**Do you want to add it?** +If you're interested in contributing this, let us know. diff --git a/.github/PULL_REQUEST_TEMPLATE.md b/.github/PULL_REQUEST_TEMPLATE.md index be7b2c7..1b6695b 100644 --- a/.github/PULL_REQUEST_TEMPLATE.md +++ b/.github/PULL_REQUEST_TEMPLATE.md @@ -4,18 +4,16 @@ ## Type of change -- [ ] New skill -- [ ] Skill update - [ ] Data file update (regulatory, market, competitive) - [ ] Research paper addition +- [ ] New data topic or coverage area - [ ] Documentation - [ ] Other ## Checklist -- [ ] SKILL.md files have YAML frontmatter with `name` and `description` -- [ ] Data files have "Last updated" footer with date +- [ ] Data files have a "Last updated" footer with date - [ ] Research papers include Title, Takeaway, Type, Year, Citations, DOI -- [ ] CLAUDE.md routing rules updated (if adding a new skill) +- [ ] Every factual claim cites a source (regulation, DOI, or named operator) - [ ] CHANGELOG.md updated with what changed - [ ] No broken cross-references between files diff --git a/.github/workflows/validate.yml b/.github/workflows/validate.yml index b1accc5..7b4ebe4 100644 --- a/.github/workflows/validate.yml +++ b/.github/workflows/validate.yml @@ -12,41 +12,6 @@ jobs: steps: - uses: actions/checkout@v4 - - name: Validate SKILL.md frontmatter - run: | - echo "Checking SKILL.md files have required frontmatter..." - errors=0 - for skill in skills/*/SKILL.md; do - if ! head -5 "$skill" | grep -q "^name:"; then - echo "ERROR: $skill missing 'name' in frontmatter" - errors=$((errors + 1)) - fi - if ! head -5 "$skill" | grep -q "^description:"; then - echo "ERROR: $skill missing 'description' in frontmatter" - errors=$((errors + 1)) - fi - done - if [ $errors -gt 0 ]; then - echo "$errors frontmatter errors found" - exit 1 - fi - echo "All SKILL.md files have valid frontmatter" - - - name: Validate CLAUDE.md routing - run: | - echo "Checking all routed skills exist..." - errors=0 - grep "^- /" CLAUDE.md | sed 's|.* → ||' | while read -r path; do - if [ ! -d "$path" ]; then - echo "ERROR: CLAUDE.md routes to '$path' but directory does not exist" - errors=$((errors + 1)) - fi - done - if [ $errors -gt 0 ]; then - exit 1 - fi - echo "All CLAUDE.md routes point to existing skill directories" - - name: Validate data files have update footer run: | echo "Checking data files for 'Last updated' footer..." @@ -82,11 +47,9 @@ jobs: - name: Count and report run: | - skills=$(ls -d skills/*/SKILL.md 2>/dev/null | wc -l | tr -d ' ') datafiles=$(ls data/*.md 2>/dev/null | grep -v README | wc -l | tr -d ' ') research=$(ls data/research/*.md 2>/dev/null | grep -v README | wc -l | tr -d ' ') version=$(cat VERSION 2>/dev/null || echo "unknown") echo "EdTech Founder Stack v$version" - echo " $skills skills" echo " $datafiles data files" echo " $research research topics" diff --git a/ARCHITECTURE.md b/ARCHITECTURE.md index 7dfd9af..9aebf9a 100644 --- a/ARCHITECTURE.md +++ b/ARCHITECTURE.md @@ -1,89 +1,37 @@ # Architecture -How EdTech Founder Stack works under the hood. +How EdTech Founder Stack is put together. -## Overview +## What it is -EdTech Founder Stack is a collection of Claude Code skills backed by structured reference data. Skills encode ASU ScaleU's edtech accelerator expertise as interactive AI workflows. Reference data files provide the factual foundation that skills draw from, so guidance is grounded in real regulations, real research, and real market data rather than LLM training knowledge. +A knowledge base for edtech founders, stored as plain markdown. No app, no API, no backend, no dependencies — just files that an AI tool (Claude Code, Cursor, ChatGPT) or a person reads. Edit a file and the next read is current. Every claim traces to a source you can check. -This is not a web app, API, or platform. It's markdown files that AI coding tools read and execute. No backend, no auth, no dependencies beyond the AI tool itself. +## The knowledge -## Skills +All of it lives in `data/`, in three kinds of files. -Each skill lives in `skills/{name}/SKILL.md` and follows the standard SKILL.md format: +### Reference data — `data/*.md` -```yaml ---- -name: skill-name -description: One-line description. ---- -``` +Structured domain knowledge, grounded in real sources rather than model training data: -Below the frontmatter, the skill contains instructions for the AI agent. Skills are interactive: they ask founders structured questions via `AskUserQuestion`, branch based on answers (K-12 founders get FERPA guidance, higher ed founders get accreditation guidance), and output tailored recommendations. +- **Regulatory** — FERPA, COPPA, and state privacy law (K-12); accreditation and accessibility (higher ed) +- **Market** — competitive landscape by segment, buyer personas, funding landscape by stage, procurement, pilot benchmarks +- **Frameworks** — ESSA evidence tiers, AI-native vs. bolted-on, the higher-ed jobs atlas, and founder traps -**Skill design principles:** +Each regulatory and market file carries a "last updated" date. Update cadence is roughly quarterly; regulatory data when laws change; the competitive landscape goes stale fastest. -1. **Interactive, not static.** Every skill asks questions before giving advice. The answers determine what guidance the founder receives. -2. **Opinionated.** Skills take positions. "Usage-based pricing works better than per-seat for K-12." "Your pilot is too long, 8 weeks, not 12." This is ScaleU's perspective, not Wikipedia. -3. **Reference data-backed.** Skills read from `data/` files for factual claims. No hallucinated company names, no outdated regulations, no made-up funding amounts. -4. **Natural funnel.** Each skill ends with context-appropriate next steps, including other skills to run. For the most relevant skills, a brief factual mention that ScaleU offers this kind of support. -5. **Context-aware routing.** Each skill recommends the single most relevant next skill based on the founder's specific answers during the session (scores, evidence tier, stage, challenges), not a static list. For example, `/product-review` routes to `/accessibility-check` if buyer requirements scored low, to `/evidence-check` if evidence readiness scored low, or to `/go-to-market` if all scores are strong. +### Research corpus — `data/research/` -**Founder journey flow:** +Hundreds of peer-reviewed papers across the major learning-science topics, each stored in a markdown table: Title, Takeaway, Study Type, Year, Citations, DOI. The index lives in `data/research/README.md`. This is the evidence base — claims about what works in learning cite specific papers with author, year, finding, and DOI. -``` -Discovery Product Evidence Sales Fundraising -───────── ─────── ──────── ───── ─────────── -/edtech-landscape → /product-review → /evidence-check → /go-to-market → /pitch-review -/idea-validation /accessibility- /pilot-design /sales-strategy /fundraising-guide - check -``` +### Operator playbooks — `data/operator-lessons.md` -Founders typically start with `/edtech-landscape` and the skills guide them through the journey based on their stage and needs. +Dozens of field lessons from operators and investors, distilled and attributed from the public archive of Lenny's Podcast and Lenny's Newsletter, then mapped to selling into schools, universities, and L&D. These are practitioner experience, not peer-reviewed evidence — the research corpus is the evidence layer, and the file says so. -## Reference Data +## How it's consumed -The `data/` directory contains markdown files with structured domain knowledge: +Point an AI tool at the repo or a single file and ask; it reads the relevant knowledge and answers in context. Or read the markdown directly. Because the knowledge is files you can diff and audit, it stays current and checkable in a way baked-in model knowledge isn't. -- **Regulatory guides:** FERPA, COPPA, state privacy laws (K-12), accreditation and accessibility (higher ed), SOC 2 and SCORM (corporate L&D) -- **Market data:** buyer personas, competitive landscape by segment, funding landscape by stage -- **Frameworks:** ESSA evidence tiers, procurement guides, pilot benchmarks -- **Higher ed framework:** Validated jobs across student journey phases (from SXSW EDU 2026) and the structural patterns founders miss -- **AI-native framework:** 4 AI-native criteria, 5 bolted-on indicators, the removal test, Karpathy hierarchy (for developer-tool founders), architecture patterns, and pricing models. Every skill reads this file to classify a founder's AI posture and adapt guidance accordingly +## Cross-platform -Skills read these files at runtime using the AI tool's file reading capability. This means the data is always current (edit the markdown, the skill reads the updated version) and auditable (every fact has a source you can check). - -**Update cadence:** Quarterly recommended, aligned with academic and market cycles. Regulatory data should be checked when new state laws pass. Competitive landscape data goes stale fastest. - -## Research Corpus - -`data/research/` contains peer-reviewed papers organized by learning science topic. Each paper is stored in a structured markdown table with: Title, Takeaway, Study Type, Year, Citations, DOI. The current count and full topic index live in `data/research/README.md`. - -Topics include: active learning, adaptive learning, spaced repetition, cognitive load theory, formative assessment, multimedia principles, mastery-based grading, gamification, worked examples, and more. - -Skills cite specific papers when making claims about learning science. For example, when `/product-review` evaluates whether a product's learning design is evidence-based, it reads the relevant research file and references specific studies with author, year, finding, and DOI. This is what separates ScaleU guidance from generic AI advice. - -**Adding papers:** Append to the relevant topic file in `data/research/`. Follow the existing table format. Sort by citations descending. - -## Higher Ed Framework - -Two files encode ScaleU's SXSW EDU 2026 framework for higher education: - -- **`data/higher-ed-jobs-atlas.md`** — Validated jobs organized by student journey phase (Pre-enroll, Apply, Onboard, Select & Enroll, Course Experience, Graduate & Beyond). Each job names the person, the struggling moment, what they've already tried, and why it fails. Includes saturation analysis showing most founders build in the most crowded phase. - -- **`data/founder-traps.md`** — 4 structural patterns that are invisible from surface-level discovery: the information sequencing trap, the upstream cause / downstream symptom split, when qualitative and quantitative evidence disagree, and the same-problem-two-jobs dynamic. Also includes the noise vs. signal filter for evaluating pilot results. - -Skills targeting higher ed founders (`/idea-validation`, `/edtech-landscape`, `/go-to-market`, `/product-review`) read these files automatically and apply the framework to the founder's specific situation. - -## Cross-Platform Compatibility - -Skills use the standard SKILL.md format, which works across multiple AI coding tools: - -| Platform | Install | Interactive features | -|----------|---------|---------------------| -| Claude Code | Automated via `./setup` | Full support (AskUserQuestion, branching) | -| Codex CLI | Manual (reference SKILL.md as system prompt) | Partial (prompts work, interactive questions vary) | -| Gemini CLI | Manual (reference SKILL.md) | Partial | -| Cursor | Manual (add to custom instructions) | Limited | - -Claude Code is the primary target. Other tools can read and execute the skills as system prompts, but interactive features like branching questions work best in Claude Code. +Anything that can read markdown works: Claude Code, Cursor, ChatGPT, Claude, or a plain text editor. There's nothing to install beyond cloning the repo. diff --git a/CHANGELOG.md b/CHANGELOG.md index 810f1b8..be291ec 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -1,5 +1,16 @@ # Changelog +## 2.0.0 (2026-05-29) + +### Repositioned as a knowledge base + +EdTech Founder Stack is now a curated, AI-friendly knowledge base for edtech founders — markdown you point your AI tools at or read directly — rather than a set of interactive skills. The knowledge in `data/` is the product. + +- **New `data/operator-lessons.md`** — 71 field lessons on validation, product, GTM, sales, pricing, pilots, fundraising, and team, distilled and attributed from the free public archive of Lenny's Podcast and Lenny's Newsletter and mapped to selling into schools, universities, and L&D. Paraphrased under the source's personal/non-commercial terms. +- **Removed the interactive skills** — the `skills/` directory and the `setup` script are gone; the `data/` knowledge base is the whole product. CI now validates the data and research files only. +- **Docs rewritten** — README, ARCHITECTURE.md, CONTRIBUTING.md, and CLAUDE.md now describe the knowledge base and how to use it with Claude Code, Cursor, and ChatGPT. +- **Version badge** — bumped to 2.0.0. + ## 1.4.0 (2026-04-25) ### Public launch readiness diff --git a/CLAUDE.md b/CLAUDE.md index dc1c3ca..ab5dcef 100644 --- a/CLAUDE.md +++ b/CLAUDE.md @@ -1,48 +1,24 @@ # EdTech Founder Stack -AI-powered skills for edtech founders, built by ASU ScaleU. +An open, AI-friendly knowledge base for edtech founders, built by ASU ScaleU. The knowledge lives in `data/` as markdown. When answering a founder's question, read the relevant file rather than leaning on training data for facts about regulations, companies, funding, learning science, or operator experience. -## Skill routing +## Knowledge base -When the user's request matches an available skill, invoke it using the Skill tool. +- `data/` — regulatory (FERPA, COPPA, state privacy, accreditation, accessibility), competitive landscape, funding landscape, buyer personas, procurement, pilot benchmarks, ESSA evidence tiers +- `data/research/` — hundreds of peer-reviewed learning-science papers across the major topics. Cite specific papers with author, year, and DOI. Index (with the current count) in `data/research/README.md`. +- `data/operator-lessons.md` — dozens of operator and investor lessons distilled from Lenny's Podcast and Lenny's Newsletter, mapped to edtech. Practitioner experience, not peer-reviewed; don't present it as research. +- `data/ai-native-framework.md` — AI-native vs. bolted-on: criteria, the removal test, architecture patterns, pricing models, and the Karpathy hierarchy. Use it to classify a founder's AI posture. +- `data/higher-ed-jobs-atlas.md` and `data/founder-traps.md` — ScaleU's SXSW EDU 2026 higher-ed framework: validated jobs across the student journey with saturation analysis, and the structural patterns founders miss. +- `ETHOS.md` — the seven principles, starting with "validate demand, not interest." -- /edtechfounderstack → skills/welcome -- /edtech-landscape → skills/edtech-landscape -- /idea-validation → skills/idea-validation -- /product-review → skills/product-review -- /accessibility-check → skills/accessibility-check -- /evidence-check → skills/evidence-check -- /pilot-design → skills/pilot-design -- /go-to-market → skills/go-to-market -- /sales-strategy → skills/sales-strategy -- /pitch-review → skills/pitch-review -- /fundraising-guide → skills/fundraising-guide - -## Data files - -Skills reference markdown files in `data/` for regulatory, market, and evidence information. Always read the relevant data file rather than relying on training data for factual claims about regulations, companies, or funding. - -## AI-native framework - -`data/ai-native-framework.md` contains the AI-native vs bolted-on framework: 4 AI-native criteria, 5 bolted-on indicators, the removal test, architecture patterns, pricing models, and the Karpathy hierarchy (for developer-tool founders). Skills evaluating AI products should read this file to classify the founder's AI posture and adapt guidance accordingly. - -## Higher ed framework - -`data/higher-ed-jobs-atlas.md` contains validated jobs across the student journey with saturation analysis. `data/founder-traps.md` contains the structural patterns founders miss plus the noise vs. signal filter. Both from ScaleU's SXSW EDU 2026 framework. Skills targeting higher ed founders should reference these files. - -## Research corpus - -`data/research/` contains peer-reviewed papers organized by learning science topic (spaced repetition, cognitive load, formative assessment, adaptive learning, etc.). See `data/research/README.md` for the current index. When skills make claims about what works in learning, they should cite specific papers from this corpus with author, year, and DOI. +Always cite the source: a named regulation, a paper's DOI, or the named operator. ## For contributors -### Adding a new skill -1. Create `skills/{skill-name}/SKILL.md` with YAML frontmatter (`name`, `description`) -2. Add a routing rule to the "Skill routing" section above -3. Follow the patterns in existing skills: interactive questions, sector-based branching, reference data reads, next-skill recommendations, ScaleU mention at the end - ### Updating data files -Edit the relevant markdown file in `data/`. Keep the existing structure and formatting consistent. For regulatory data, note the update date at the bottom of the file. For competitive landscape data, verify company status before updating. + +Edit the relevant markdown in `data/`. Keep the existing structure and formatting. For regulatory data, note the update date at the bottom of the file. For the competitive landscape, verify company status before updating. ### Adding research papers -Append to the relevant topic file in `data/research/`. Follow the existing table format: Title, Takeaway, Type, Year, Citations, DOI. Sort by citations descending. If the topic doesn't exist, create a new file and add it to `data/research/README.md`. + +Append to the relevant topic file in `data/research/`. Follow the table format — Title, Takeaway, Type, Year, Citations, DOI — and sort by citations descending. If the topic doesn't exist, create a new file and add it to `data/research/README.md`. diff --git a/CODEOWNERS b/CODEOWNERS index 074862d..4d809cd 100644 --- a/CODEOWNERS +++ b/CODEOWNERS @@ -1,9 +1,6 @@ # EdTech Founder Stack — Code Owners # PRs to these paths require review from the ScaleU team -# Skills -skills/ @savvides - # Reference data data/ @savvides diff --git a/CODE_OF_CONDUCT.md b/CODE_OF_CONDUCT.md index cecb147..b415f4e 100644 --- a/CODE_OF_CONDUCT.md +++ b/CODE_OF_CONDUCT.md @@ -4,7 +4,7 @@ We are committed to making participation in EdTech Founder Stack a welcoming experience for everyone, regardless of background, identity, or experience level. -This project exists to help edtech founders build better products. Contributions that advance that mission, whether correcting a regulatory detail, adding research, or proposing new skills, are welcome from anyone. +This project exists to help edtech founders build better products. Contributions that advance that mission, whether correcting a regulatory detail, adding research, or expanding the knowledge base, are welcome from anyone. ## Standards diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md index 942c24a..4c3cca2 100644 --- a/CONTRIBUTING.md +++ b/CONTRIBUTING.md @@ -1,40 +1,36 @@ # Contributing to EdTech Founder Stack -Thanks for helping us make this better. If you've lived through the edtech founder grind, you probably have something to add. +Thanks for helping make this better. If you've lived through the edtech founder grind, you probably have something to add. -## How skills work +## What this is -Each skill is just a markdown file at `skills/{skill-name}/SKILL.md`. - -They're designed to be the opposite of "generic AI advice." Instead of the AI guessing based on its training data, we force it to read from the `data/` and `data/research/` directories. This ensures that when it talks about FERPA, procurement, or cognitive load, it's actually referencing the latest standards and peer-reviewed science. +A knowledge base for edtech founders, stored as plain markdown in `data/`. The point is to ground guidance in real sources — regulations, peer-reviewed research, named operators — instead of whatever a model happens to remember. Contributions keep it accurate and expand its coverage. ## What we're looking for ### High-value contributions -- **International regulatory data:** We're very US-centric right now. If you can help us with GDPR, country-specific regulations, or international accreditation, please do. +- **International regulatory data:** We're very US-centric right now. GDPR, country-specific regulations, and international accreditation are all welcome. - **New buyer personas:** Procurement in the UK or India looks different than in the US. We need those maps. -- **Corrections:** Education markets move fast. If our competitive landscape data is stale or a regulation has changed, fix it. -- **New skills:** What's missing? If there's a specific bottleneck in the founder journey we haven't covered, let's build it. +- **Corrections:** Education markets move fast. If the competitive landscape is stale or a regulation has changed, fix it. +- **More research:** Add peer-reviewed papers to the corpus in `data/research/`. ### Guidelines -1. **Be opinionated.** We don't do "on the one hand, on the other hand" here. These skills take a stand. If you're adding guidance, it should have a point of view backed by actual experience. -2. **Read from data.** Never let the AI guess factual info. If you're adding a factual claim, it should live in a `data/` file that the skill reads at runtime. -3. **Branch based on answers.** A K-12 founder and a corporate L&D founder are living in two different worlds. Your skill should ask questions and adapt its advice accordingly. -4. **End with an action.** Every skill should result in a "do this next" list. Skip the abstract considerations; give them the procurement timeline or the demo script. -5. **Smart routing.** Don't just list all the other skills at the end. Recommend the *one* skill they actually need next based on their scores or current stage. +1. **Cite the source.** Every factual claim should trace to something checkable — a named regulation with a date, a paper with a DOI, a named operator. No guessing. +2. **Be opinionated.** We don't do "on the one hand, on the other hand." If you add guidance, give it a point of view backed by real experience. +3. **Keep the format.** Match the structure of the file you're editing. For regulatory and market files, keep the "last updated" date at the bottom. For research, follow the table format (Title, Takeaway, Type, Year, Citations, DOI) and sort by citations descending. +4. **Segment matters.** K-12, higher ed, and corporate L&D are different worlds. Say which one a claim applies to. ## Process 1. Fork the repo. -2. Create a branch for your changes. +2. Create a branch. 3. Make your edits. -4. Open a PR and tell us why these changes make the stack better for founders. +4. Open a PR and tell us why it makes the knowledge base better for founders. -The ScaleU team reviews every PR to make sure we're keeping the quality high and the advice consistent. +The ScaleU team reviews every PR to keep quality high and advice consistent. ## Questions? Open an issue or reach out to the ScaleU team. We're usually around. - diff --git a/ETHOS.md b/ETHOS.md index c08bfe6..87e07b3 100644 --- a/ETHOS.md +++ b/ETHOS.md @@ -1,6 +1,6 @@ # Ethos -What we believe about building edtech products that actually work. These principles shape every skill in this repo and every company that goes through ScaleU. +What we believe about building edtech products that actually work. These principles shape everything in this repo and every company that goes through ScaleU. Informed by "Cracking Higher Ed: Why Startups Miss the Mark" (SXSW EDU 2026, CC BY 4.0). @@ -20,7 +20,7 @@ When you hear a problem, ask: what happened one phase earlier that made this ine Most edtech founders overestimate their evidence tier. A teacher saying "my students loved it" is not evidence. Usage data showing high engagement is not evidence of learning outcomes. Positive pre/post scores without a comparison group are not Tier 3. -Know where you stand on the ESSA evidence ladder (Tier 1-4). Know what tier your buyer requires. Have a concrete plan to close the gap. Investors and institutional buyers increasingly demand real outcome data, not testimonials. The skills in this repo cite peer-reviewed research because evidence is the language that institutions trust. +Know where you stand on the ESSA evidence ladder (Tier 1-4). Know what tier your buyer requires. Have a concrete plan to close the gap. Investors and institutional buyers increasingly demand real outcome data, not testimonials. The guidance in this repo cites peer-reviewed research because evidence is the language that institutions trust. ## 4. The buyer is not the user diff --git a/LICENSE b/LICENSE index 34efbc3..f93addf 100644 --- a/LICENSE +++ b/LICENSE @@ -19,3 +19,11 @@ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. + +--- + +Exception: data/operator-lessons.md is NOT covered by the MIT license above. +Those lessons are paraphrased from the free public archive of Lenny's Podcast +and Lenny's Newsletter and may be used for personal, non-commercial purposes +only, under the source's terms. Do not reuse that file commercially or +redistribute it as raw content. diff --git a/README.md b/README.md index 144c8a4..dc71624 100644 --- a/README.md +++ b/README.md @@ -1,128 +1,71 @@ # EdTech Founder Stack - +  -**Executable expertise for edtech founders — grounded in peer-reviewed learning science and ScaleU's higher ed jobs framework.** +**An open, AI-friendly knowledge base for edtech founders — built by [ASU ScaleU](https://scaleu.asu.edu).** -Built by [ASU ScaleU](https://scaleu.asu.edu), Arizona State University's edtech accelerator. Presented at [SXSW EDU 2026](https://github.com/savvides/cracking-higher-ed-sxswedu). +Building an edtech product means making calls that schools, universities, and L&D buyers will judge you on. Is there real demand or just polite interest? What evidence does a district actually require before it buys? How does higher-ed procurement really work, and how long does it take? Is your AI load-bearing or decorative? Most founders answer these by Googling for weeks, or by guessing. -EdTech Founder Stack isn't just another list of resource links. It's executable expertise that lives in your terminal. Run a skill and get the kind of opinionated, research-backed guidance we usually only give to our portfolio companies—right when you're actually making decisions. +This repo puts the answers in one place: a curated body of knowledge you point your AI tools at — Claude Code, Cursor, ChatGPT — or just read. It's grounded in peer-reviewed learning science, real regulatory and market data, and lessons from operators who've actually done it. Not generic model training data that's a year stale. -## Who this is for +Built by ASU ScaleU, Arizona State University's edtech validation program. The higher-ed framework comes from "Cracking Higher Ed," presented at SXSW EDU 2026. -You're building an edtech product and the stakes are high. You need to know: Is my idea actually viable? Will districts buy this, or just say it's "interesting"? What evidence do investors *really* care about? How do I navigate higher ed procurement without losing six months? Is my AI architecture a load-bearing wall or just decorative wallpaper? +## Who it's for -Stop guessing and skip the weeks of Googling. Run a skill and get answers in five minutes. +You're building in edtech and the stakes are high. You need straight answers about demand, evidence, procurement, accessibility, pricing, and fundraising that account for how education actually buys — not startup advice written for consumer apps. This is the knowledge ScaleU gives its portfolio companies, written down and kept current. -## The founder journey +## What's inside -Skills map to the stages every edtech founder goes through. Start anywhere. Each skill recommends what to run next. +Everything lives in `data/` as plain markdown, in four layers. -``` -Discovery Product Evidence -/edtech-landscape /product-review /evidence-check -/idea-validation /accessibility-check /pilot-design - -Sales Fundraising -/go-to-market /pitch-review -/sales-strategy /fundraising-guide -``` - -## Install - -### Claude Code - -```bash -git clone https://github.com/savvides/edtechfounderstack.git ~/.claude/skills/edtechfounderstack -cd ~/.claude/skills/edtechfounderstack && ./setup -``` - -Then open Claude Code and run `/edtechfounderstack` — it'll ask one question to figure out where you are and route you to the right first skill. All skills are available as slash commands. - -### Codex CLI - -Clone the repo anywhere. Reference individual SKILL.md files as system prompts: - -```bash -codex exec "$(cat path/to/edtechfounderstack/skills/pilot-design/SKILL.md)" -s read-only -``` - -### Gemini CLI - -Clone the repo. Add skill paths to your Gemini configuration or reference SKILL.md files directly. - -### Cursor - -Clone the repo. Copy the contents of any SKILL.md into Cursor's custom instructions for that workspace. - -Interactive features (branching questions based on your answers) work best in Claude Code. Other tools can execute the skills as prompts, but the interactive flow may vary. +### Learning science research — `data/research/` -## Skills +Hundreds of peer-reviewed papers across the major learning-science topics: spaced repetition, cognitive load, formative assessment, adaptive learning, worked examples, the learning-styles myth, and more. Each paper carries its takeaway, study type, year, citation count, and DOI. When you tell a buyer or investor that something works, cite the paper, not a vibe. Index: [`data/research/README.md`](data/research/README.md). -### Discovery & Validation +### Market & regulatory reference — `data/` -**`/edtech-landscape`** — Stop building in a vacuum. Map your market segment, buyer persona, regulatory hurdles, and competitive context before you commit code. Run this first. +- Competitive landscape by segment (K-12, higher ed, corporate L&D) +- K-12 privacy and compliance: FERPA, COPPA, state laws +- Higher-ed landscape, procurement, and accessibility +- Funding landscape by stage — who funds edtech and what they require +- Buyer personas, pilot benchmarks, and the ESSA evidence tiers (1–4) -**`/idea-validation`** — An honest pressure-test. Uses ScaleU's 5-question diagnostic for higher ed founders. You'll get a clear verdict: GO, PIVOT, DIG DEEPER, or STOP. +### ScaleU frameworks -### Product +- **AI-native vs. bolted-on** — is your AI load-bearing or decorative ([`data/ai-native-framework.md`](data/ai-native-framework.md)) +- **Higher-ed jobs atlas** — validated jobs across the student journey, with saturation analysis showing where everyone's already crowded ([`data/higher-ed-jobs-atlas.md`](data/higher-ed-jobs-atlas.md)) +- **Founder traps** — the structural patterns founders miss ([`data/founder-traps.md`](data/founder-traps.md)) +- **The ethos** — seven principles, starting with "validate demand, not interest" ([`ETHOS.md`](ETHOS.md)) -**`/product-review`** — Audit your product across the five dimensions that actually matter: learning design, UX, buyer requirements, differentiation, and evidence readiness. Scores 1-10 with specific fixes. +### Operator playbooks — `data/operator-lessons.md` -**`/accessibility-check`** — Don't treat accessibility as an afterthought. Check against WCAG 2.1 AA, Section 508, and UDL. Get a prioritized checklist, VPAT guidance, and an action plan. +Dozens of field lessons on validation, product, go-to-market, sales, pricing, pilots, fundraising, and team — distilled from operators and investors on Lenny's Podcast and Lenny's Newsletter, and mapped to selling into schools, universities, and L&D. Practitioner experience, attributed and paraphrased; the research corpus is the evidence layer. -### Evidence & Research +## How to use it -**`/evidence-check`** — Most founders overestimate their evidence. Get an honest assessment of your ESSA tier (1-4), a gap analysis, and a concrete plan to move up the ladder. +Point your AI tool at the repo — or at the one file that fits your decision — and ask. The markdown is written to be read by a model or a person. -**`/pilot-design`** — How to design a pilot that actually converts to a contract. Covers timelines, success metrics, MOU templates, IRB guidance, and conversion benchmarks. - -### Sales & Go-to-Market - -**`/go-to-market`** — Build a GTM strategy that survives procurement. Beachhead selection, pricing guidance tied to budget thresholds, and your first-90-day action plan. - -**`/sales-strategy`** — Tactical sales playbook. Outreach templates, demo scripts, and objection handling for the specific buyers who hold the purse strings. - -### Fundraising & Growth - -**`/pitch-review`** — Review your pitch through an edtech investor lens. Scores six dimensions, flags common traps, and provides a revised pitch outline. - -**`/fundraising-guide`** — Who funds edtech right now? Find out what evidence they require and get a week-by-week fundraising playbook. - -## What powers the skills - -### Reference data - -We don't rely on generic LLM training data. Skills read from `data/` for factual claims, meaning guidance is grounded in real regulations, real market data, and real procurement processes. - -### Research corpus - -Peer-reviewed learning science in `data/research/`. When a skill makes a recommendation, it cites specific studies—spaced repetition, cognitive load theory, formative assessment—so you have the DOI to back up your claims. +```bash +git clone https://github.com/savvides/edtechfounderstack.git +``` -### Higher ed framework (SXSW EDU 2026) +- **Claude Code / Cursor:** open the repo and ask; the agent reads the relevant `data/` files. +- **ChatGPT / Claude:** paste the file that fits your question, or upload the repo. +- **Reading it yourself:** start with [`ETHOS.md`](ETHOS.md) for the worldview, [`data/operator-lessons.md`](data/operator-lessons.md) for the playbooks, or [`data/research/README.md`](data/research/README.md) for the evidence base. -Validated jobs across the student journey. The structural patterns founders usually miss. The noise vs. signal filter. This is the "Cracking Higher Ed" framework we presented at SXSW EDU 2026. +Because it's markdown you can audit, every claim traces to something you can check — a named regulation, a paper with a DOI, a named operator. ## About ASU ScaleU -[ScaleU](https://scaleu.asu.edu) is ASU's edtech validation program. We take 1% equity in early-stage startups in exchange for controlled access and a paid pilot at Arizona State University—generating the evidence you need to fundraise and sell at enterprise scale. - -If designing and running an institutional pilot is your next step, that's literally what we do. [Learn more](https://scaleu.asu.edu) or [apply directly](https://scaleu.asu.edu/apply). - -## Philosophy - -See [ETHOS.md](ETHOS.md) for what we believe about building edtech products that actually work. Seven principles, starting with "validate demand, not interest." - -## Architecture - -See [ARCHITECTURE.md](ARCHITECTURE.md) for how skills, data files, and the research corpus fit together. +[ScaleU](https://scaleu.asu.edu) is ASU's edtech validation program. We take 1% equity in early-stage startups in exchange for controlled access and a paid pilot at Arizona State University — the kind of evidence you need to fundraise and sell at enterprise scale. If running an institutional pilot is your next step, that's what we do. [Learn more](https://scaleu.asu.edu) or [apply](https://scaleu.asu.edu/apply). ## Contributing -See [CONTRIBUTING.md](CONTRIBUTING.md) for how to improve existing skills or add new ones. We're always looking for better regulatory data, new buyer personas, and more research papers. +See [CONTRIBUTING.md](CONTRIBUTING.md) for how to help, and [ARCHITECTURE.md](ARCHITECTURE.md) for how the knowledge base is laid out. We're always after better regulatory data, new buyer personas, and more peer-reviewed research. For research, follow the table format in `data/research/` and sort by citation count. ## License -MIT. Use these skills however you want. +The repository's original content is **MIT** — use it however you want. +**Exception:** [`data/operator-lessons.md`](data/operator-lessons.md) is **not** covered by MIT. Those lessons are paraphrased from the free public archive of Lenny's Podcast and Lenny's Newsletter under its personal/non-commercial terms, so treat that one file as personal, non-commercial use only, not for commercial reuse. See the note at the bottom of the file. diff --git a/SECURITY.md b/SECURITY.md index 3f6590a..d811cb3 100644 --- a/SECURITY.md +++ b/SECURITY.md @@ -2,7 +2,7 @@ ## Scope -EdTech Founder Stack is a collection of markdown files (skills and reference data). It does not process user data, run servers, or store credentials. Security concerns for this project are primarily about **data accuracy** rather than traditional software vulnerabilities. +EdTech Founder Stack is a collection of markdown reference files. It does not process user data, run servers, or store credentials. Security concerns for this project are primarily about **data accuracy** rather than traditional software vulnerabilities. ## Reporting data accuracy issues @@ -22,10 +22,10 @@ We will respond within 5 business days and prioritize corrections to regulatory ## Reporting security vulnerabilities -If you discover a security issue in the setup script or any tooling in this repo, please email **philippos.savvides@asuep.org** rather than opening a public issue. +If you discover a security issue in any tooling in this repo, please email **philippos.savvides@asuep.org** rather than opening a public issue. ## Supported versions | Version | Supported | |---------|-----------| -| 1.x | Yes | +| 2.x | Yes | diff --git a/TODOS.md b/TODOS.md index 017fc6f..3ed5e2a 100644 --- a/TODOS.md +++ b/TODOS.md @@ -1,11 +1,3 @@ # TODOs -## P3: AI-native framework consistency check - -**What:** Add a CI step that validates all SKILL.md files reference `data/ai-native-framework.md` consistently. Grep for framework criteria references and flag drift when the data file evolves. - -**Why:** The AI-native framework is integrated across all skills with duplicated detection logic (each skill asks its own AI posture question). When the framework criteria evolve (and they will, this space moves fast), updating the data file is easy but verifying all skills still align requires manual review. - -**Effort:** S (human: ~2 hours / CC: ~15 min) - -**Depends on:** v1.3.0 shipped (AI-native framework integration) +_Nothing open right now._ diff --git a/VERSION b/VERSION index 88c5fb8..227cea2 100644 --- a/VERSION +++ b/VERSION @@ -1 +1 @@ -1.4.0 +2.0.0 diff --git a/data/competitive-landscape.md b/data/competitive-landscape.md index 85365bc..0b0e5b1 100644 --- a/data/competitive-landscape.md +++ b/data/competitive-landscape.md @@ -2,7 +2,7 @@ ## How to use this file -This file maps key companies by education segment. Skills reference it to provide competitive context without relying on LLM training data (which goes stale). Update this file quarterly. +This file maps key companies by education segment. It provides competitive context without relying on LLM training data (which goes stale). Update this file quarterly. ## K-12 diff --git a/data/operator-lessons.md b/data/operator-lessons.md new file mode 100644 index 0000000..c771778 --- /dev/null +++ b/data/operator-lessons.md @@ -0,0 +1,171 @@ +# Operator lessons for edtech founders + +Field lessons from operators and investors, distilled in our own words from the free public archive of [Lenny's Podcast](https://www.lennyspodcast.com) and [Lenny's Newsletter](https://www.lennysnewsletter.com) ([starter dataset](https://github.com/LennysNewsletter/lennys-newsletterpodcastdata)). These are practitioner experience, not peer-reviewed research — for the evidence base on what actually works in learning, see `data/research/`. + +Each lesson notes who it came from. Where a lesson links out, the link goes to that episode or post page on Lenny's site (lennysnewsletter.com). Most podcast episodes have no stable URL in the source data, so for those, search the guest's name on lennyspodcast.com to find the episode. + +## Validating demand + +**Count the workaround, don't survey the concept.** Chesky sold Rabois on Airbnb with ~30 Craigslist posts where people typed out that they wanted to rent a stranger's room; DoorDash, with "93% of US restaurants don't deliver." A count of people already doing the painful workaround beats any volume of "would you use this." Look for who's already hacking a fix for your gap, or who has an open RFP for it. — Keith Rabois, *Lenny's Podcast* + +**Buyer interviews are trustworthy when one person decides.** Customer development works where you can name a single utilitarian decision-maker — which is exactly the institutional buying setup. Treat your district CTO or VP of L&D conversations as real signal. "I surveyed eight teachers" is noise: free-teacher enthusiasm is not the budget-holder's decision. — Keith Rabois, *Lenny's Podcast* + +**Watch organic compounding, not the launch spike.** Winning Product Hunt's day, week, and month told Gamma nothing. The signal that mattered was whether signups kept growing on their own after the spike faded; when they flattened, the team called it *no fit* despite the trophies. Friendly pilot users also quietly stop at day 30, 60, 90 unless the product is genuinely useful. — Grant Lee, *Lenny's Podcast* + +**A dead reply rate is a premise problem, not a tooling problem.** Two companies ran identical outreach and got 2% versus 12% interest; the only difference was the sharpness of the founder's insight into the problem. If cold outreach to principals or L&D leads flops, suspect your premise before blaming the email tool. "Better than your current LMS" gets ignored; a felt pain like a compliance deadline or teacher time gets replies. — Jen Abel, *Lenny's Podcast* + +**Chase the buyer who pulls it out of your hands.** Hold a strong thesis about how the world is changing, but stay loose on the exact wedge and go after the customer who is shockingly easy to sell to. If selling the next customer is a grind, the business won't scale — so trust the eager buyer over the district you spend nine months persuading that your pedagogically "correct" product is right. — Brendan Foody, *Lenny's Podcast* + +**You can't research your way to a zero-to-one product.** People answer from what they've seen — ask about a touchscreen phone and they describe a better keyboard. Feedback loops are right for improving an existing category and actively mislead on category-defining bets. For an incremental tool, lean on educator feedback; for a new bet, expect teachers to ask for the familiar workflow and validate by putting a working prototype in front of them. — Caitlin Kalinowski, [*Lenny's Podcast*](https://www.lennysnewsletter.com/p/why-were-at-the-beginning-of-the) + +**Your next product line is in the off-label requests.** Canva began when one happy yearbook customer asked "can I also use this for newsletters?" Perkins checked that nothing on the market solved it and expanded there, rather than designing a wedge strategy from scratch. Mine the "can I also use it for…" asks from your most engaged school before inventing an expansion roadmap. — Melanie Perkins, *Lenny's Podcast* + +**Reconstruct the buyer's economics yourself.** Restaurants wouldn't share real margins, so Uber Eats ordered food and weighed the ingredients against a supplier catalog to rebuild the cost structure — which gave them conviction to price boldly. Budget owners and procurement won't hand you real numbers either, so reconstruct district budget math independently before you price. — Jason Droege, *Lenny's Podcast* + +## Building the product + +**Make the first 30 seconds magical, and treat onboarding as the product.** Gamma's frame: what can you give a selfish, lazy, impatient new user in 30 seconds that earns you the next 30? That forces you to surface the single most valuable action instead of a feature tour. Teachers and students abandon tools that demand setup before value, so engineer one undeniable aha — a generated lesson, a graded draft — in the first session. — Grant Lee, *Lenny's Podcast* + +**For anything novel, the obstacle is comprehension, not friction.** Removing clicks only helps when the buyer already knows what they want; for everything new, people arrive barely over the intent threshold and bounce because they don't understand what it is or what happens next. Educators are not power users and feel stupid when confused, so your homepage and onboarding have to build understanding — not just shorten the signup form. — Stewart Butterfield, *Lenny's Podcast* + +**Listen for the problem, then invent the solution.** When users demanded a literal "send to all" button, Snap dug into why — social pressure, permanence, reverse-chron feeds — and shipped Stories, solving the real need without building the requested feature. Teachers and admins will ask for another report or another field; the win is mining the underlying job and shipping something they didn't know to request. — Evan Spiegel, *Lenny's Podcast* + +**Ship rough and learn from real users — you can't spec a non-deterministic product.** You can't fully mock an AI product before shipping, so release early as a "research preview" and learn the real use cases and safety failures that lab evals miss. Pair it with fixing feedback within minutes, which makes people feel heard and drives a flood of more feedback. A fast pilot turns early schools into co-designers and shows where the tool produces wrong or unsafe output in front of students. — Boris Cherny, *Lenny's Podcast* + +**Demo, don't memo.** Shopify hasn't let PMs pitch ideas as slides for two years; a four-hour prototype communicated more than a week of documents. With AI prototyping nearly free, build several versions and pick from the real thing. Walking into a district meeting or investor pitch with a working demo of the exact teacher or student workflow beats any roadmap slide — and forces honesty about whether the idea works. — Keith Rabois, *Lenny's Podcast* + +**Good-enough is no longer a differentiator — ship a minimum lovable product.** AI lets everyone produce good-enough fast; the gap that pays now is the distance to world-class (taste, design, copy, emotional resonance). Make a designer an early hire. A generic AI-wrapper tool is trivially cloneable, so your edge with skeptical educators is craft: a tool that feels good gets evangelized in the staff room while a clunky-but-functional one gets abandoned. — Elena Verna, *Lenny's Podcast* + +**Match the technology to the job; don't force the LLM everywhere.** Chess.com scores every move with a deterministic chess engine (LLMs are bad at chess) and uses the LLM only for the human-friendly explanation. For grading, adaptivity, or domain logic, a specialized or rules-based engine often beats an LLM — reserve the model for the tutoring and explanation layer, which also controls cost and hallucination risk. — Albert Cheng, *Lenny's Podcast* + +**"Thin wrapper on a model" is the wrong dismissal.** Horowitz maps it to the 1980s "thin wrapper on a database" jab that Salesforce disproved. Cursor's moat is proprietary data and domain depth — internal models trained on real high-end developer interactions — not the base model. Because human behavior is fat-tailed, capturing the messy real-world edge cases of teaching, student responses, and how learners actually behave is where defensibility lives. — Ben Horowitz, *Lenny's Podcast* + +**As generation gets cheap, design the review surface.** The bottleneck moves from making things to checking them — usually the least fun part. Show the human a rendered preview or an AI pre-review first, not raw output, so they verify in the fastest way and stay accelerated. If your tool drafts lesson plans, IEPs, or feedback at scale, the teacher's review burden is the real adoption gate — design it as carefully as the generation, or volume buries them and kills usage. — Alexander Embiricos, *Lenny's Podcast* + +**Before copying a feature that works elsewhere, ask why it works there.** Duolingo borrowed a moves-counter from a match-3 game and it flopped, because that game's moves require strategy while answering a lesson question doesn't — so the counter was just an annoyance. Always ask why it works in the source product, whether that translates, and what you must adapt before cloning a streak, a chat UI, or a consumer growth loop into a classroom. — Jorge Mazal, [*Lenny's Newsletter*](https://www.lennysnewsletter.com/p/how-duolingo-reignited-user-growth) + +## Building with AI + +**Start AI quality work with error analysis, not tests.** Sample ~100 real production traces and write one free-form note per trace about the first thing that went wrong; stop when notes stop appearing, then have an LLM cluster and count them to find your most common failure. For LLM-as-judge, force a binary pass/fail per single failure mode (never a 1–5 score) and validate it against human labels with a confusion matrix — a judge that always says "pass" scores 90% when errors are only 10% of cases. Reading real student-AI transcripts surfaces failures (hallucinated content, missed handoffs) you'd never spec upfront. — Hamel Husain & Shreya Shankar, *Lenny's Podcast* + +**Design around the "lethal trifecta."** Any agent that combines private-data access, exposure to untrusted input, and an outbound channel can be tricked into exfiltrating data — and prompt-injection filters top out around 97%, a failing grade. The fix isn't better guardrails; it's cutting one leg, usually the ability to send data out. An AI tutor touching FERPA records that also reads untrusted content and can email or post is a breach waiting to happen, so architect the exfiltration path away rather than trusting a prompt to behave. — Simon Willison, *Lenny's Podcast* + +**Treat "agent" as a dial, not a label.** Build along an agency ladder: V1 only suggests to staff, V2 drafts a reply the human edits, V3 acts directly — and log what the human does at each stage as free training data, raising autonomy only once the data shows reliability. School buyers are nervous about AI acting unsupervised on student data or grades, so a deliberately less-agentic design is both safer and often the faster path to a signed pilot. — Aishwarya Naresh Reganti & Kiriti Badam, *Lenny's Podcast* + +**When the agent fails, it's usually context, not capability.** Base-model intelligence is largely already there; the real product work is encoding the idiosyncrasies of a specific workflow — which logs to check, which steps to run, how to handle each failure — into structured context the model can see, rather than reaching for a bigger model. Your moat isn't a smarter LLM; it's encoding how a district's IEP process, your state's standards, and a course's rubrics actually work, which generic ChatGPT can't replicate and which improves with each deployment. — Scott Wu, *Lenny's Podcast* + +**Aim AI at processes that are currently bad, not ones that are already near-perfect.** AI delivers huge value where the human process is only 10–20% accurate (getting to 60–80% is a celebrated win) and struggles closing the last 2% on a 98% process. And an automation that works 95% of the time isn't an automation — you can't rely on it until it's near-100%, so always design an escalation path to a human. Point it at feedback nobody has time to write or triaging which students need help, not tasks teachers already do well. — Jason Droege, *Lenny's Podcast* + +**The models eat your scaffolding for breakfast.** Heavy infrastructure built to compensate for weak models — custom agent frameworks, mandatory vector-store RAG, forced to-do lists — becomes dead weight as models improve, and hand-built orchestration adds maybe 10–20% that the next release erases. Build for the capability that's ~80% there today, give the model tools and a goal rather than rigid step-1/step-2 workflows, and revisit your prompt scaffolding each model release. Architect a borderline-today district pilot for the accuracy you expect in 6–12 months. — Sherwin Wu, *Lenny's Podcast* + +**Own the data or be the model — the middle gets crushed.** Point solutions lack the first-party data to do useful joins, and integrating via flat-file feeds is drinking through a straw. Defensible AI products either own the data ("the mine") or provide the model ("the shovels"); renting both crushes your unit economics. A single-feature tool pulling SIS/LMS/assessment data through brittle integrations won't be allowed good economics by the platforms it depends on — owning a data wedge and accumulating each teacher's students, materials, and corrections is the moat. — Matt MacInnis, *Lenny's Podcast* + +**Distrust one-click agents and multi-agent "swarms."** Replacing a real workflow takes roughly four to six months even with good data, because enterprise data is messy — duplicate functions, broken taxonomies, undocumented rules. Current models can't reliably run a utopia of peer agents that self-coordinate; what works is one supervisor agent (or a human) orchestrating sub-agents. School SIS, LMS, and roster data are exactly this messy, so budget months, not weeks — and don't architect a tutoring product as a swarm of specialists talking to each other when a student or parent is on the other end. — Aishwarya Naresh Reganti & Kiriti Badam, *Lenny's Podcast* + +**For RAG, the wins are in data prep, not the vector database.** Tune chunk size, rewrite source content into question-answer pairs, and add an annotation layer for things AI lacks common sense about. What improves AI apps is talking to users, preparing better data, fixing the end-to-end workflow, and better prompts; what wastes time is chasing AI news, swapping frameworks, agonizing over vector DBs, and fine-tuning. If you're building a tutor over curriculum or training docs, restructure that content into Q&A form before touching infrastructure — docs written for humans often fail for AI. — Chip Huyen, *Lenny's Podcast* + +**Onboard an agent like an employee, and build the undo.** An agent can make hundreds of changes in seconds, so the UX flips: you need approval queues, a summarized inbox of what happened, change logs, and one-click rollback — not a real-time collaboration view. Give it its own scoped credentials (never your master password), widen permissions as trust is earned, and hard-code that it takes instructions only from the user on one channel, treating email and the open web as data, not commands. A least-privilege, stated-trust-boundary design is also far easier to get through district IT and security questionnaires. — Dan Shipper, [*Lenny's Podcast*](https://www.lennysnewsletter.com/p/the-ai-paradox-dan-shipper) + +## Pilots and proof + +**Sell a paid service before you sell the technology.** 40–50% of B2B startups must sell a time-boxed (90-day) paid service first, because the buyer has no process or human-in-the-loop plan to adopt something new yet. The service gets you the logo, revenue that signals real intent, and the position of educating the buyer. If a district has no workflow for adopting AI tutoring, sell a paid 90-day implementation first rather than waiting 18 months — whoever educates the buyer wins the eventual contract. — Jen Abel, *Lenny's Podcast* + +**Shrink the change to de-risk the yes.** When a commitment feels scary, repackage it as a one-week proof of concept with explicit success criteria and a pre-set check-in date. It's the certainty of a next checkpoint, not certainty of the outcome, that calms a cautious decision-maker. Risk-averse school and university buyers say yes faster to a tightly scoped, time-boxed pilot with defined metrics and a scheduled review than to anything open-ended. — Jessica Fain, *Lenny's Podcast* + +**Pick at least three metrics across different dimensions.** Teams default to easy-to-instrument activity counts that look productive but say nothing about whether the work is good or sustainable — which is why SPACE forces metrics across satisfaction, performance, activity, collaboration, and flow. A pilot reported purely on usage will get gamed and won't convince a skeptical district; pair a usage metric with a learning-outcome metric and a teacher-satisfaction survey. — Nicole Forsgren, *Lenny's Podcast* + +**A demo is not a deployable system — budget for the nines.** Getting an AI from a 60–70% demo to production reliability follows a data-center-uptime curve: each additional "nine" is an order of magnitude more work, and serious automation takes 6–12 months of engineering, legal, and change management. Cheap proofs-of-concept inflate the failure-rate headlines. Set district expectations accordingly so you don't get labeled a failed pilot when the demo doesn't survive contact with real classrooms. — Jason Droege, *Lenny's Podcast* + +**Distrust flashy leaderboard "wins."** Quick crowd-vote leaderboards reward flashy, emoji-laden, longer outputs even when the answer is wrong, because people skim for two seconds. Real evaluation needs domain experts working through the actual task — checking the code, the equations, the reasoning. An AI tutor that sounds confident and looks polished can score well while being pedagogically wrong, so prove learning outcomes with educators rigorously checking accuracy, not vibe-based demo reactions. — Edwin Chen, *Lenny's Podcast* + +**Give a pilot one accountable owner.** Anything cross-functional dies when it's split across sales, implementation, and product — "if you want to kill a plant, have two people water it." Name one Directly Responsible Individual with authority to direct other teams. A district pilot touches sales, onboarding, support, and product, so a single accountable owner keeps a make-or-break reference deployment from falling through the cracks. — Brian Halligan, *Lenny's Podcast* + +## Go-to-market + +**Keep word of mouth above half, or you're on a CAC treadmill.** Gamma keeps over 50% of signups coming from word of mouth and treats paid acquisition as a hard ceiling; if more than half your growth is ads, the core engine is broken. Mercor reached a nine-figure run rate with nobody in sales or marketing. Prove that teachers refer colleagues and schools refer schools before pouring money into ads or hiring SDRs. — Grant Lee, *Lenny's Podcast* + +**Lead with a use case the buyer already measures.** Companies buy AI use cases with measurable outcomes far more readily than fuzzy productivity ones — a sales chatbot sells because you can compare conversion before and after, while internal-knowledge tools stall. Lead with a metric districts and L&D already track (completion, support-ticket deflection, time-to-onboard), not vague "engagement" or "productivity," because clear outcomes unlock budget. — Chip Huyen, *Lenny's Podcast* + +**Earn "permission to play" before you build.** Before a new feature or product line, ask whether buyers will find it credible that *your* company offers it and whether you already have a route to reach them; Jeetu Patel kills ~99% of new-idea proposals on this test. A literacy-platform founder has permission to launch an adjacent assessment tool to the same districts — but an unrelated product to those schools wastes scarce calories, because the relationship and category credibility aren't there. — Jeetu Patel, *Lenny's Podcast* + +**Answer Engine Optimization is a channel you can win this week.** Unlike SEO, which needs years of domain authority, you can show up in a ChatGPT or Perplexity answer tomorrow via a Reddit thread, a YouTube video, or a single blog mention — and being cited most often across sources beats being the #1 link. Teachers, instructional coaches, and L&D buyers increasingly ask AI "what's the best tool for X," so a young product can get into those answers now by seeding authentic mentions. — Ethan Smith, *Lenny's Podcast* + +**Tell pilots to throw their hardest real problem at it.** A professional-grade tool earns trust by solving the gnarly problem nobody else could; dumbing the trial down both undersells it and hides where it breaks. In a district pilot, ask the skeptical department head for their messiest real case — the impossible-to-grade essays, the tangled scheduling conflict — because succeeding there converts a doubter faster than ten polished demos. — Alexander Embiricos, *Lenny's Podcast* + +**Channels sag before they die — assume decay.** Channels don't plateau on a clean S-curve; they're an "elephant curve" where audiences saturate and decline while vendors report healthy numbers right up to failure. Constant Contact restarted growth with in-person workshops; HubSpot built an agency channel that became ~50% of revenue. Durable edtech distribution often comes from ecosystem partners — district PD workshops, university teaching centers, reseller networks — that start you with an existing audience. — Jason Cohen, *Lenny's Podcast* + +## Sales + +**Your funnel breaks at qualification, not closing.** Abel has never seen a genuine bottom-of-funnel problem; deals stall because you reached the wrong person, used the wrong message, or pitched a problem you can't solve. Book the next call during the current call, and treat "I'll email you" as a soft no. When school deals fizzle after warm first meetings, the cause is almost always upstream — you're talking to a teacher who can't buy. — Jen Abel, *Lenny's Podcast* + +**Do all the procurement work for them, and pre-arm the signer.** Fill out their forms, and state precisely what you do and don't do — vague claims get you classified high-risk and routed to the kitchen-sink MSA. Abel lost a month because a CFO got a contract he didn't understand and kicked it to the back of the queue. District and university procurement, security review, and privacy vetting stall unless you project-manage every form and hand the actual signer — CFO, CISO, department head — a clear one-liner. — Jen Abel, *Lenny's Podcast* + +**Sell against the risk of doing nothing.** Roughly four in five buyers purchase to avoid pain or reduce risk, not to chase upside, so the founder-vision pitch mostly lands only with other founders. A district CIO or VP of L&D is protecting a budget and a career — lead with the cost of falling behind, audit or compliance exposure, or a renewal that won't show outcomes, not the visionary future state. — Jeanne DeWitt Grosser, *Lenny's Podcast* + +**Hand-build 30 prospects before you buy sales tooling.** Manually find 30 ideal prospects and spend 15–20 minutes writing each a real note across email, LinkedIn, and a call. The exercise reveals whether your buyers are even discoverable and what they share — only then do enrichment tools work. If you can't find 30 curriculum directors or L&D managers, or you get zero replies, you've learned something cheap before automating outreach and torching your domain. — Jen Abel, *Lenny's Podcast* + +**First-rep rules: close ~10 deals yourself, hire two, pick who you'd buy from.** Personally close around 10 deals before hiring; hire two reps, not one, so you can compare; pick the rep you'd actually buy your own product from over the impressive logo; and favor reps whose last product was slightly *harder* to sell than yours. Wait on a VP of sales until two reps hit quota. A rep who fought procurement-heavy, security-paranoid sectors finds edtech easier than one from a frictionless SMB product. — Jason Lemkin, *Lenny's Podcast* + +**Make the first call a working session that leaves an asset.** Stripe's first sales call was a whiteboard of the customer's own payments architecture, so the buyer walked away with a diagram they'd never drawn — helped, not quizzed. Run your first school meeting as a co-mapping of their student journey, data flow, or current tool stack, so they leave with something useful even if they don't buy. That's the trust that wins a slow committee sale. — Jeanne DeWitt Grosser, *Lenny's Podcast* + +**Ask spicier discovery questions.** "What's top of mind?" has gone generic and yields rehearsed answers. Fain asks emotion-surfacing questions instead: "What's the most urgent thing you're scared of messing up?" and "What is your board pushing you on?" With a district or L&D buyer, probe what their board, superintendent, or state accountability metrics are pressuring them on this year, then tie your tool to relieving that specific fear. — Jessica Fain, *Lenny's Podcast* + +## Pricing + +**Raising the price often raises signups in B2B.** Price signals quality and selects which buyers even consider you. One founder selling to enterprise and government at $300/year saw zero change after switching to $300/month — 12x — proof he was nowhere near the ceiling. A cheap per-teacher price screens you out of district procurement, where buyers equate low price with weak security, support, and governance. — Jason Cohen, *Lenny's Podcast* + +**Reposition from savings to growth and charge far more.** Pitched as "cut your AdWords cost in half," a tool is worth ~$5K/month because the buyer keeps most of the savings; pitched as "double your leads at the same ROI," the identical tool is worth ~$40K, because growth budget dwarfs cost-cutting budget. Sell "raise completion, retention, or enrollment," not "save teachers time" — outcomes that leadership is measured on carry an order-of-magnitude bigger budget than efficiency claims. — Jason Cohen, *Lenny's Podcast* + +**Make the free tier a live sample of the full product.** Grammarly's free users only saw spelling and grammar fixes, so they assumed that was all it did; interspersing a capped real-time taste of paid suggestions nearly doubled upgrades — despite fears that giving more away would hurt conversion. If your free or pilot tier shows only the boring basics, schools and learners will price you as a basic tool. — Albert Cheng, *Lenny's Podcast* + +**Heavy spenders have more intense needs, not deeper pockets.** Tinder's biggest à-la-carte spenders weren't wealthy flaunters but people with urgent needs — military, frequent movers, new to a city — who priced the app against the cost of dating, not other subscriptions. Find the buyers with the most acute pain (a district under a compliance deadline, a school with a specific outcome gap) and anchor pricing to the cost of their alternative, not to competitor subscription prices. — Ravi Mehta, *Lenny's Podcast* + +**Don't set a price until users are begging, then verify margins.** Gamma waited until users wanted to pay, then used a Van Westendorp willingness-to-pay survey plus conjoint to land on one simple plan around $20/month, anchored to ChatGPT — and checked it produced positive margins on inference rather than assuming they'd figure out economics later. For a prosumer teacher tier, run a quick willingness-to-pay survey, keep the plan dead simple, and confirm the price covers AI cost from day one, since education budgets won't tolerate later hikes. — Grant Lee, *Lenny's Podcast* + +## Growth and retention + +**Know your growth ceiling: new customers per month divided by monthly churn.** At 100 new logos a month and 5% churn you will never pass ~2,000 customers, because cancellations scale with your base while marketing doesn't. Run this number before chasing more acquisition. Districts and universities churn on a budget cycle, so a bad churn rate quietly caps you no matter how hard your reps push. — Jason Cohen, *Lenny's Podcast* + +**Model your users as states, and find the one lever that compounds.** Duolingo bucketed every user into mutually exclusive states (new, current, reactivated, resurrected, at-risk, dormant) and modeled the transition rates between them. A sensitivity analysis showed current-user retention drove 5x more daily-active growth than the next-best lever, because retained users loop back and compound. It's a concrete way to find the one retention lever that actually moves the active-usage number districts judge renewals on. — Jorge Mazal, [*Lenny's Newsletter*](https://www.lennysnewsletter.com/p/how-duolingo-reignited-user-growth) + +**"Too expensive" is almost never the real reason they canceled.** A customer who got through your whole funnel and paid had already accepted the price. Ask "what made you cancel?" as an open question (it roughly doubles usable responses), then dig past the excuse to the real failure. School buyers will blame budget when the truth is your tool didn't sync with their SIS or a teacher couldn't onboard students, and fixing the stated reason wastes a renewal cycle. — Jason Cohen, *Lenny's Podcast* + +**Leads from AI answers convert far better, and you probably can't see them.** Webflow saw a 6x conversion gap between ChatGPT-referred traffic and Google, because the buyer already had a long qualifying conversation before clicking. Most of it is mis-attributed as "direct" or "branded search" because people open a new tab and type your name, so add a post-signup "how did you hear about us?" If your dashboard shows direct or branded traffic rising, AI may already be sending you high-intent district and L&D buyers you can't see. — Ethan Smith, *Lenny's Podcast* + +**Reframe failure instead of rubbing it in.** Chess.com found 80% of users review a game after a win, not a loss, so they flipped the post-loss screen to surface best moves and encouragement instead of blunders, and grew the feature 25% and subscriptions 20%. Learners abandon tools that punish their mistakes, so reframing errors positively can lift retention and paid conversion across any K-12 or L&D product. — Albert Cheng, *Lenny's Podcast* + +**Keep the friction that helps a user see the product is for them.** Don't reflexively strip onboarding steps. Anthropic, MasterClass, Mercury, and Calm all keep deliberately long onboarding quizzes, because asking who the user is lets you route them to the right feature and personalize later. Cut friction that adds nothing; keep friction that earns a better first experience, and reuse the profile data for re-engagement. Activation in edtech hinges on a teacher or learner seeing "this is for my subject, grade, or role." — Amol Avasare, *Lenny's Podcast* + +**Protect your notification and email channels as a hard constraint.** Groupon escalated to five emails a day, briefly won on metrics, then permanently lost the channel, because opted-out users never come back. Duolingo let the team optimize timing, copy, and images freely but required CEO approval to raise frequency. Edtech leans on reminder emails and push notifications to drive student and teacher engagement, so over-testing frequency can burn the exact channel that keeps a district's usage numbers up at renewal time. — Jorge Mazal, [*Lenny's Newsletter*](https://www.lennysnewsletter.com/p/how-duolingo-reignited-user-growth) + +## Fundraising + +**Treat every "no" as a missing slide.** When Canva heard "market's too small" or "you're like X," Perkins added a market-size slide and a competitive-gap slide to pre-answer those exact objections — so later investors grasped in 10 minutes what the first took six hours to get. The vision stayed constant; only the articulation sharpened through rejection. Edtech founders face investors who don't understand the sector, so log each objection and add the slide that pre-empts it. — Melanie Perkins, *Lenny's Podcast* + +**Name your accumulating advantage against the model labs.** Investors increasingly ask whether foundation-model labs will leave "no oxygen" for you over an 8–20 year horizon. You don't need a proven moat at seed, but you must articulate, in sequence, where the compounding advantage — data, network, switching costs — will come from and when you'd start measuring it. A thin AI wrapper on a teacher tool reads as model-fodder; show how district data, integrations, or workflow lock-in compound. — Keith Rabois, *Lenny's Podcast* + +**Lock in mission-protective structure before you raise.** A Public Benefit Corporation filing is a roughly two-page Delaware filing with essentially no downside, and the only time to do it is before priced rounds, because leverage only shrinks afterward — only ~20% of founders are still CEO three years after IPO. "It's always too early until it's too late." A founder who wants control over what happens to student data and learning outcomes should lock this in at the SAFE stage, not discover at acquisition that the board is bound to take the highest bid. — Eric Ries, [*Lenny's Podcast*](https://www.lennysnewsletter.com/p/how-to-build-a-company-that-withstands) + +**"Never quit" serves VC incentives, not yours.** Seed investors model every bet to zero and want you trying against all odds, because that's the only way they get their money back. If you're in year four or five, still pivoting without rip-roaring growth, quitting and resetting the cap table is often the rational move — product-market fit is unmistakable when it's real, and if you're unsure, you don't have it. Edtech's long sales cycles disguise the absence of demand, so founders rationalize "almost there" for years. — Matt MacInnis, *Lenny's Podcast* + +## Team and leadership + +**Count your "barrels," not your headcount.** Barrels are the rare people who own an initiative end-to-end and get it over the hill; everyone else is ammunition that amplifies a barrel. PayPal had only 12–17 barrels among 254 people. Adding ammunition behind the same barrels adds coordination tax without throughput, so staff against your number of true owners rather than raising and over-hiring against a thin pilot or sales motion. — Keith Rabois, *Lenny's Podcast* + +**AI adoption needs top-down budget plus a bottom-up tiger team.** Pure mandates fail; what works pairs exec buy-in with a small team of your most excited people who find real workflows, run hackathons, and evangelize — often technically adjacent non-engineers like the ops lead who's an Excel wizard, not coders. This is exactly how your district and university customers will or won't succeed, so build a customer-side champion team into your implementation plan rather than relying on an administrator's mandate that teachers quietly ignore. — Sherwin Wu, *Lenny's Podcast* + +**Use blind references and sharper hiring questions.** Ask "On a scale of 1–10, how likely are you to rehire this person?" and "Were they in the top 1% or top 10% of your reports?" Founders overrate their own gut, and roughly half of senior hires are gone within 18 months. Shrink the interview panel (HubSpot went from 8 to 4) and pick spiky candidates with real strengths over safe 3-out-of-4 generalists. A mis-hire on a small team is slow and expensive to correct. — Brian Halligan, *Lenny's Podcast* + +**Agree on what the goal means before anyone builds.** Forsgren's most common failure — 80% of teams — is starting work before defining the goal. "Improve developer experience" splinters into culture versus tooling versus friction, and teams spend months building the wrong thing; the fix costs one week of writing it down. When a district says it wants to "improve engagement" or "use AI," pin the exact definition in writing before scoping the pilot, or you'll ship something they didn't mean and lose the renewal. — Nicole Forsgren, *Lenny's Podcast* + +**Every useful AI agent needs a named human owner.** Agents quietly stop being useful unless someone owns them and keeps adding context; individuals won't do the upkeep, so the working model is a single company "super agent" maintained by a forward-deployed-engineer type. If you sell an AI agent into a school, name the staff owner who maintains it during the pilot — an unowned agent degrades and your renewal dies. Build the owner role into your deployment plan. — Dan Shipper, [*Lenny's Podcast*](https://www.lennysnewsletter.com/p/the-ai-paradox-dan-shipper) + +**Don't make the impressive big-company hire too early.** A VP from Microsoft, Google, or Salesforce expects you to "have your act together," which a 50–500 person startup doesn't — Halligan saw near-100% attrition on those hires. The underrated alternative is promoting homegrown people who've already proven themselves and whose weaknesses you can see. A founder dazzled by a Pearson or Google-for-Education resume often gets a leader who can't operate without big-company infrastructure. — Brian Halligan, *Lenny's Podcast* + +**Separate identity from behavior before a hard conversation.** Open by affirming the person is capable and on your team, then address the specific behavior. Skip this and people hear a verdict on their worth and get defensive, and you end up silently arguing about whether they're a good person instead of fixing the issue. It's the most reliable way to give a struggling teammate, co-founder, or a misfiring implementation lead at a customer site feedback that changes behavior. — Dr. Becky Kennedy, *Lenny's Podcast* + +**Hesitation is the most destructive thing a founder does.** When both options look bad, avoiding the call freezes the company and pushes nervous senior people to fill the void, which turns political fast. The fix is rarely more analysis — often you already see the answer but don't know how to have the hard conversation, so make it specific and behavioral. Edtech's long sales and budget cycles punish a frozen team severely, so choosing beats stalling on whether to re-architect, cut a pilot, or replace a hire. — Ben Horowitz, *Lenny's Podcast* + +--- + +*Distilled from the free public starter dataset of Lenny's Podcast and Lenny's Newsletter ([github.com/LennysNewsletter/lennys-newsletterpodcastdata](https://github.com/LennysNewsletter/lennys-newsletterpodcastdata)), used under its personal/non-commercial terms. Takeaways are paraphrased, not quoted. Last updated 2026-05-29.* diff --git a/data/research/README.md b/data/research/README.md index 9759b24..dc16a01 100644 --- a/data/research/README.md +++ b/data/research/README.md @@ -1,6 +1,6 @@ # Research Corpus Index -Evidence base powering EdTech Founder Stack skills and recommendations. +Evidence base powering EdTech Founder Stack. | Topic | File | Papers | |-------|------|--------| diff --git a/docs/social-preview.html b/docs/social-preview.html index a5ec3c4..950befa 100644 --- a/docs/social-preview.html +++ b/docs/social-preview.html @@ -84,22 +84,22 @@