Add blog post: Glassbox — Grab vLLM's Attention by dmaniloff · Pull Request #183 · vllm-project/vllm-project.github.io

dmaniloff · 2026-03-27T23:00:30Z

Summary

Introduces glassbox, a vLLM plugin for extracting structured signals from transformer attention during inference
Authors: Diego Maniloff, Dominik Dahlem, Mac Misiura — Red Hat AI

Post structure

Three design pillars:

Research-informed signals — five feature groups from current literature + new research: spectral (pre-softmax SVD), AttentionTracker, LLM-Check, LapEigvals (EMNLP 2025), and routing/Hodge features from degree-normalized attention (Dahlem et al., upcoming)
Built for inference — matrix-free SVD via matvec oracles and a fused Triton kernel, configurable overhead (intervals, heads, signals on/off)
vLLM-native — custom attention backend registered via vllm.general_plugins, no source modifications

Also covers:

Three run modes: vllm serve, glassbox-run, glassbox-extract
Pluggable handler system: JSONL, OpenTelemetry spans, custom handlers
Terminal-style demo output from a real OPT-125m run
Vision: closing the loop from signals to action — inline detection, Observation Plugin RFC (#36998), external serving, llm-d shadow mode

Introduces glassbox, a vLLM plugin for extracting structured signals from transformer attention during inference. Covers research-informed signals, matrix-free SVD for inference efficiency, and vLLM-native integration via custom attention backends. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Signed-off-by: Diego Maniloff <diego.maniloff@gmail.com>

…ision - Add LapEigvals (EMNLP 2025) as 5th signal group - Update YAML config to current signal names (spectral, routing, tracker, selfattn, laplacian) - Add "Running glassbox" section with three run modes (vllm serve, glassbox-run, glassbox-extract) - Add signal emission subsection (JsonlHandler, OtelHandler, custom handlers) - Rewrite vision section around closing the loop: inline detection, RFC #36998 ABORT/CONTINUE, external serving, llm-d shadow mode - Fix post-softmax description: clarify difference from FlashAttention tiling Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Diego Maniloff <diego.maniloff@gmail.com>

Show actual glassbox-run command and cleaned-up log output from a real OPT-125m run. Add "What the features tell you" subheading for the ratio trajectory table. Update table values to match real run. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Diego Maniloff <diego.maniloff@gmail.com>

Signed-off-by: Diego Maniloff <diego.maniloff@gmail.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Diego Maniloff <diego.maniloff@gmail.com>

vercel bot deployed to Preview March 27, 2026 23:00 View deployment

vercel bot deployed to Preview April 14, 2026 17:32 View deployment

vercel bot deployed to Preview April 14, 2026 20:05 View deployment

dmaniloff force-pushed the blog/glassbox-intro branch from 68d76dc to 9e6c0bb Compare April 14, 2026 20:09

vercel bot deployed to Preview April 14, 2026 20:10 View deployment

dmaniloff and others added 6 commits April 15, 2026 16:29

snapshot example.

2dad6b4

Signed-off-by: Diego Maniloff <diego.maniloff@gmail.com>

whitespace.

4f3a4b7

Signed-off-by: Diego Maniloff <diego.maniloff@gmail.com>

Tighten opening: lead with value prop, remove bullet list

36ac891

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Diego Maniloff <diego.maniloff@gmail.com>

dmaniloff force-pushed the blog/glassbox-intro branch from 9e6c0bb to 36ac891 Compare April 15, 2026 16:30

vercel bot deployed to Preview April 15, 2026 16:30 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add blog post: Glassbox — Grab vLLM's Attention#183

Add blog post: Glassbox — Grab vLLM's Attention#183
dmaniloff wants to merge 6 commits intovllm-project:mainfrom
dmaniloff:blog/glassbox-intro

dmaniloff commented Mar 27, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dmaniloff commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Post structure

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dmaniloff commented Mar 27, 2026 •

edited

Loading