Detect whitespace padding used to hide prompt-injection instructions (P9) by korjavin · Pull Request #24 · NVIDIA/SkillSpector

korjavin · 2026-06-11T14:53:02Z

Adds rule P9 "Whitespace Padding" under Prompt Injection, for issue #20. It detects padding that pushes injected instructions out of a reviewer's view while the agent still reads them.

P6 through P8 were taken by System Prompt Leakage, so this uses P9. One id covers all three signals; confidence carries the weighting.

Signals (all reported as P9):

Vertical: 20+ consecutive blank or whitespace-only lines. MEDIUM, raised to HIGH when content follows a gap of 40+ lines. Confidence 0.8 with trailing content, 0.6 without.
Horizontal: 80+ consecutive whitespace characters in a line, including leading indentation. MEDIUM, confidence 0.7.
Ratio: a contiguous whitespace block over 2 KB, or whitespace over 90% of a file larger than 4 KB. LOW, confidence 0.4.

Whitespace is classified by Unicode category rather than ASCII space/tab: controls (\t \n \r \v \f), categories Zs/Zl/Zp (U+00A0, U+2028, U+2029, U+3000, and so on), and the zero-width family (U+200B/C/D, U+2060, U+FEFF). That zero-width set is now one shared constant (ZERO_WIDTH_CHARS) used by P2's regex and the mcp_tool_poisoning zero-width check, so the two cannot drift; the MCP check also picks up U+2060/U+FEFF.

Each finding points at the line where padding starts and includes a visible snippet of what was hidden (for example U+00A0 x82 or \n x80).

False-positive guards: markdown fenced code is skipped for the horizontal signal; vendored files are skipped (*.min.js, *.min.css, *.lock, package-lock.json, yarn.lock, *.svg, *.map); binary-ish content (containing U+FFFD) bails out; the ratio signal stays at LOW. Eval-dataset prose and files over 1 MB are already skipped upstream.

MCP manifest description fields are covered by the same detector (horizontal and block signals; the per-file ratio signal is skipped since fields are short).

Tests cover all three signals at their thresholds, the full Unicode evasion set, the false-positive guards, and the MCP path. Thresholds are named constants, easy to tune against a real corpus. Happy to adjust the signals or thresholds before merge.

Plan for issue NVIDIA#20 — detect large whitespace padding used to hide prompt-injection instructions from review. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…VIDIA#20 comment

…up, tests)

…mpleted/

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

korjavin and others added 12 commits June 11, 2026 14:50

docs: add whitespace padding detection (P9) implementation plan

0d06810

Plan for issue NVIDIA#20 — detect large whitespace padding used to hide prompt-injection instructions from review. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat: add whitespace padding detector helper module (P9 Task 1)

0dad1c7

feat: add P9 whitespace padding findings to prompt-injection analyzer

3c81ee2

feat: add P9 whitespace padding detection to MCP manifest fields

884f134

test: P9 acceptance verification and adversarial padding-char coverage

bac008f

docs: add P9 whitespace padding to README and CLAUDE.md, draft issue N…

c46232d

…VIDIA#20 comment

fix: address review findings (U+2028/2029 vertical padding, block ded…

172fac4

…up, tests)

fix: detect U+2028/U+2029 vertical padding in MCP description fields

106ab39

test: cover MCP block-kind P9 path; clarify _check_p9_padding docstring

bc1e96f

fix: codex review - trailing-gap boundary and block/ratio dedup

f427866

docs: move completed plan 20260611-detect-whitespace-padding.md to co…

0b0ad17

…mpleted/

chore: drop internal planning artifacts from feature branch

3d44702

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detect whitespace padding used to hide prompt-injection instructions (P9)#24

Detect whitespace padding used to hide prompt-injection instructions (P9)#24
korjavin wants to merge 12 commits into
NVIDIA:mainfrom
korjavin:feat/detect-whitespace-padding-injection

korjavin commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

korjavin commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant