fix(anthropic): report real token usage on blocked responses by seph-barker · Pull Request #3 · predibase/litellm

seph-barker · 2026-06-24T17:10:17Z

The ModifyResponseException handler in the /v1/messages endpoint synthesizes a "blocked" response reporting zero input and output tokens, even though the request consumed real input tokens and the synthetic block message carries real content. Callers relying on usage (billing, quotas, metrics) under-count every blocked response.

Compute input_tokens from the original request messages (carried on the exception's request_data) and output_tokens from the block message text via litellm.token_counter. Counting is best-effort and falls back to zero on failure so a blocked response is always returned. The streaming synthesis path reuses the same response object, so both paths are fixed by one change.

Adds tests asserting nonzero, correct counts and graceful fallback. The endpoint's test file passes (5 tests).

The ModifyResponseException handler in the /v1/messages endpoint synthesizes a "blocked" response with hardcoded usage of zero input and output tokens, despite the request having consumed real input tokens and the block message carrying real content. Compute input_tokens from the original request messages (carried on the exception's request_data) and output_tokens from the block message text via litellm.token_counter. Token counting is best-effort and falls back to zero on failure so a blocked response is always returned. The streaming synthesis path reuses the same response object, so both paths are fixed consistently. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

seph-barker · 2026-06-24T17:55:06Z

Superseded by upstream BerriAI#31217 — opening directly against the main litellm repo.

seph-barker closed this Jun 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(anthropic): report real token usage on blocked responses#3

fix(anthropic): report real token usage on blocked responses#3
seph-barker wants to merge 1 commit into
mainfrom
joseph/fix-blocked-token-counts-main

seph-barker commented Jun 24, 2026

Uh oh!

seph-barker commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

seph-barker commented Jun 24, 2026

Uh oh!

seph-barker commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant