perf: drop AVIF + raise Cloud Run CPU/mem to fix 5.4s home LCP by julianken · Pull Request #442 · julianken/detached-node

julianken · 2026-06-01T17:09:27Z

Diagrams

N/A — config-only change (two lines: next.config.ts image formats + deploy.yml Cloud Run flags); no architecture or data-flow change to illustrate.

Summary

Hotfix for a production incident: the home-page LCP is 5,406 ms (Google "poor" is > 4,000 ms). TTFB is fine (129 ms, prerender cache HIT); the slow element is a featured-post card hero image (PostCard → ThemeAwareHero) optimized on demand by /_next/image. The trace showed a 4,723 ms load with only 12 ms of download — so ~4.7 s is pure server-side encode wait on a --cpu=1 --memory=512Mi, CPU-throttled Cloud Run instance with an ephemeral optimizer cache and no CDN. images.formats lists AVIF first, the most CPU-expensive encode, which also starved sibling immutable JS chunks.

Two config changes:

Drop AVIF — next.config.ts images.formats: ['image/avif', 'image/webp'] → ['image/webp']. Sources are already WebP; WebP encodes ~2–4× faster on 1 vCPU, and the larger WebP bytes are irrelevant when download was 12 ms.
Give the origin CPU + memory — deploy.yml Cloud Run flags: --cpu=1 --memory=512Mi → --cpu=2 --memory=1Gi. CPU throttling left on (still grants full CPU during the request, when the encode runs); --no-cpu-throttling deliberately not added (it only adds 24/7 instance billing for no encode-latency benefit).

Non-goals (unchanged here): no CDN (durable edge-cache tracked in #415), no --no-cpu-throttling, and no JSX/component edits — the home LCP image already sets fetchPriority="high"; this PR does not add priority/preload. All other deploy flags (--allow-unauthenticated, --min-instances=1, --max-instances=3, --port=8080, --timeout=60s) are untouched.

Closes #440

Screenshots

N/A — not UI. This is a build-config + deploy-flag change with no markup difference.

Test plan

Local gates run from the worktree:

pnpm lint — PASS (0 errors; 19 pre-existing warnings, none from this change).
pnpm typecheck (tsc --noEmit) — PASS.
pnpm test:unit (vitest run) — PASS (710/710 tests, 49 files).
pnpm build (next build) — PASS (exit 0; 67/67 static pages generated, full route table including /). Required NEXT_PUBLIC_SERVER_URL was supplied locally to satisfy the build-phase env guard (src/lib/env/required-env.ts), exactly as CI injects it.
pnpm test:e2e (playwright test) — DEFERRED-CI: globalSetup runs pnpm seed:test, which needs a Postgres DATABASE_URL not available in this sandbox. The four required E2E Shard x/4 checks (CI + Mergify-enforced) run on this PR. This is a config-only change with no JSX/behavioral edits, so E2E-exercised markup is unaffected.

Post-merge verification (to record here after deploy): re-run a Chrome DevTools trace of /; the LCP element is the first featured-post card image (PostCard → ThemeAwareHero). Expect LCP well under the prior ~5.4 s (target < ~2.5 s); note before/after.

Plan reference

Out of plan — prod incident: home-page LCP 5.4s hotfix; durable CDN follow-up #415.

🤖 Generated with Claude Code

Production home-page LCP is 5.4s. The LCP element is a featured-post card hero image optimized on demand by /_next/image on a single-vCPU, CPU-throttled Cloud Run instance with an ephemeral optimizer cache and no CDN. The trace showed ~4.7s of pure server-side wait with only ~12ms of download, so the cost is encode time, not bytes. AVIF is listed first in images.formats and is the most CPU-expensive encode. Sources are already WebP and WebP encodes ~2-4x faster on one vCPU, so dropping AVIF removes the dominant per-request work; the larger WebP bytes are irrelevant when download was 12ms. Raising the origin to 2 vCPU / 1Gi gives the optimizer headroom so a single encode no longer pegs the CPU and stalls sibling immutable JS chunks. CPU throttling is intentionally left on: throttling-on still grants full CPU during a request (when the encode runs), whereas --no-cpu-throttling only adds 24/7 instance billing for no encode-latency benefit. The durable edge-cache (CDN) fix is tracked separately in #415. Closes #440 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

julianken-bot

Verdict: APPROVE (code is correct; one non-code SUGGESTION on merge-readiness)

Clean two-line config hotfix. The diff matches issue #440's acceptance criteria exactly, the one load-bearing API value is the literal Next.js framework default (context7-verified), and no consumer-side coupling breaks from dropping AVIF.

Verification ledger (commands run this turn)

gh pr view 442 → 2 files, +2/-2, draft, mergeStateStatus: BLOCKED, head 13def5e (stable across two checks).
gh pr diff 442 → exactly two lines: deploy.yml flags + next.config.ts images.formats.
Read next.config.ts + full deploy.yml deploy step → new flag string is --cpu=2 --memory=1Gi; --allow-unauthenticated --min-instances=1 --max-instances=3 --port=8080 --timeout=60s unchanged; --no-cpu-throttling absent. grep for cpu-throttling across .github/workflows/ confirms no override anywhere.
context7 /vercel/next.js image-config.ts → imageConfigDefault.formats = ['image/webp'], type ImageFormat = 'image/avif' | 'image/webp'. The new value is the v16 default and a valid ImageFormat[] — no type/runtime breakage. (Installed: Next 16.2.6, sharp 0.34.5.)
statusCheckRollup → ESLint/TypeScript/Vitest/Next.js Build/Analyze Bundle/CodeQL all SUCCESS; all four E2E shards SKIPPED (gated on draft != true, e2e-tests.yml:29).
grep tests/ e2e/ src/ for AVIF / images.formats coupling → none beyond the (now-removed-in-#441) blur-placeholder concern; orthogonal. git log on both files → changed lines predate this PR (R7 clean).
R15: 0 mermaid blocks → skipped. R16: no UI source → skipped. R13 fired (.github/workflows/**): T4/T6/T7 clear; shadow-mode, non-verdict-affecting.

Findings (1)

SUGGESTION — PR body claims the E2E shards "run on this PR"; they're SKIPPED while the PR is a draft. No code change needed — marking the PR ready triggers the required shards so they actually run before merge.

Specific praise (not filler)

images.formats: ['image/webp'] lands exactly on the Next.js 16 default — the lowest-risk value — and not adding --no-cpu-throttling is correct: throttling-on still grants full CPU during request processing (when the encode runs), capturing the latency win without always-on billing.

Bottom line

APPROVE on the merits — code correct and verified against current docs and the linked issue. Marking ready so the required E2E shards run before merge.

— @julianken-bot (opus, fresh context)

julianken · 2026-06-01T17:27:02Z

@Mergifyio queue

mergify · 2026-06-01T17:28:18Z

mergify · 2026-06-01T17:30:43Z

julianken · 2026-06-01T17:47:49Z

Post-deploy verification ✅

Re-traced / on prod after the Cloud Run deploy (succeeded 17:31, run 26771000341):

Metric	Before	After
LCP	5,406 ms	339 ms (≈16×)
Hero image load duration	4,723 ms	2 ms (warm)
TTFB	129 ms	72 ms
CLS	0.00	0.00

Hero image content-type flipped image/avif → image/webp (confirmed via curl with an AVIF-accepting Accept header). The AVIF-encode-on-1-vCPU bottleneck is gone — a cold x-nextjs-cache: MISS now re-encodes WebP on 2 vCPU quickly, and warm hits are ~2 ms.

Durable edge-cache (so the first global request per asset also skips the re-encode) remains tracked in #415.

julianken marked this pull request as ready for review June 1, 2026 17:26

julianken-bot approved these changes Jun 1, 2026

View reviewed changes

mergify Bot added the queued label Jun 1, 2026

mergify Bot merged commit 321c86a into main Jun 1, 2026
15 checks passed

mergify Bot deleted the perf/440-drop-avif-cpu branch June 1, 2026 17:31

mergify Bot removed the queued label Jun 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: drop AVIF + raise Cloud Run CPU/mem to fix 5.4s home LCP#442

perf: drop AVIF + raise Cloud Run CPU/mem to fix 5.4s home LCP#442
mergify[bot] merged 1 commit into
mainfrom
perf/440-drop-avif-cpu

julianken commented Jun 1, 2026

Uh oh!

julianken-bot left a comment

Uh oh!

julianken commented Jun 1, 2026

Uh oh!

mergify Bot commented Jun 1, 2026 •

edited

Loading

Uh oh!

mergify Bot commented Jun 1, 2026 •

edited

Loading

Uh oh!

Uh oh!

julianken commented Jun 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

julianken commented Jun 1, 2026

Diagrams

Summary

Screenshots

Test plan

Plan reference

Uh oh!

julianken-bot left a comment

Choose a reason for hiding this comment

Verification ledger (commands run this turn)

Findings (1)

Specific praise (not filler)

Bottom line

Uh oh!

julianken commented Jun 1, 2026

Uh oh!

mergify Bot commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge Queue Status

Uh oh!

mergify Bot commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge Queue Status

Uh oh!

Uh oh!

julianken commented Jun 1, 2026

Post-deploy verification ✅

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mergify Bot commented Jun 1, 2026 •

edited

Loading

mergify Bot commented Jun 1, 2026 •

edited

Loading