
Add PR #672 baseline (Cosine TTT30, 1.0781 BPB)#1

Closed
dhruvjatkar wants to merge 1 commit into main from
worktree-agent-a36af41a

Conversation

@dhruvjatkar
Owner

Summary

Test plan

  • python3 -m py_compile passes on train_gpt.py
  • README.md has correct metadata
  • No pycache or build artifacts committed
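The checks in the test plan above can be scripted. A minimal sketch (the `check_tree` helper and its defaults are illustrative; only `train_gpt.py`, the `py_compile` check, and the no-`__pycache__` rule come from the plan):

```python
import pathlib
import py_compile
import tempfile

def check_tree(repo_root: str = ".") -> list[str]:
    """Return a list of problems found; an empty list means the test plan passes."""
    root = pathlib.Path(repo_root)
    # Stray build artifacts: any committed __pycache__ directory is a failure.
    problems = [f"stray artifact: {p}" for p in root.rglob("__pycache__")]
    # Byte-compile train_gpt.py (equivalent to `python3 -m py_compile train_gpt.py`),
    # writing the .pyc to a throwaway dir so the check itself leaves no artifacts.
    try:
        with tempfile.TemporaryDirectory() as tmp:
            py_compile.compile(
                str(root / "train_gpt.py"),
                cfile=str(pathlib.Path(tmp) / "train_gpt.pyc"),
                doraise=True,
            )
    except (py_compile.PyCompileError, FileNotFoundError) as e:
        problems.append(f"py_compile failed: {e}")
    return problems
```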

🤖 Generated with Claude Code

…line reference

Fetched train_gpt.py verbatim from upstream openai/parameter-golf PR openai#672
which achieves 1.0781 BPB (3-seed mean, std=0.0041) using TTT_EPOCHS=30
with cosine TTT schedule. This replaces 1.1194 as the baseline to beat.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
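For context, a cosine TTT schedule over TTT_EPOCHS=30 typically means decaying the test-time-training learning rate along a half cosine. A minimal sketch, assuming a standard cosine decay (the base and minimum learning rates are placeholders, not values from PR #672):

```python
import math

TTT_EPOCHS = 30  # from the commit message; all other values are illustrative

def cosine_ttt_lr(epoch: int, base_lr: float = 1e-3, min_lr: float = 0.0) -> float:
    """Cosine decay from base_lr at epoch 0 to min_lr at the final TTT epoch."""
    progress = epoch / max(TTT_EPOCHS - 1, 1)  # 0.0 -> 1.0 across the schedule
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))
```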
dhruvjatkar pushed a commit that referenced this pull request Mar 25, 2026
PR openai#672 maxes TTT at 30 epochs (590s/600s eval budget), so all future
improvements must be orthogonal to TTT. This update:
- Sets 1.0781 BPB (PR openai#672) as the new target to beat
- Reorders Top 8 directions: XSA-all confirmed at #1, Full GPTQ #2,
  SwiGLU #3, Muon-VS #4, aggressive quant #5, MASA #6,
  depth recurrence #7 with int6 risk warning, AdEMAMix #8
- Deprioritizes TTT-related directions already exploited by PR openai#672
- Collapses ~1000 lines of stale Round 0-3.9 session logs into a
  concise historical summary
- Removes resolved blockers (flash_attn, SSH hangs, local runtime)
- Adds fresh Round 1 section with 5 submitted experiments

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@dhruvjatkar
Owner Author

Merged directly to main via cherry-pick

