Merge ACE2.1-ERA5 (AIMIP) training and evaluation baseline configs by brianhenn · Pull Request #1027 · ai2cm/ace

brianhenn · 2026-03-31T18:35:33Z

This PR adds the full set of scripts and configurations for "ACE2.1-ERA5" — a modification of the deterministic ACE2-ERA5 model trained and evaluated under the AIMIP protocol. It also merges main to pick up the SecondaryDecoderConfig API used for the pressure-level decoder fine-tuning stage, while retaining all of the job names, output paths, and checkpoint IDs as used on the original branch where the workflow actually occurred.

Changes:

configs/baselines/era5-aimip/ — new directory containing all scripts and configs for the ACE2.1-ERA5 pipeline (previously configs/baselines/era5/aimip/)
- run-ace-train.sh / ace-train-config.yaml — train 4-seed ensemble on ERA5 1979–2008
- run-ace-evaluator-seed-selection.sh / run-ace-evaluator-seed-selection-single.sh — evaluate trained and fine-tuned checkpoints to select best seeds
- run-ace-fine-tune-decoder-pressure-levels.sh / ace-fine-tune-pressure-level-separate-decoder-config.yaml — fine-tune a secondary MLP decoder for 65 pressure-level diagnostic variables, using secondary_decoder (main's SecondaryDecoderConfig)
- run-ace-inference.sh / ace-aimip-inference-{,p2k-,p4k-}config.yaml — 46-year inference with 5 ICs × 3 SST scenarios; IC label expansion done via inline sed at job time (eliminates 15 near-identical committed config files)
- README.md — documents the intended workflow
Tests added
If dependencies changed, "deps only" image rebuilt and "latest_deps_only_image.txt" file updated

Remove intermediate fine-tuning explorations superseded by the separate-decoder + LR warmup approach (the final model). Trim the fine-tuning and seed-selection launch scripts to reference only the final approach. Deleted configs (non-final fine-tuning variants): - ace-fine-tune-decoder-pressure-level-config.yaml - ace-fine-tune-decoder-pressure-level-lr-warmup-config.yaml - ace-fine-tune-decoder-pressure-level-frozen-config.yaml - ace-fine-tune-decoder-pressure-level-frozen-lr-warmup-config.yaml - ace-fine-tune-decoder-pressure-level-reweight-config.yaml - ace-fine-tune-decoder-pressure-level-separate-decoder-config.yaml - restart-ace-fine-tune-decoder-pressure-levels.sh

Resolves conflicts in fme/core/step/single_module.py and test_step.py by accepting main's SecondaryDecoderConfig/SecondaryDecoder approach and dropping the branch's inline MLP + additional_diagnostic_names approach. Updates AIMIP configs to use the new secondary_decoder config format and moves loss/parameter_init from stepper to stepper_training per the TrainConfig restructuring in main.

…der-config.yaml

Delete 15 pre-generated IC-specific config files and instead do the _r[N]i label substitution inside the gantry container at job runtime via sed, keeping only the 3 template configs committed.

configs/baselines/era5-aimip/ace-evaluator-seed-selection-single-config.yaml

configs/baselines/era5-aimip/ace-evaluator-seed-selection-config.yaml

Arcomano1234

Left a few comments / questions for my own curiosity but this looks mostly good to go. I've been using some of these scripts so it will be nice to have in main. My only real comment is removing your hard-coded wandb name in a lot of the job submission scripts.

Arcomano1234 · 2026-03-31T21:33:05Z

configs/baselines/era5-aimip/README.md

+- `run-ace-evaluator-seed-selection-single.sh` — single continuous 36-year run (1978-10-01 to
+  2014-12-31). Config: `ace-evaluator-seed-selection-single-config.yaml`.
+
+After reviewing results, update the base checkpoint ID in `run-ace-fine-tune-decoder-pressure-levels.sh`.


Question: What criteria did we use for AIMIP to determine the best seed? As I am still not sure I understand what the run-ace-evaluator-seed-selection.sh eval tells use compared to run-ace-evaluator-seed-selection-single.sh.

My suggestion is maybe add one more sentence on how the best seed was chosen at this stage.

Will add a sentence, something like "hopefully the seed with the best time-mean climate/trends is the same across both evaluations otherwise make a subjective decision".

Thanks, its definitely the most hand-wavy part of the process and even something vaguely similar to what we did is fine just so its documented

Arcomano1234 · 2026-03-31T21:35:01Z

configs/baselines/era5-aimip/run-ace-evaluator-seed-selection-single.sh

+CONFIG_FILENAME="ace-evaluator-seed-selection-single-config.yaml"
+SCRIPT_PATH=$(git rev-parse --show-prefix)  # relative to the root of the repository
+CONFIG_PATH=$SCRIPT_PATH/$CONFIG_FILENAME
+BEAKER_USERNAME=bhenn1983


Nit: I didn't check all of the bash scripts but it would be nice to not have your username hardcoded here. I suggest BEAKER_USERNAME=$(beaker account whoami --format=json | jq -r '.[0].name')

Thanks for catching this. Since my beaker name and wandb name don't match I always have to change this 🙄

brianhenn added 30 commits November 5, 2025 14:18

AIMIP ACE train 4 random seeds

58abeef

Merge branch 'main' into workflow/aimip-ace

b992032

t add seed selection evaluation runs for ensemble

78a7847

add full 36-year seed evaluation runs

1cfb928

fine-tuning of decoder for pressure-level outputs

d5eee3f

add downweighted q fine-tuning training

ba3a473

rename evaluator w/ single IC to evaluator seed selection single

5305abe

add in-sample evaluator run of fine-tuned checkpoint

1c75485

add script restarting fine-tuning with more epochs

604178b

add in-sample evaluator runs of all fine-tuned checkpoints

f19c133

fix fine-tuning restart script

5f66857

add fine-tune pressure-levels with frozen decoder jobs

7e19d27

add fine-tuned checkpoints to seed selection evaluator

0b0c67d

add fine-tuned with downweighted q to seed selection evaluations

ff3ff8f

add LR warmup to frozen and unfrozen FT cases

07b20a5

add reweighting fine-tuning case

7cc3f85

squash merge separate decoder branch

836084d

add separate decoder case

623a8a3

add separate decoder + LR warmup case

c2edb65

add full 4 seeds to separate decoder + LR warmup

ae1e6a7

Merge branch 'main' into workflow/aimip-ace

ed6b3fc

add separate decoder checkpoints to in sample evaluations

8d47482

AIMIP inference configuration with full set of outputs and simulations

0864e5d

save results to weka instead of beaker datasets

4f15405

add model-layer outputs and rename files for p2k/p4k

fab699e

fix first daily data subset typo

01646d1

save data as 'huss' instead of 'tdas'

4bb021e

filename updates for CMIP conventions

b510bea

only use 'gr' for datasets with vertical dim, otherwise 'gn'

ea505f1

brianhenn added 6 commits March 31, 2026 10:52

Rename fine-tune config to ace-fine-tune-pressure-level-separate-deco…

cc59001

…der-config.yaml

Replace IC-specific inference configs with inline sed expansion

8273c14

Delete 15 pre-generated IC-specific config files and instead do the _r[N]i label substitution inside the gantry container at job runtime via sed, keeping only the 3 template configs committed.

Move configs/baselines/era5/aimip to configs/baselines/era5-aimip

6c0d830

add README.md

bbb07bb

Merge branch 'main' into workflow/aimip-ace-to-merge

69d6b1f

brianhenn marked this pull request as ready for review March 31, 2026 19:00

brianhenn mentioned this pull request Mar 31, 2026

Add scripts/aimip_postprocessing for CMIP6-compliant AIMIP result processing #1023

Open

2 tasks

brianhenn commented Mar 31, 2026

View reviewed changes

configs/baselines/era5-aimip/ace-evaluator-seed-selection-single-config.yaml Show resolved Hide resolved

Arcomano1234 reviewed Mar 31, 2026

View reviewed changes

configs/baselines/era5-aimip/ace-evaluator-seed-selection-config.yaml Show resolved Hide resolved

Arcomano1234 approved these changes Mar 31, 2026

View reviewed changes

brianhenn added 3 commits March 31, 2026 15:29

Replace hardcoded BEAKER_USERNAME with dynamic beaker whoami lookup

e3b9597

add seed selection method to readme

34bdac4

Merge branch 'main' into workflow/aimip-ace-to-merge

60e0d37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge ACE2.1-ERA5 (AIMIP) training and evaluation baseline configs#1027

Merge ACE2.1-ERA5 (AIMIP) training and evaluation baseline configs#1027
brianhenn wants to merge 39 commits intomainfrom
workflow/aimip-ace-to-merge

brianhenn commented Mar 31, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Arcomano1234 left a comment

Uh oh!

Arcomano1234 Mar 31, 2026

Uh oh!

brianhenn Mar 31, 2026

Uh oh!

Arcomano1234 Apr 1, 2026

Uh oh!

Arcomano1234 Mar 31, 2026

Uh oh!

brianhenn Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

brianhenn commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Arcomano1234 left a comment

Choose a reason for hiding this comment

Uh oh!

Arcomano1234 Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

brianhenn Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

Arcomano1234 Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Arcomano1234 Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

brianhenn Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

brianhenn commented Mar 31, 2026 •

edited

Loading