Skip to content

[release/0.2.1] Change the prompt for the finetuning#722

Draft
pulkitg01 wants to merge 2 commits into
release/0.2.1from
pulkitg/release/0.2.1/change_trained_prompt
Draft

[release/0.2.1] Change the prompt for the finetuning#722
pulkitg01 wants to merge 2 commits into
release/0.2.1from
pulkitg/release/0.2.1/change_trained_prompt

Conversation

@pulkitg01
Copy link
Copy Markdown
Collaborator

@pulkitg01 pulkitg01 commented May 26, 2026

Summary

Align static apple prompt

Detailed description

  • Reason: align the simulated static apple workflow with the same prompt used for the real setup.
  • Changed the GR00T LeRobot training config prompt to move the apple to the plate.
  • Updated the closed-loop evaluation config to use the same prompt.
  • Updated the static apple policy-training docs snippet so users finetune with the matching prompt.
  • Impact: training and evaluation now use consistent language conditioning across sim and real workflows.

@pulkitg01 pulkitg01 self-assigned this May 26, 2026
@pulkitg01 pulkitg01 changed the title Change the prompt for the finetuning [release/0.2.1] Change the prompt for the finetuning May 26, 2026
Copy link
Copy Markdown
Contributor

@isaaclab-review-bot isaaclab-review-bot Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Review Summary

This PR simplifies the language instruction prompt from a detailed description to a more concise task statement across 3 configuration files and documentation.

Changes Reviewed

File Change
docs/pages/example_workflows/static_apple/step_3_policy_training.rst Updated documentation prompt
isaaclab_arena_gr00t/lerobot/config/g1_static_apple_config.yaml Updated training config prompt
isaaclab_arena_gr00t/policy/config/g1_static_apple_gr00t_closedloop_config.yaml Updated inference config prompt

Observations

Consistency: The prompt change is applied consistently across all relevant files (documentation, LeRobot training config, and closed-loop policy config).

Commit hygiene: Good to see the follow-up commit removing the trailing period for cleaner prompt formatting.

Clarification Request

The prompt is changing from:

"Pick up the apple from the shelf and place it onto the plate on the same shelf next to it."

to:

"move the apple to the plate"

Question: Could you confirm this matches the prompt used during the finetuning of checkpoint-20000? Language-conditioned policies can be sensitive to prompt wording, so ensuring the inference prompt matches the training prompt is important for optimal performance.


This is an automated review. Please address any questions before merging.

@pulkitg01 pulkitg01 force-pushed the pulkitg/release/0.2.1/change_trained_prompt branch from 23c7a78 to 7080605 Compare May 26, 2026 22:14
@shanereetz
Copy link
Copy Markdown

LGTM 🚀

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants