Conversation
Code Review
This pull request introduces a "dummy model" mode across several model components, including the Audio Encoder, T5, CLIP, and VAE. This feature enables model initialization with random weights by reading only the metadata from safetensors headers, facilitating testing without loading full checkpoints. Review feedback suggests adding error handling for missing checkpoint files and adjusting the random weight initialization scale to 0.02 to ensure numerical stability during dummy inference.
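The header-only trick the PR relies on can be sketched as follows. This is an illustrative, hypothetical helper (`read_safetensors_header` is not the PR's code), assuming the standard safetensors layout: an 8-byte little-endian header length followed by a JSON index of tensor names, dtypes, shapes, and data offsets.

```python
import json
import struct

# Hypothetical sketch of header-only metadata reading (not the PR's code).
# A .safetensors file starts with an 8-byte little-endian u64 giving the
# JSON header size, followed by the header itself; the tensor data comes
# after and never needs to be read to recover shapes and dtypes.
def read_safetensors_header(path):
    with open(path, "rb") as f:
        header_len = struct.unpack("<Q", f.read(8))[0]
        header = json.loads(f.read(header_len))
    # Drop the optional "__metadata__" entry; keep tensor name -> meta.
    return {k: v for k, v in header.items() if k != "__metadata__"}

# Build a tiny fake checkpoint to demonstrate: one 2x3 float32 tensor.
meta = {
    "model.weight": {"dtype": "F32", "shape": [2, 3], "data_offsets": [0, 24]},
}
blob = json.dumps(meta).encode("utf-8")
with open("dummy.safetensors", "wb") as f:
    f.write(struct.pack("<Q", len(blob)) + blob + b"\x00" * 24)

shapes = {k: v["shape"] for k, v in read_safetensors_header("dummy.safetensors").items()}
print(shapes)  # {'model.weight': [2, 3]}
```

From metadata like this, dummy mode can allocate random tensors of the right shape and dtype without touching the tensor data section.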
```python
if os.path.isdir(safetensors_path):
    safetensors_files = glob.glob(os.path.join(safetensors_path, "*.safetensors"))
else:
    safetensors_files = [safetensors_path]
```
If safetensors_path is a directory containing no .safetensors files, safetensors_files will be an empty list. That produces an empty weight_dict, which can cause silent failures or uninitialized parameters later in the pipeline. It's better to check whether any files were found and raise an informative error if not.
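The suggested guard could look like the sketch below; the helper name, exception type, and message are illustrative, not from the PR.

```python
import glob
import os
import tempfile

# Hypothetical guard for the empty-directory case (sketch, not the PR's
# code): fail loudly instead of silently producing an empty weight_dict.
def collect_safetensors_files(safetensors_path):
    if os.path.isdir(safetensors_path):
        files = glob.glob(os.path.join(safetensors_path, "*.safetensors"))
    else:
        files = [safetensors_path]
    if not files:
        raise FileNotFoundError(
            f"No .safetensors files found under {safetensors_path!r}"
        )
    return files

# An empty directory now raises instead of returning [].
with tempfile.TemporaryDirectory() as empty_dir:
    try:
        collect_safetensors_files(empty_dir)
        raised = False
    except FileNotFoundError:
        raised = True
print(raised)  # True
```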
```python
original_dtype = SAFETENSORS_DTYPE_MAP.get(st_dtype_str)
if original_dtype is not None and not original_dtype.is_floating_point:
    dtype = original_dtype
weight_dict[key] = torch.randn(shape, dtype=dtype, device=dummy_device) if dtype.is_floating_point else torch.zeros(shape, dtype=dtype, device=dummy_device)
```
Using torch.randn with default parameters (mean=0, std=1) for dummy weight initialization can lead to numerical instability or activations exploding in deep networks, potentially causing NaNs during inference. For transformer models, a smaller standard deviation (e.g., 0.02) is generally safer and more representative of actual weight distributions.
Suggested change:

```diff
-weight_dict[key] = torch.randn(shape, dtype=dtype, device=dummy_device) if dtype.is_floating_point else torch.zeros(shape, dtype=dtype, device=dummy_device)
+weight_dict[key] = (torch.randn(shape, dtype=dtype, device=dummy_device) * 0.02) if dtype.is_floating_point else torch.zeros(shape, dtype=dtype, device=dummy_device)
```
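Why the 0.02 scale matters can be seen with a toy depth experiment. The sketch below uses numpy instead of torch and makes simplifying assumptions (purely linear layers, no normalization), but the mechanism is the same: each matmul scales the activation norm by roughly std * sqrt(fan_in), so unit-variance weights compound into astronomically large activations with depth.

```python
import numpy as np

# Toy illustration (not the PR's code): push an activation through 20
# linear layers whose weights are drawn with std=1 vs std=0.02.
# With fan_in=64, std=1 grows the norm ~8x per layer (overflowing
# float16 range in a real model), while std=0.02 keeps it small.
rng = np.random.default_rng(0)

def final_norm(std, depth=20, width=64):
    x = rng.standard_normal(width)
    for _ in range(depth):
        w = rng.standard_normal((width, width)) * std
        x = w @ x
    return float(np.linalg.norm(x))

norm_unit = final_norm(std=1.0)     # explodes: roughly 8**20 growth
norm_scaled = final_norm(std=0.02)  # shrinks toward zero, stays finite
print(norm_unit > 1e6, norm_scaled < 1e-3)  # True True
```

Real checkpoints avoid this because trained transformer weights typically have small standard deviations (on the order of 0.02), which is exactly what the suggested scale imitates.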
```python
original_dtype = SAFETENSORS_DTYPE_MAP.get(meta["dtype"])
if original_dtype is not None and not original_dtype.is_floating_point:
    dtype = original_dtype
adapter_weights_dict[key] = torch.randn(shape, dtype=dtype, device=dummy_device) if dtype.is_floating_point else torch.zeros(shape, dtype=dtype, device=dummy_device)
```
As with the base model weights, using torch.randn with unit variance for dummy adapter weights can cause numerical issues. Scaling by a factor such as 0.02 is recommended for more stable dummy initialization.
Suggested change:

```diff
-adapter_weights_dict[key] = torch.randn(shape, dtype=dtype, device=dummy_device) if dtype.is_floating_point else torch.zeros(shape, dtype=dtype, device=dummy_device)
+adapter_weights_dict[key] = (torch.randn(shape, dtype=dtype, device=dummy_device) * 0.02) if dtype.is_floating_point else torch.zeros(shape, dtype=dtype, device=dummy_device)
```