Adjust kv_cache_size calculation based on local_attn_size#1008
Conversation
Code Review
This pull request updates the KV cache initialization in transformer_infer.py to account for local attention settings. A review comment suggests using self.max_attention_size instead of the manual calculation, to stay consistent with configuration overrides and avoid a potential runtime crash or wasted memory.
```python
if self.local_attn_size != -1:
    kv_cache_size = self.local_attn_size * self.frame_seq_length // ws
else:
    kv_cache_size = self._kv_size // ws
self.kv_cache_size = kv_cache_size
```
The current logic for calculating kv_cache_size ignores the max_attention_size override from the configuration (cfg_max in reinit_caches). If cfg_max is set to a value larger than the calculated local_attn_size * self.frame_seq_length, the attention mechanism will attempt to access indices outside the allocated KV cache, leading to a crash. Conversely, if cfg_max is smaller, memory is wasted. Since self.max_attention_size already correctly accounts for this override and sequence parallelism when local_attn_size != -1, it should be used to set the cache size to ensure consistency and robustness.
Suggested change:

```diff
-if self.local_attn_size != -1:
-    kv_cache_size = self.local_attn_size * self.frame_seq_length // ws
-else:
-    kv_cache_size = self._kv_size // ws
-self.kv_cache_size = kv_cache_size
+if self.local_attn_size != -1:
+    self.kv_cache_size = self.max_attention_size
+else:
+    self.kv_cache_size = self._kv_size // ws
```
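The failure mode described in the review can be illustrated with a minimal, self-contained sketch. The attribute names (local_attn_size, frame_seq_length, _kv_size, max_attention_size) follow the snippet; the concrete numbers and the idea that max_attention_size folds in a larger cfg_max override are illustrative assumptions, not values from the PR:

```python
def kv_cache_size_manual(local_attn_size, frame_seq_length, _kv_size, ws):
    # Current logic: derives the cache size directly from local_attn_size,
    # ignoring any max_attention_size / cfg_max override.
    if local_attn_size != -1:
        return local_attn_size * frame_seq_length // ws
    return _kv_size // ws

def kv_cache_size_suggested(local_attn_size, max_attention_size, _kv_size, ws):
    # Suggested logic: max_attention_size already accounts for the cfg_max
    # override and sequence parallelism when local_attn_size != -1.
    if local_attn_size != -1:
        return max_attention_size
    return _kv_size // ws

# Hypothetical scenario: cfg_max raises max_attention_size to 2048, which is
# larger than local_attn_size * frame_seq_length = 1024. The manual cache is
# then smaller than the attention window, so attention indices up to 2048
# would fall outside the allocated 1024-slot cache.
manual = kv_cache_size_manual(local_attn_size=4, frame_seq_length=256,
                              _kv_size=8192, ws=1)
suggested = kv_cache_size_suggested(local_attn_size=4, max_attention_size=2048,
                                    _kv_size=8192, ws=1)
print(manual, suggested)  # 1024 2048
```

With cfg_max smaller than the manual product, the situation reverses: the manual cache over-allocates and the extra slots are never addressed, which is the memory-waste case the review mentions.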