fix: fix quant config read logic in model loading by Phi-C · Pull Request #1119 · ROCm/ATOM

Phi-C · 2026-06-07T04:15:54Z

Motivation

Fix quant config read procedure in #958. Without this modification, ATOM SGLang benchmark will fail (e.g. https://github.com/ROCm/ATOM/actions/runs/27054539290/job/79856431418).

Technical Details

Test Plan

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

Signed-off-by: Phi-C <chenxjhit@163.com>

valarLip · 2026-06-08T02:58:09Z

        module_prefix = matching_name.split("shared_expert", 1)[0]
        shared_expert_prefix = layer_prefix + matching_name.rstrip(".")
        routed_expert_prefix = layer_prefix + f"{module_prefix}experts"
-        model_quant_config = getattr(getattr(model, "args", None), "quant_config", None)


maybe let's force all models have "quant_config"

It seems not easy to unify the read logic in one path, since 1) for plugins, "model.quant_config" means vllm/sglang's quant_config, which is different from "model.atom_config.quant_config"; 2) for dsv4, it uses "model.args.quant_config".

how about unified to "model.atom_config.quant_config", you can feel free to change dsv4's code for this target

how about unified to "model.atom_config.quant_config", you can feel free to change dsv4's code for this target

For models in ATOM/atom/models, we use "self.config" for atom config, if we want to unified to "model.atom_config.quant_config", it means we have to change all these into "self.atom_config". Maybe we can keep "model.atom_config.quant_config" and "model.quant_config" in this PR to avoid too many modifications, and change all models's "config" to "atom_config" in another PR to unify the read path if necessary?

Signed-off-by: Phi-C <chenxjhit@163.com>

fix: fix quant config read in model loading

27c65ee

Signed-off-by: Phi-C <chenxjhit@163.com>

Phi-C requested a review from ZLkanyo009 June 8, 2026 02:16

ZLkanyo009 previously approved these changes Jun 8, 2026

View reviewed changes

Phi-C requested a review from valarLip June 8, 2026 02:20

valarLip reviewed Jun 8, 2026

View reviewed changes

read DSV4's quant config from model.atom_config.quant_config

94fef90

Signed-off-by: Phi-C <chenxjhit@163.com>

Phi-C dismissed ZLkanyo009’s stale review via 94fef90 June 8, 2026 08:54

valarLip approved these changes Jun 8, 2026

View reviewed changes

valarLip merged commit a8862d5 into ROCm:main Jun 8, 2026
20 of 31 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: fix quant config read logic in model loading#1119

fix: fix quant config read logic in model loading#1119
valarLip merged 2 commits into
ROCm:mainfrom
Phi-C:fix_qwen_config

Phi-C commented Jun 7, 2026

Uh oh!

valarLip Jun 8, 2026

Uh oh!

Phi-C Jun 8, 2026

Uh oh!

valarLip Jun 8, 2026

Uh oh!

Phi-C Jun 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Phi-C commented Jun 7, 2026

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

valarLip Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

Phi-C Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

valarLip Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

Phi-C Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants