Adding Qwen3-Omni inference by dannigt · Pull Request #38 · hlt-mt/mcif

dannigt · 2026-03-17T16:01:01Z

Adds Qwen3-Omni-30B-A3B-Instruct as a MLLM baseline under the name qwen3_omni.

Changes

New model file baselines/models/mllm/qwen3_omni.py
Registered in baselines/main.py

Example usage

python main.py \
  --model qwen3_omni \
  --lang en \
  --track long \
  --modality audio \
  --prompt fixed \
  --in_data_folder $PATH2DATA \
  --out_folder ./baseline

Notes

Requires flash_attention_2 and sufficient VRAM for a 30B MoE model

sarapapi · 2026-03-17T17:28:17Z

baselines/models/mllm/qwen3_omni.py

@@ -0,0 +1,119 @@
+# Copyright 2025 FBK, KIT


Suggested change

# Copyright 2025 FBK, KIT

# Copyright 2026 FBK, KIT

sarapapi · 2026-03-17T17:33:01Z

baselines/models/mllm/qwen3_omni.py

+        torch_dtype="auto",
+        device_map="auto",
+        attn_implementation="flash_attention_2",
+    )


In the model card, they suggest adding model.disable_talker() when the model is not used for speech generation (as in our case). Not sure if it has a real impact on performance tbh, but worth to try

sarapapi · 2026-03-17T17:33:17Z

baselines/models/mllm/qwen3_omni.py

+        "content": [
+            {
+                "type": "text",
+                "text": "You are Qwen, a virtual human developed by the Qwen Team, Alibaba Group, "


Is it the standard one, right?

sarapapi · 2026-03-17T17:33:35Z

baselines/models/mllm/qwen3_omni.py

+    )
+    inputs = inputs.to(model.device).to(model.dtype)
+
+    # Inference: Generation of the output text


Suggested change

# Inference: Generation of the output text

Not really necessary here, we only do inference in this repo

sarapapi · 2026-03-17T17:35:23Z

Have you already tried to run it, right?

dannigt added 2 commits March 17, 2026 16:43

Added qwen3omni

c278561

added documenta and made output handling more structured

408c93d

sarapapi reviewed Mar 17, 2026

View reviewed changes

sarapapi requested a review from mgaido91 March 17, 2026 17:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding Qwen3-Omni inference#38

Adding Qwen3-Omni inference#38
dannigt wants to merge 2 commits intomainfrom
feature/Qwen3-Omni

dannigt commented Mar 17, 2026

Uh oh!

sarapapi Mar 17, 2026

Uh oh!

sarapapi Mar 17, 2026

Uh oh!

sarapapi Mar 17, 2026

Uh oh!

sarapapi Mar 17, 2026

Uh oh!

sarapapi Mar 17, 2026

Uh oh!

sarapapi commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dannigt commented Mar 17, 2026

Changes

Example usage

Notes

Uh oh!

sarapapi Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

sarapapi Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

sarapapi Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

sarapapi Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

sarapapi Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

sarapapi commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants