Skip to content

[WIP] Add Gemini and GPT-Audio#35

Open
zhopto3 wants to merge 26 commits intomainfrom
add-gemini
Open

[WIP] Add Gemini and GPT-Audio#35
zhopto3 wants to merge 26 commits intomainfrom
add-gemini

Conversation

@zhopto3
Copy link
Collaborator

@zhopto3 zhopto3 commented Feb 4, 2026

Description

Outputs

List of benchmarks for which outputs are provided:

  • FLEURS
  • CoVoST2
  • Europarl-ST
  • WMT

Exclusions

Excluded non-generic benchmarks due to budget constraints

@Gldkslfmsd Gldkslfmsd changed the title [WIP] Add Gemini [WIP] Add Gemini and GPT-Audio Feb 6, 2026
@Gldkslfmsd
Copy link
Collaborator

Gldkslfmsd commented Feb 6, 2026

Description

  • Model Name: gpt-audio
  • Model Type: speechLLM
  • Checkpoint: N/A
  • Paper/Technical Report:
  • License:
  • Motivation:

Outputs

List of benchmarks for which outputs are provided:

  • FLEURS
  • CoVoST2
  • Europarl-ST
  • WMT
  • WinoST
  • CommonAccent
  • ManDi
  • CS-Dialogue
  • CS-FLEURS
  • LibriStutter
  • NoisyFLEURS (ambient)
  • NoisyFLEURS (babble)
  • EmotionTalk
  • mExpresso
  • ACL 60/60 (long)
  • ACL 60/60 (short)
  • MCIF (long)
  • MCIF (short)

Exclusions

too expensive

@JAVI897 JAVI897 changed the title [WIP] Add Gemini and GPT-Audio [WIP] Add Gemini Mar 5, 2026
@JAVI897
Copy link
Collaborator

JAVI897 commented Mar 5, 2026

Can we merge it? @zhopto3 @sarapapi

@sarapapi sarapapi changed the title [WIP] Add Gemini [WIP] Add Gemini and GPT-Audio Mar 5, 2026
@sarapapi
Copy link
Owner

sarapapi commented Mar 5, 2026

Can we merge it? @zhopto3 @sarapapi

No, I've asked @zhopto3 to create a clean PR including only Gemini inference code, outputs, and evals. Then, we'll close this one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants