[WIP] Add Gemini and GPT-Audio by zhopto3 · Pull Request #35 · sarapapi/hearing2translate

zhopto3 · 2026-02-04T14:40:42Z

Description

Model Name: gemini-2.5-flash
Model Type: speechLLM
Checkpoint: N/A
Paper/Technical Report: https://storage.googleapis.com/deepmind-media/gemini/gemini_v2_5_report.pdf
License: Proprietary
Motivation: Most recent non-preview Gemini release that supports audio input.

Outputs

List of benchmarks for which outputs are provided:

FLEURS
CoVoST2
Europarl-ST
WMT

Exclusions

Excluded non-generic benchmarks due to budget constraints

Gldkslfmsd · 2026-02-06T15:48:04Z

Data test and statistics

JAVI897 · 2026-03-05T15:15:48Z

Can we merge it? @zhopto3 @sarapapi

sarapapi · 2026-03-05T15:21:03Z

Can we merge it? @zhopto3 @sarapapi

No, I've asked @zhopto3 to create a clean PR including only Gemini inference code, outputs, and evals. Then, we'll close this one.

zhopto3 and others added 11 commits February 4, 2026 15:20

Gemini infer scripts

717fd70

wmt output for Gemini

16ddd25

seed for reproducability

0d12706

WMT output w consistent seed

3d70b09

fleurs output

866c1fd

add wmt and fleurs evals for gemini

c2b4dc2

add correlations analysis

9180939

remove merge with manifests (not needed)

ebf0024

fix item-level computation to compute Group-by-item Spearman

c589068

add latex table, fix mcif item-level

fd35732

Merge branch 'main' into add-gemini

1b20a12

Gldkslfmsd changed the title ~~[WIP] Add Gemini~~ [WIP] Add Gemini and GPT-Audio Feb 6, 2026

infer code for openrouter gpt-audio

d82877b

Gldkslfmsd and others added 14 commits February 6, 2026 17:46

gpt-audio processed wmt

7a8c7a2

Convert empty outputs to str

9606ed6

Gemini Europarl

0e850b1

script to count duration of test audio files

da89c6e

test and stat script finished and documented

3a8f261

add wmt evals for gpt-audio

81461fe

add europarl evals for gemini-2.5-flash

e9206be

Merge pull request #36 from sarapapi/data-statistics

8761921

Data test and statistics

Intermediate covost outputs WIP

c1bea46

gemini covost2 outputs

ab9443f

add gemini evals on covost2

71217c0

gemini exception handling

4f3aad2

remove gpt-audio results

5834bb2

remove gpt audio wmt out

d74fa3f

JAVI897 changed the title ~~[WIP] Add Gemini and GPT-Audio~~ [WIP] Add Gemini Mar 5, 2026

sarapapi changed the title ~~[WIP] Add Gemini~~ [WIP] Add Gemini and GPT-Audio Mar 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Add Gemini and GPT-Audio#35

[WIP] Add Gemini and GPT-Audio#35
zhopto3 wants to merge 26 commits intomainfrom
add-gemini

zhopto3 commented Feb 4, 2026 •

edited by sarapapi

Loading

Uh oh!

Gldkslfmsd commented Feb 6, 2026 •

edited

Loading

Uh oh!

JAVI897 commented Mar 5, 2026

Uh oh!

sarapapi commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

zhopto3 commented Feb 4, 2026 • edited by sarapapi Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Outputs

Exclusions

Uh oh!

Gldkslfmsd commented Feb 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Outputs

Exclusions

Uh oh!

JAVI897 commented Mar 5, 2026

Uh oh!

sarapapi commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

zhopto3 commented Feb 4, 2026 •

edited by sarapapi

Loading

Gldkslfmsd commented Feb 6, 2026 •

edited

Loading