PATIMT-Bench

This repo contains the code for PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models. Please give us a like ❤️ if you find it useful !

Data Construction

# input.jsonl should contain image path and original OCR
python adaptive_refine.py --input_file input.jsonl 
python gpt_api_label.py

Evaluation

bash eval.sh

Cite

@inproceedings{
zhuang2025patimtbench,
title={{PATIMT}-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models},
author={Wanru Zhuang and Wenbo Li and Zhibin Lan and Xu Han and Peng Li and Jinsong Su},
booktitle={The 2025 Conference on Empirical Methods in Natural Language Processing},
year={2025},
url={https://openreview.net/forum?id=UYcNSzYyi9}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PATIMT-Bench

Data Construction

Evaluation

Cite

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

PATIMT-Bench

Data Construction

Evaluation

Cite