Skip to content

Latest commit

 

History

History
48 lines (35 loc) · 1.5 KB

File metadata and controls

48 lines (35 loc) · 1.5 KB

PATIMT-Bench

This repo contains the code for PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models. Please give us a like ❤️ if you find it useful !

arXiv:2506.08938 HuggingFace GitHub


Data Construction

# input.jsonl should contain image path and original OCR
python adaptive_refine.py --input_file input.jsonl 
python gpt_api_label.py

Evaluation

bash eval.sh

Cite

@inproceedings{
zhuang2025patimtbench,
title={{PATIMT}-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models},
author={Wanru Zhuang and Wenbo Li and Zhibin Lan and Xu Han and Peng Li and Jinsong Su},
booktitle={The 2025 Conference on Empirical Methods in Natural Language Processing},
year={2025},
url={https://openreview.net/forum?id=UYcNSzYyi9}
}