FinDocMRE: A Benchmark for Document-Level Financial Multimodal Reasoning Evaluation

Jun 1, 2025 · Jiayong Zhu, Jiangtong Li, Jinru Ding, Dawei Cheng, Jie Xu, Feng Yu
Abstract
While Large Multimodal Models (LMMs) excel at general visual tasks, their deployment in specialized financial contexts remains limited. We introduce FinDocMRE, a multi-image, document-level benchmark for financial multimodal reasoning. Spanning twelve domains, it comprises 12,207 samples derived from 2,878 financial reports and evaluates multi-image processing and document-level understanding across five distinct task types.
Type
Publication
Under Review