CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model
Apr 7, 2026·,,,,·
0 min read
Jiangtong Li
Yiyun Zhu
Dawei Cheng
Zhijun Ding
Changjun Jiang

Abstract
Multimodal Large Language Models (MLLMs) have rapidly evolved with the growth of Large Language Models (LLMs) and are now applied in various fields. In finance, the integration of diverse modalities such as text, charts, and tables is crucial for accurate and efficient decision-making. In this paper, we introduce CFBenchmark-MM, a Chinese multimodal financial benchmark with over 9,000 image-question pairs featuring tables, histogram charts, line charts, pie charts, and structural diagrams. Additionally, we develop a staged evaluation system to assess MLLMs in handling multimodal information by providing different visual content step by step.
Type
Publication
Big Data Mining and Analytics (BDMA 2026)