[1]WANG Kongyuan,BI Ying,GUO Weifeng,et al.A Review of Multimodal Medical Image Classification and Cancer Diagnosis[J].Journal of Zhengzhou University (Engineering Science),2027,48(XX):1-10.[doi:10.13705/j.issn.1671-6833.2026.06.005]
Copy
Journal of Zhengzhou University (Engineering Science)[ISSN
1671-6833/CN
41-1339/T] Volume:
48
Number of periods:
2027 XX
Page number:
1-10
Column:
Public date:
2027-12-10
- Title:
-
A Review of Multimodal Medical Image Classification and Cancer Diagnosis
- Author(s):
-
WANG Kongyuan1, BI Ying1, GUO Weifeng1, LIANG Jing2, WU Fangxiang3
-
1. School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, China; 2. School of Electrical Engineering and Automation, Henan Institute of Technology, Xinxiang 453000, China; 3. Division of Biomedical Engineering, University ofSaskatchewan, Saskatoon S7N 5A9, Canada
-
- Keywords:
-
medical image classification; feature extraction; multimodal fusion; cancer diagnosis; clinical application
- CLC:
-
TP391. 4
- DOI:
-
10.13705/j.issn.1671-6833.2026.06.005
- Abstract:
-
Multimodal medical image classification techniques were able to effectively integrate data from differentimaging modalities and to construct more comprehensive and complementary feature representations across multiplelevels, including structural, functional, and metabolic dimensions. As a result, they markedly improved diseaseclassification performance and enhanced the accuracy and reliability of clinical diagnosis, thereby attracting substantial attention from the research community. This review first introduced the fundamental principles and overallworkflow of multimodal medical image classification, covering key stages such as data preprocessing, feature extraction, multimodal information fusion, and final classification and model evaluation. It also summarized the core ideas and mainstream paradigms of multimodal information fusion. Subsequently, it systematically analyzed and compared four multimodal medical image fusion methods at the methodological level, and discussed their clinical application effects and characteristics, with a particular focus on cancer-related tasks, including thyroid cancer prediction, early gastric cancer screening, immune response prediction, breast cancer diagnosis, and dermatological disease detection. Finally, it summarized existing challenges in the field of multimodal medical image classification,including high data acquisition and annotation costs, strong inter-modality heterogeneity, limited model interpretability, and insufficient generalization and robustness, and it provided an outlook on future research trends.