[1]ZHENG Hong,LUO Yujian,LING Kan,et al.Multimodal Cross-scale Feature Fusion for Drug-Target Affinity Prediction[J].Journal of Zhengzhou University (Engineering Science),2026,47(XX):1-8.[doi:10.13705/j.issn.1671-6833.2026.04.001]
Copy
Journal of Zhengzhou University (Engineering Science)[ISSN
1671-6833/CN
41-1339/T] Volume:
47
Number of periods:
2026 XX
Page number:
1-8
Column:
Public date:
2026-09-10
- Title:
-
Multimodal Cross-scale Feature Fusion for Drug-Target Affinity Prediction
- Author(s):
-
ZHENG Hong ; LUO Yujian; LING Kan; F AN Guisheng
-
School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
-
- Keywords:
-
graph neural network; drug-target affinity prediction; feature learning; cross-scale feature fusion; drug development
- CLC:
-
TP183R91R318
- DOI:
-
10.13705/j.issn.1671-6833.2026.04.001
- Abstract:
-
To address the current challenges of relying solely on single-modal features of target proteins and neglecting network-scale features in biological networks, a drug-target affinity prediction (DTA) model based on multimodal cross-scale feature fusion was proposed. Target proteins as both sequences and graphs for feature extraction were presented, extracting semantic and topological features, respectively, to enhance the target proteins’ feature representation. The strong affinity relationships between drugs and target proteins were analyzed to construct a heterogeneous graph network of drug-target interaction. A cross-scale feature fusion method was then used to effectively integrate the scale features of the heterogeneous graph network, enriching the feature representations of both target proteins and drug molecules. Experimental results on the DAVIS and KIBA datasets demonstrated that, compared to the more advanced model, the proposed model achieved reductions in MSE by 0.015 and 0.003, respectively, and increases in CI by 0.005 and 0.004, improving the accuracy and stability of affinity prediction. It demonstrated the effectiveness of multimodal and cross-scale feature fusion in DTA prediction tasks.