Instruction-Guided Multimodal Medical AI for Imaging, Oncology, and Biomedical Decision Support

Keywords

medical artificial intelligence
multimodal medical imaging

Abstract

Multimodal medical artificial intelligence is increasingly used to integrate imaging, molecular, clinical, and textual evidence for disease detection, prognosis prediction, and treatment decision support. This article proposes an instruction-guided multimodal medical AI framework that combines medical image analysis, oncology knowledge extraction, transcriptomic evidence, and clinical reasoning. The framework is designed to support tasks such as low-dose medical image interpretation, breast cancer DCE-MRI segmentation, melanoma detection, oncology risk assessment, and evidence-based biomedical decision support. Vision transformers and convolutional attention networks are used to extract visual features, while instruction-guided language models organize clinical and molecular evidence into interpretable summaries. Transcriptomic and phase-separation studies are incorporated as biomedical knowledge sources to improve the biological grounding of model explanations. The framework emphasizes reliability, traceability, and human-centered interpretation, particularly in high-stakes medical scenarios where algorithmic recommendations require clinical validation. By integrating multimodal imaging, molecular biology, and explainable AI, the article provides a cross-disciplinary approach to medical intelligence systems.

References

Guo, Z., Zhao, K., & Zhang, L. (2026). InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 10577–10581. doi: 10.1109/ICASSP55912.2026.11462690.

Yang, J., Chung, C. I., Koach, J., Liu, H., Navalkar, A., He, H., et al., & Shu, X. (2024). MYC phase separation selectively modulates the transcriptome. Nature Structural & Molecular Biology, 31(10), 1567–1579. doi: 10.1038/s41594-024-01322-6.

Chung, C. I., Yang, J., Yang, X., Liu, H., Ma, Z., Szulzewsky, F., Holland, E. C., Shen, Y., & Shu, X. (2024). Phase separation of YAP-MAML2 differentially regulates the transcriptome. Proceedings of the National Academy of Sciences of the United States of America, 121(7), e2310430121. doi: 10.1073/pnas.2310430121.

Lang, H., Zhou, Y., Yu, Y., Su, Z., Zhuge, H., Wang, W., Fang, D., Qin, J., Wei, M., et al. (2026). Multi-modal low-dose medical imaging through instruction-guided unified AI. Frontiers in Medicine, 13, 1691143.

Xie, S., Xu, L., Lei, C., Wang, J., Wang, J., Wang, Z., Sun, Y., Li, D., Li, F., Lin, R., et al. (2026). RST2G: Residual-Guided Spatiotemporal Transformer Graph Fusion Enhancement for Breast Cancer Segmentation in DCE-MRI. Cyborg and Bionic Systems, 7, 0502.

Liu, Y., Li, C., Li, F., Lin, R., Zhang, D., & Lian, Y. (2025). Advances in computer vision and deep learning-facilitated early detection of melanoma. Briefings in Functional Genomics, 24, elaf002.

Zhang, C., Liu, W., Yang, P., Lin, R., Pu, L., & Zhang, H. (2025). Dual roles of innate immune cells and cytokines in shaping the breast cancer microenvironment. Frontiers in Immunology, 16, 1654947.

Ren, X., Ma, Y., Li, J., Liu, Y., Liao, X., Lin, R., & Qiu, Z. (2025). Development of an immune scoring system based on exosome-related gene expression for prognosis and treatment response prediction in breast cancer. Discover Oncology, 16, 957.

Zhao, Z., Chen, X., Feng, Y., Lei, Y., Deng, X., Lin, R., Ling, J., Hou, Z., Yang, Y., et al. (2025). Identification of the important role of CA9 in immune infiltration and prognosis in cervical cancer. Future Science OA, 11(1), 2532314.

Liu, Y., Zhang, D., Lin, R., Lian, Y., & Zhang, W. (2026). Age-stratified risk analysis of gastric cancer: a retrospective hospital-based study of Helicobacter pylori, smoking, and dietary patterns in South China across three age groups. Frontiers in Oncology, 16, 1677546.

Wang, J., Tian, Q., Liu, Y., Cai, C. Y., Fu, S., Li, J., Guan, Y., Liao, X., Su, D., Sun, T., et al. (2025). Targeting metalloptosis in tumor therapy: from molecular mechanisms to application of metal nanoparticles. Molecular Cancer, 24(1), 1–61.

Huang, J., Wang, S., Liao, X., Su, D., Lin, R., Zhang, T., & Zhao, L. (2025). Knowledge map of artificial intelligence in neurodegenerative diseases: a decade-long bibliometric and visualization study. Frontiers in Aging Neuroscience, 17, 1586282.

Li, C., Lin, R., Liu, Y., & Wei, Y. (2024). Evaluating Cognitive and Neuropsychological Assessments—A Comprehensive Review. arXiv preprint arXiv:2402.14655.

Rajkomar, A., Oren, E., Chen, K., et al. (2018). Scalable and accurate deep learning with electronic health records. NPJ Digital Medicine, 1, 18.

Esteva, A., Robicquet, A., Ramsundar, B., et al. (2019). A guide to deep learning in healthcare. Nature Medicine, 25, 24–29.