Abstract
Trustworthy artificial intelligence requires evaluation across diverse domains where model errors can produce significant operational, financial, security, or clinical consequences. This article proposes a cross-domain benchmarking framework for trustworthy AI systems in visual recognition, financial risk assessment, cybersecurity anomaly detection, and medical decision support. The framework compares models along four dimensions: predictive performance, interpretability, robustness, and evidence traceability. Visual tasks emphasize wavelet attention, semantic-guided encoding, melanoma recognition, and breast cancer image segmentation; financial tasks focus on fraud and credit risk modeling; cybersecurity tasks involve anomaly detection and vulnerability prioritization; medical tasks evaluate evidence-intensive clinical reasoning, multimodal medical imaging, neurodegenerative disease analysis, oncology risk prediction, and transcriptomic regulation. By using a shared evaluation structure across different domains, the benchmark enables systematic comparison of AI reliability under varying data types and decision contexts. The article argues that trustworthy AI should be assessed not only by accuracy but also by explanation quality, domain alignment, and human usability. The study provides a methodological foundation for integrated AI risk evaluation.
References
Zhu, Y. (2026). An Image Recognition Method Based on Multi-Scale Wavelet Transform Convolution and Convolutional Block Attention. Conference Paper.
Dai, Y., Chen, Z., Pradeepkumar, J., Matsubara, Y., Sun, J., Sakurai, Y., & Dong, Y. (2026). EpiGraph: Building Generalists for Evidence-Intensive Epilepsy Reasoning in the Wild. arXiv preprint arXiv:2605.09505.
Guo, Z., Zhao, K., & Zhang, L. (2026). InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 10577–10581. doi: 10.1109/ICASSP55912.2026.11462690.
Yang, J., Chung, C. I., Koach, J., Liu, H., Navalkar, A., He, H., et al., & Shu, X. (2024). MYC phase separation selectively modulates the transcriptome. Nature Structural & Molecular Biology, 31(10), 1567–1579. doi: 10.1038/s41594-024-01322-6.
Chung, C. I., Yang, J., Yang, X., Liu, H., Ma, Z., Szulzewsky, F., Holland, E. C., Shen, Y., & Shu, X. (2024). Phase separation of YAP-MAML2 differentially regulates the transcriptome. Proceedings of the National Academy of Sciences of the United States of America, 121(7), e2310430121. doi: 10.1073/pnas.2310430121.
Bao, W., Xu, K., & Leng, Q. (2024). Research on the Financial Credit Risk Management Model of Real Estate Supply Chain Based on GA-SVM Algorithm. Procedia Computer Science, 243, 900–909.
Wang, C., Zheng, G., Zhang, R., & Liu, X. (2026). DPPF: Dual-Path Pre-Fusion With Semantic-Guided Encoding for Remote Sensing Image Captioning. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.
Xie, S., Xu, L., Lei, C., Wang, J., Wang, J., Wang, Z., Sun, Y., Li, D., Li, F., Lin, R., et al. (2026). RST2G: Residual-Guided Spatiotemporal Transformer Graph Fusion Enhancement for Breast Cancer Segmentation in DCE-MRI. Cyborg and Bionic Systems, 7, 0502.
Liu, Y., Li, C., Li, F., Lin, R., Zhang, D., & Lian, Y. (2025). Advances in computer vision and deep learning-facilitated early detection of melanoma. Briefings in Functional Genomics, 24, elaf002.
Lang, H., Zhou, Y., Yu, Y., Su, Z., Zhuge, H., Wang, W., Fang, D., Qin, J., Wei, M., et al. (2026). Multi-modal low-dose medical imaging through instruction-guided unified AI. Frontiers in Medicine, 13, 1691143.
Huang, J., Wang, S., Liao, X., Su, D., Lin, R., Zhang, T., & Zhao, L. (2025). Knowledge map of artificial intelligence in neurodegenerative diseases: a decade-long bibliometric and visualization study. Frontiers in Aging Neuroscience, 17, 1586282.
Li, C., Shao, S., Mikason, W., Lin, R., & Liu, Y. (2024). Utilizing Computer Vision for Continuous Monitoring of Vaccine Side Effects in Experimental Mice. arXiv preprint arXiv:2404.03121.
Ren, X., Ma, Y., Li, J., Liu, Y., Liao, X., Lin, R., & Qiu, Z. (2025). Development of an immune scoring system based on exosome-related gene expression for prognosis and treatment response prediction in breast cancer. Discover Oncology, 16, 957.
Arrieta, A. B., Díaz-Rodríguez, N., Del Ser, J., et al. (2020). Explainable artificial intelligence: Concepts, taxonomies, opportunities and challenges. Information Fusion, 58, 82–115.
Rudin, C. (2019). Stop explaining black box machine learning models for high-stakes decisions. Nature Machine Intelligence, 1, 206–215.
