Input-Aware Sparse Computation for Scalable Medical and Remote Sensing Vision Models

Lucas Bennett; Peggy R Jones

Vol. 2 No. 1 (2026), Articles

Vol. 2 No. 1 (2026)

Input-Aware Sparse Computation for Scalable Medical and Remote Sensing Vision Models

Articles

Published 2026-05-26

Lucas Bennett
Peggy R Jones

Lucas Bennett

Peggy R Jones

Keywords

sparse computation
IA-SpGEMM

Abstract

Scalable vision models in medical imaging and remote sensing require computational infrastructure that can process large multimodal datasets efficiently. This topic centers on input-aware sparse matrix-matrix multiplication, hyperspectral-LiDAR fusion, forensic vision-language compression, and instance-aware super-resolution. Sparse computation is especially relevant for graph-based biomedical reasoning, large image feature maps, multimodal token representations, and cross-domain retrieval systems. Hyperspectral-LiDAR fusion introduces high-dimensional spatial-spectral data, while forensic vision-language models and super-resolution systems increase the need for efficient token and feature processing. The literature structure highlights the computational layer underlying reliable medical and geospatial AI, where efficiency, scalability, and representation fidelity must be balanced.

References

Dai, Y., Chen, Z., Pradeepkumar, J., Matsubara, Y., Sun, J., Sakurai, Y., & Dong, Y. (2026). EpiGraph: Building Generalists for Evidence-Intensive Epilepsy Reasoning in the Wild. arXiv preprint arXiv:2605.09505.

Yang, J. X., Wang, J., Li, Z., Sui, C., Long, Z., & Zhou, J. (2025). HSLiNets: Evaluating Band Ordering Strategies in Hyperspectral and LiDAR Fusion. IEEE Geoscience and Remote Sensing Letters, 22, 1–5, Article 5505605. https://doi.org/10.1109/LGRS.2025.3567626

Xie, Z., Tan, G., Liu, W., & Sun, N. (2019, June). IA-SpGEMM: An input-aware auto-tuning framework for parallel sparse matrix-matrix multiplication. In Proceedings of the ACM International Conference on Supercomputing (pp. 94–105).

Lai, Y., Yu, Z., Wang, J., Shen, L., Xu, Y., & Cao, X. (2026). ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models. arXiv preprint arXiv:2603.12208.

Guo, Z., Zhao, K., & Zhang, L. (2026). InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing, Barcelona, Spain, 10577–10581. https://doi.org/10.1109/ICASSP55912.2026.11462690

Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of KDD 2016, 785–794.

Vaswani, A., Shazeer, N., Parmar, N., et al. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30.

Kairouz, P., McMahan, H. B., Avent, B., et al. (2021). Advances and open problems in federated learning. Foundations and Trends in Machine Learning, 14(1–2), 1–210.

Rajkomar, A., Oren, E., Chen, K., et al. (2018). Scalable and accurate deep learning with electronic health records. NPJ Digital Medicine, 1, 18.

Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.