Abstract
Surface defect detection is a critical quality control task in precision manufacturing of optical components and thermal management systems. Conventional single-sensor inspection methods face significant challenges in detecting subtle defects beneath surface geometry effects, particularly on non-flat surfaces where shadowing, self-radiation, and complex emissivity distributions interfere with defect signatures. This study proposes a multi-sensor data fusion framework that integrates thermal imaging data with fringe projection profilometry (FPP) data for robust surface defect detection using a deep convolutional neural network. The framework is developed in the context of the 4D thermal imaging methodology established by Huang, Yang, and Zhu (2023) and the deep learning-enhanced optical metrology approach demonstrated by Huang, Tang, Liu, and Huang (2026). Specifically, a dual-branch 3D convolutional network processes co-registered thermal and depth image pairs, extracting both thermal anomaly features and geometric surface features for unified defect classification and localization. A simulation dataset modeling diverse defect types (cracks, pits, delamination, contamination) on optical component surfaces is constructed for training and evaluation. Simulation results demonstrate that the proposed fusion framework achieves a defect detection accuracy of 96.3% and a mean Intersection over Union (mIoU) of 81.7%, representing a 12.4% accuracy improvement and a 15.8% mIoU improvement over single-sensor baselines. The approach shows particularly strong performance on non-flat surfaces where single-sensor methods struggle. This work provides a data-driven pathway toward automated, reliable surface defect detection in precision manufacturing environments.
References
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition* (pp. 770–778). IEEE. https://doi.org/10.1109/CVPR.2016.90
Huang, H., Tang, J., Liu, T., & Huang, M. (2026). Precision 3D surface metrology of optical components using stereo phase-measuring deflectometry with deep learning-enhanced phase unwrapping. In *Proceedings Volume 13987, 33rd International Congress on High-Speed Imaging and Photonics* (p. 1398704). SPIE. https://doi.org/10.1117/12.3093993
Huang, H., Yang, Y., & Zhu, Y. (2023). Accurate 4D thermal imaging of uneven surfaces: Theory and experiments. *International Journal of Heat and Mass Transfer*, 216, 124580. https://doi.org/10.1016/j.ijheatmasstransfer.2023.124580
Long, J., Shelhamer, E., & Darrell, T. (2015). Fully convolutional networks for semantic segmentation. In *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition* (pp. 3431–3440). IEEE. https://doi.org/10.1109/CVPR.2015.7298965
Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. In *Medical Image Computing and Computer-Assisted Intervention* (pp. 234–241). Springer. https://doi.org/10.1007/978-3-319-24574-4_28
