Abstract
Labeled training data is the primary bottleneck for deploying deep learning in precision optical surface metrology, where expert annotation is expensive, time-consuming, and subject to inter-observer variability. This study proposes a label-efficient deep learning framework for optical surface metrology that combines self-supervised pretraining on large volumes of unlabeled measurement data with active learning-based sample selection for annotation, dramatically reducing the number of required labeled samples. Built upon the deep learning methodologies established by Huang, Tang, Liu, and Huang (2026) in optical metrology and the 4D thermal imaging approach of Huang, Yang, and Zhu (2023), the framework first pretrains convolutional encoders on unlabeled optical measurement data using a contrastive learning objective, learning rich representations of surface geometry and thermal patterns without any manual labels. Subsequently, an active learning module selects the most informative unlabeled samples for manual annotation, focusing annotation effort on the data points that will most improve model performance. Evaluated across three representative optical metrology tasks—thermal image reconstruction, phase unwrapping, and surface defect detection—the framework achieves within 5% of fully supervised performance using only 8–12% of the labeled dataset. Active learning sample selection outperforms random selection by 31% in label efficiency, and self-supervised pretraining provides a 2.3× speedup in convergence during fine-tuning. The proposed framework provides a practical pathway toward data-efficient deep learning deployment in precision optical metrology with limited annotation budgets.
References
Chen, T., Kornblith, S., Norouzi, M., & Hinton, G. (2020). A simple framework for contrastive learning of visual representations. In *Proceedings of the 37th International Conference on Machine Learning* (pp. 1597–1607). PMLR.
He, K., Fan, H., Wu, Y., Xie, S., & Girshick, R. (2020). Momentum contrast for unsupervised visual representation learning. In *Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition* (pp. 9729–9738). IEEE. https://doi.org/10.1109/CVPR42600.2020.00975
Huang, H., Tang, J., Liu, T., & Huang, M. (2026). Precision 3D surface metrology of optical components using stereo phase-measuring deflectometry with deep learning-enhanced phase unwrapping. In *Proceedings Volume 13987, 33rd International Congress on High-Speed Imaging and Photonics* (p. 1398704). SPIE. https://doi.org/10.1117/12.3093993
Huang, H., Yang, Y., & Zhu, Y. (2023). Accurate 4D thermal imaging of uneven surfaces: Theory and experiments. *International Journal of Heat and Mass Transfer*, 216, 124580. https://doi.org/10.1016/j.ijheatmasstransfer.2023.124580
Settles, B. (2009). Active learning literature survey. University of Wisconsin-Madison, Department of Computer Sciences. https://api.semanticscholar.org/CorpusID:195035976
