Frequency-Aware Convolutional Attention for Industrial Defect Image Classification

Keywords

industrial defect detection

Abstract

Industrial defect image classification requires models that can distinguish subtle texture changes, small structural abnormalities, and low-contrast surface defects. Conventional convolutional neural networks can extract hierarchical features, but they may lose fine-grained information during pooling and downsampling. This article proposes a frequency-aware convolutional attention model for industrial defect classification. The framework introduces multi-scale wavelet transform convolution to capture high-frequency defect patterns and low-frequency shape information simultaneously. A convolutional block attention module is then used to refine feature maps by learning channel-wise and spatial importance. The proposed model is suitable for surface defect detection in metal, textile, electronic component, and manufacturing inspection scenarios. The article argues that integrating frequency-domain analysis with deep learning improves recognition reliability under noise, uneven illumination, and background clutter. Experimental design focuses on classification accuracy, robustness, interpretability, and computational efficiency. The study provides a practical direction for intelligent visual inspection systems in industrial quality control.

References

Zhu, Y. (2026). An Image Recognition Method Based on Multi-Scale Wavelet Transform Convolution and Convolutional Block Attention. Conference Paper.

Wang, C., Zheng, G., Zhang, R., & Liu, X. (2026). DPPF: Dual-Path Pre-Fusion With Semantic-Guided Encoding for Remote Sensing Image Captioning. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

Guo, Z., Zhao, K., & Zhang, L. (2026). InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 10577–10581. doi: 10.1109/ICASSP55912.2026.11462690.

Woo, S., Park, J., Lee, J. Y., & Kweon, I. S. (2018). CBAM: Convolutional Block Attention Module. European Conference on Computer Vision, 3–19.

He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of CVPR 2016, 770–778.

Simonyan, K., & Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. International Conference on Learning Representations.

Szegedy, C., Liu, W., Jia, Y., et al. (2015). Going deeper with convolutions. Proceedings of CVPR 2015, 1–9.

Lin, T. Y., Dollár, P., Girshick, R., et al. (2017). Feature Pyramid Networks for object detection. Proceedings of CVPR 2017.

Liu, P., Zhang, H., Lian, W., & Zuo, W. (2019). Multi-level Wavelet Convolutional Neural Networks. arXiv preprint arXiv:1907.03128.

Zhao, X., Zhang, W., & Xiao, X. (2022). Wavelet-Attention CNN for image classification. Multimedia Systems.

Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.