What question did this study set out to answer?

The study set out to develop a Vision Transformer (ViT) model that accurately identifies tool-wear categories during end-milling of Inconel 718, and to compare it with a CNN-based EfficientNet-b0 model.

May 29, 2026 | Open Access

Implementation of Vision Transformer Model for Robust Tool Wear Monitoring in Milling of Inconel 718

Key Points

The study aims to develop a Vision Transformer (ViT) model for accurate tool wear monitoring, particularly in end-milling of Inconel 718.
The ViT model is compared with the CNN-based EfficientNet-b0 model for tool wear identification.
The ViT model is validated on two previously unseen image datasets, one acquired under conditions similar to the training data and one under varying lighting conditions.
The comparison covers classification accuracy and computational efficiency.
The ViT model achieved higher classification accuracy than EfficientNet-b0.
The ViT model required fewer training epochs to converge.
The ViT model generalized well and remained robust under different lighting conditions.

Abstract

Tool wear monitoring is essential for ensuring machining efficiency and product quality, particularly for difficult-to-machine materials such as Inconel 718 (IN718). When used in image-based tool wear identification systems, traditional deep learning models such as conventional convolutional neural networks (CNNs) often struggle to capture complex wear patterns and lose accuracy across varying machining conditions. To address these limitations, this paper presents a Vision Transformer (ViT) model for identifying tool-wear categories during end-milling of IN718. The performance of the ViT-based model is systematically compared with that of a CNN-based EfficientNet-b0 model. The robustness and generalization of the ViT-based model are validated on two previously unseen image datasets: one acquired under conditions similar to those of the training data and another acquired under varying lighting conditions. The results indicate that the ViT model outperforms the EfficientNet-b0 model in both classification accuracy and computational efficiency: it achieves higher accuracy and converges in fewer training epochs. Furthermore, it generalizes well across different lighting conditions, demonstrating robustness to variations in the machining environment. These findings demonstrate ViT’s effectiveness in tool wear classification and its potential as a reliable, efficient algorithm for developing tool wear monitoring systems for practical machining applications.
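
The evaluation protocol described in the abstract (scoring each model on two held-out image sets) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the wear category names, the dataset keys, the model names `model_a` and `model_b`, and the label values are hypothetical placeholders, not data or results from the study.

```python
def accuracy(true_labels, predicted_labels):
    """Fraction of images whose predicted wear category matches the true label."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("cannot score an empty dataset")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


def compare_models(datasets, predictions):
    """Return {model_name: {dataset_name: accuracy}} for every model and held-out set."""
    return {
        model: {name: accuracy(labels, preds[name]) for name, labels in datasets.items()}
        for model, preds in predictions.items()
    }


# Hypothetical placeholder labels (NOT from the study): two unseen datasets,
# one resembling the training conditions and one with varied lighting.
datasets = {
    "similar_conditions": ["low", "medium", "high", "high"],
    "varied_lighting": ["low", "low", "medium", "high"],
}

# Hypothetical placeholder predictions for two generic classifiers.
predictions = {
    "model_a": {
        "similar_conditions": ["low", "medium", "high", "high"],
        "varied_lighting": ["low", "low", "medium", "medium"],
    },
    "model_b": {
        "similar_conditions": ["low", "high", "high", "high"],
        "varied_lighting": ["medium", "low", "medium", "medium"],
    },
}

scores = compare_models(datasets, predictions)
for model, per_dataset in scores.items():
    for name, value in per_dataset.items():
        print(f"{model} on {name}: {value:.2f}")
```

Reporting accuracy per dataset, rather than pooled over both, keeps any drop under varied lighting visible, which is the kind of robustness the abstract describes.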
