Abstract
Fault diagnosis of bogie core components under complex operating conditions remains a challenging task due to the large number of fault categories and the strong coupling among multi-source sensor signals. To address this issue, this paper proposes a large-scale fault diagnosis method for bogie systems based on multi-sensor information fusion and Vision Transformer (ViT). First, continuous wavelet transform (CWT) is employed to convert multi-sensor vibration signals into time–frequency representations. These representations are then segmented into patches and jointly encoded as input tokens for the ViT model, enabling effective feature extraction and global dependency modeling across multiple sensors.The proposed method is validated on the Beijing Jiaotong University Railway Bogie Dataset (BJTU-RAO), involving a challenging 51-class fault diagnosis task under multiple operating conditions. Experimental results demonstrate that the ViT-based model achieves fast convergence, high diagnostic accuracy, and stable performance, with an average accuracy above 95%.
Keywords
Get full access to this article
View all access options for this article.
