A Binaural MFCC-CNN Sound Quality Model of High-Speed Train

14Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

The high-speed train (HST) is one of the most important transport tools in China, and the sound quality of its interior noise affects passengers’ comfort. This paper proposes a HST sound quality model. The model combines Mel-scale frequency cepstral coefficients (MFCCs), the most popular spectral-based input parameter in deep learning models, with convolutional neural networks (CNNs) to evaluate the sound quality of HSTs. Meanwhile, two input channels are applied to simulate binaural hearing so that the different sound signals can be processed separately. The binaural MFCC-CNN model achieves an accuracy of 96.2% and outperforms the traditional shallow neural network model because it considers the time-varying characteristics of noise. The MFCC features are capable of capturing the characteristics of noise and improving the accuracy of sound quality evaluations. Besides, the results suggest that the time and level differences in sound signals are important factors affecting sound quality at low annoyance levels. The proposed model is expected to optimize the comfort of the interior acoustic environment of HSTs.

Cite

CITATION STYLE

APA

Ruan, P., Zheng, X., Qiu, Y., & Hao, Z. (2022). A Binaural MFCC-CNN Sound Quality Model of High-Speed Train. Applied Sciences (Switzerland), 12(23). https://doi.org/10.3390/app122312151

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free