Content-Based Audio Classification and Retrieval Using Segmentation, Feature Extraction and Neural Network Approach

15Citations
Citations of this article
29Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The volume of audio data is increasing tremendously daily on public networks like Internet. This increases the difficulty in accessing those audio data. Hence, there is a need of efficient indexing and annotation mechanisms. Non-stationarity and discontinuity present in the audio signal rise the difficulty in segmentation and classification of audio signals. The other challenging task is to extract and select the optimal features in audio signal. The application areas of audio classification and retrieval system include speaker recognition, gender classification, music genre classification, environment sound classification, etc. This paper proposes a machine learning- and neural network-based approach which performs audio pre-processing, segmentation, feature extraction, classification and retrieval of audio signal from the dataset. We have proposed novel approach of classification and retrieval using FPNN by combining fuzzy logic and PNN characteristics. We found that FPNN classifier gives better accuracy, F1-score and Kappa coefficient values compared to SVM, k-NN and PNN classifiers.

Cite

CITATION STYLE

APA

Patil, N. M., & Nemade, M. U. (2019). Content-Based Audio Classification and Retrieval Using Segmentation, Feature Extraction and Neural Network Approach. In Advances in Intelligent Systems and Computing (Vol. 924, pp. 263–281). Springer Verlag. https://doi.org/10.1007/978-981-13-6861-5_23

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free