Vocal Data Assesment To Envision Distinctive Features of An Individual

  • et al.
N/ACitations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

There is a lot of audio data generated on a day to day bases, which goes to waste without undergoing due processing. If we process this data, it can be beneficial for a multitude of purposes. Vocal data is unstructured, which makes it even harder for processing. This data has to undergo thorough pre-processing to convert it to a machine-understandable form. We aim to perform analysis of human voice to extract meaningful data and make a prediction of their age, gender, and accent. The developed system uses the Mel-frequency Cepstral Coefficient (MFCC), zero-cross-rate(ZCR), chroma_stft, spectral_centroid, spectral_bandwidth, and spectral_rolloff algorithms as a tool for Feature Extraction. The algorithms used for making inferences are support vector machine (SVM), K-nearest neighbors, and SVR. The work can be extended even further by combining video data with the audio data for analysis. The system can also be improved by increasing the number of languages it can detect.

Cite

CITATION STYLE

APA

Garg*, A., Agrawal, K., & Akilandeshwari, Mrs. P. (2020). Vocal Data Assesment To Envision Distinctive Features of An Individual. International Journal of Innovative Technology and Exploring Engineering, 9(6), 1335–1338. https://doi.org/10.35940/ijitee.f3771.049620

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free