SpeakerSense: Energy efficient unobtrusive speaker identification on mobile phones

91Citations
Citations of this article
98Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Automatically identifying the person you are talking with using continuous audio sensing has the potential to enable many pervasive computing applications from memory assistance to annotating life logging data. However, a number of challenges, including energy efficiency and training data acquisition, must be addressed before unobtrusive audio sensing is practical on mobile devices. We built SpeakerSense, a speaker identification prototype that uses a heterogeneous multi-processor hardware architecture that splits computation between a low power processor and the phone's application processor to enable continuous background sensing with minimal power requirements. Using SpeakerSense, we benchmarked several system parameters (sampling rate, GMM complexity, smoothing window size, and amount of training data needed) to identify thresholds that balance computation cost with performance. We also investigated channel compensation methods that make it feasible to acquire training data from phone calls and an automatic segmentation method for training speaker models based on one-to-one conversations. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Lu, H., Bernheim Brush, A. J., Priyantha, B., Karlson, A. K., & Liu, J. (2011). SpeakerSense: Energy efficient unobtrusive speaker identification on mobile phones. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6696 LNCS, pp. 188–205). https://doi.org/10.1007/978-3-642-21726-5_12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free