Audio-visual identity verification and robustness to imposture

Walid Karam; Chafic Mokbel; Hanna Greige; Gerard Chollet

Conference ProceedingsOPEN ACCESS

Audio-visual identity verification and robustness to imposture

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5558 LNCS 796-805

DOI: 10.1007/978-3-642-01793-3_81

2Citations

7Readers

Abstract

The robustness of talking-face identity verification (IV) systems is best evaluated by monitoring their behavior under impostor attacks. We propose a scenario where the impostor uses a still face picture and a sample of speech of the genuine client to transform his/her speech and visual appearance into that of the target client. We propose MixTrans, an original text-independent technique for voice transformation in the cepstral domain, which allows a transformed audio signal to be estimated and reconstructed in the temporal domain. We also propose a face transformation technique that allows a frontal face image of a client to be animated, using principal warps to deform defined MPEG-4 facial feature points based on determined facial animation parameters. The robustness of the talking-face IV system is evaluated under these attacks. Results on the BANCA talking-face database clearly show that such attacks represent a serious challenge and a security threat to IV systems. © Springer-Verlag Berlin Heidelberg 2009.

Author supplied keywords

Cite

CITATION STYLE

APA

Karam, W., Mokbel, C., Greige, H., & Chollet, G. (2009). Audio-visual identity verification and robustness to imposture. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5558 LNCS, pp. 796–805). https://doi.org/10.1007/978-3-642-01793-3_81

Audio-visual identity verification and robustness to imposture

Abstract

Author supplied keywords

Cite

Register to see more suggestions