Multimodal Fusion of Voice and Gesture Data for UAV Control


Abstract

To enable unmanned aerial vehicle (UAV) operators to convey commands to a swarm of UAVs efficiently and intuitively, we propose the use of natural, human-centric input modalities such as voice and gestures. This paper addresses the fusion of voice and gesture data, captured through a microphone and a Leap Motion controller, respectively, to control UAV swarms. The experimental results are presented, and the achieved recognition accuracy is analyzed. Finally, a human factors ergonomics test combined with a questionnaire was conducted to verify the method's validity.
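The abstract describes fusing recognition results from two modalities (voice via microphone, gestures via Leap Motion). A common way to combine such inputs is decision-level fusion, where each recognizer outputs per-command confidence scores that are merged by a weighted sum. The sketch below illustrates this idea only; the command names, weighting scheme, and `fuse` function are illustrative assumptions, not the authors' actual algorithm.

```python
# Illustrative decision-level fusion of voice and gesture recognition
# scores (hypothetical command set and weights; not the paper's method).

COMMANDS = ["takeoff", "land", "hover", "formation"]

def fuse(voice_scores, gesture_scores, w_voice=0.5):
    """Combine per-command confidences from the two modalities.

    Missing commands default to 0.0 confidence. Returns the winning
    command and the full fused score table.
    """
    fused = {
        cmd: w_voice * voice_scores.get(cmd, 0.0)
             + (1.0 - w_voice) * gesture_scores.get(cmd, 0.0)
        for cmd in COMMANDS
    }
    return max(fused, key=fused.get), fused

# Example: the voice recognizer favors "takeoff", the gesture
# recognizer is split between "takeoff" and "hover".
voice = {"takeoff": 0.7, "land": 0.2}
gesture = {"takeoff": 0.4, "hover": 0.5}
command, scores = fuse(voice, gesture)
# command → "takeoff" (fused score 0.55 beats "hover" at 0.25)
```

In practice the weight could be tuned per command class, since some commands are easier to express by voice (e.g., numeric parameters) and others by gesture (e.g., directions).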

Citation (APA)

Xiang, X., Tan, Q., Zhou, H., Tang, D., & Lai, J. (2022). Multimodal Fusion of Voice and Gesture Data for UAV Control. Drones, 6(8). https://doi.org/10.3390/drones6080201
