This paper presents evaluation results of a method for tracking speakers in seminars from multiple cameras. First, 2D human tracking and detection is done for each view. Then, 2D locations are converted to 3D based on the calibration parameters. Finally, cues from multiple cameras are integrated in a incremental way to refine the trajectories. We have developed two multi-view integration methods, which are evaluated and compared on the CHIL speaker tracking test set. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Wu, B., Singh, V. K., Nevatia, R., & Chu, C. W. (2007). Speaker tracking in seminars by human body detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4122 LNCS, pp. 119–126). Springer Verlag. https://doi.org/10.1007/978-3-540-69568-4_8
Mendeley helps you to discover research relevant for your work.