This paper introduces a manually generated synchronization ground truth, accurate to the level of the audio sample, for the Jiku Mobile Video Dataset, which contains hundreds of videos recorded by mobile users at different events with drama, dancing, and singing performances. The ground truth is intended to encourage researchers to evaluate their audio, video, or multimodal synchronization methods on a publicly available dataset, to facilitate benchmarking, and to ease the development of mobile video processing methods, such as audio and video quality enhancement, analytics, and summary generation, that depend on an accurately synchronized dataset.
CITATION STYLE
Guggenberger, M., Lux, M., & Böszörmenyi, L. (2015). A synchronization ground truth for the Jiku mobile video dataset. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8936, pp. 87–98). Springer Verlag. https://doi.org/10.1007/978-3-319-14442-9_8