Modality-Balanced Embedding for Video Retrieval

12Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Video search has become the main routine for users to discover videos relevant to a text query on large short-video sharing platforms. During training a query-video bi-encoder model using online search logs,\textit we identify a modality bias phenomenon that the video encoder almost entirely relies on text matching, neglecting other modalities of the videos such as vision, audio, \etc This modality imbalance results from a) modality gap: the relevance between a query and a video text is much easier to learn as the query is also a piece of text, with the same modality as the video text; b) data bias: most training samples can be solved solely by text matching. Here we share our practices to improve the first retrieval stage including our solution for the modality imbalance issue. We propose \modelname (short for Modality Balanced Video Retrieval) with two key components: manually generated modality-shuffled (MS) samples and a dynamic margin (DM) based on visual relevance. They can encourage the video encoder to pay balanced attentions to each modality. Through extensive experiments on a real world dataset, we show empirically that our method is both effective and efficient in solving modality bias problem. We have also deployed our ∼\modelname∼ in a large video platform and observed statistically significant boost over a highly optimized baseline in an A/B test and manual GSB evaluations.

Cite

CITATION STYLE

APA

Wang, X., Ke, B., Li, X., Liu, F., Zhang, M., Liang, X., & Xiao, Q. (2022). Modality-Balanced Embedding for Video Retrieval. In SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 2578–2582). Association for Computing Machinery, Inc. https://doi.org/10.1145/3477495.3531899

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free