Multi-party interactive live video streaming applications have attracted millions of daily active users and are anticipated a blooming market in the next few years. A fundamental research problem in live streaming is bitrate coordination, which selects the proper upload/download bitrate for multiple participants in the system to maximize the users’ quality of experience (QoE). The existing bitrate adaptation methods fail to achieve optimal performance across a broad set of network conditions with conflict QoE objectives, and lack of the ability of adaptation to dynamic user requirements. In this paper, we proposed a novel meta-reinforcement learning based solution called MetaLive for multi-party live video streaming bitrate adaptation. The proposed framework formulates the bitrate coordination problem as a reinforcement learning task, and introduces a meta-training method to train an agent to learn to carry out various complex tasks from historical experience, and generate bitrate adaptation policies to maximize expected QoEs in diverse environments. We implement MetaLive based on an emulation platform, and use real-world network traces to evaluate its performance. Extensive experiments show that MetaLive achieves the best comprehensive QoE compared with the state-of-the-arts in a variety of network scenarios.
CITATION STYLE
Yang, Y., Li, X., Xu, Y., Li, W., Hu, J., Xu, T., … Lu, S. (2023). MetaLive: Meta-Reinforcement Learning Based Collective Bitrate Adaptation for Multi-Party Live Streaming. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 14100 LNCS, pp. 65–80). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-39698-4_5
Mendeley helps you to discover research relevant for your work.