Deep reinforced bitrate ladders for adaptive video streaming

24Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In the typical transcoding pipeline for adaptive video streaming, raw videos are pre-chunked and pre-encoded according to a set of resolution-bitrate or resolution-quality pairs on the server-side, where the pair is often named as bitrate ladder. Different from existing heuristics, we argue that a good bitrate ladder should be optimized by considering video content features, network capacity, and storage costs on the cloud. We propose DeepLadder, a per-chunk optimization scheme which adopts state-of-The-Art deep reinforcement learning (DRL) method to optimize the bitrate ladder w.r.t the above concerns. Technically, DeepLadder selects the proper setting for each video resolution autoregressively. We use over 8,000 video chunks, measure over 1,000,000 perceptual video qualities, collect real-world network traces for more than 50 hours, and invent faithful virtual environments to help train DeepLadder efficiently. Across a series of comprehensive experiments on both Constant Bitrate (CBR) and Variable Bitrate (VBR)-encoded videos, we demonstrate significant improvements in average video quality bandwidth utilization, and storage overhead in comparison to prior work as well as the ability to be deployed in the real-world transcoding framework.

Cite

CITATION STYLE

APA

Huang, T., Zhang, R. X., & Sun, L. (2021). Deep reinforced bitrate ladders for adaptive video streaming. In NOSSDAV 2021 - Proceedings of the 2021 Workshop on Network and Operating System Support for Digital Audio and Video, Part of MMSys 2021 (pp. 67–73). Association for Computing Machinery, Inc. https://doi.org/10.1145/3458306.3458873

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free