Overview of intelligent video coding: from model-based to learning-based approaches

Abstract

Intelligent video coding (IVC), which dates back to the late 1980s and the idea of encoding videos using knowledge and semantics, encompasses compact representation models for visual content and methods that enable structural, detailed descriptions of visual information at different granularity levels (i.e., block, mesh, region, and object) and in different application areas. It aims to support and facilitate a wide range of applications, such as visual media coding, content broadcasting, and ubiquitous multimedia computing. We present a high-level overview of IVC technology, from model-based coding (MBC) to learning-based coding (LBC). MBC mainly adopts manually designed coding schemes that explicitly decompose the videos to be coded into blocks or semantic components. Thanks to emerging deep learning technologies such as neural networks and generative models, LBC has become a rising topic in the coding area. In this paper, we first review the classical MBC approaches, followed by the LBC approaches for image and video data. We also discuss and review our recent attempts at neural coding approaches, which are inspiring for both academic research and industrial implementation. Some critical yet less studied issues are discussed at the end of this paper.
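As a rough illustration of the learning-based coding pipeline mentioned above, the sketch below shows a toy learned image codec in Python (PyTorch): an analysis transform maps the image to a compact latent, the latent is quantized as a stand-in for entropy coding, and a synthesis transform reconstructs the image. The class name, layer sizes, and overall structure are illustrative assumptions, not the architecture studied in the paper.

import torch
import torch.nn as nn

class TinyLearnedCodec(nn.Module):
    """Toy analysis/synthesis transform pair; hypothetical, for illustration only."""
    def __init__(self, channels: int = 64):
        super().__init__()
        # Analysis transform: image -> compact latent representation.
        self.analysis = nn.Sequential(
            nn.Conv2d(3, channels, kernel_size=5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(channels, channels, kernel_size=5, stride=2, padding=2),
        )
        # Synthesis transform: quantized latent -> reconstructed image.
        self.synthesis = nn.Sequential(
            nn.ConvTranspose2d(channels, channels, kernel_size=5, stride=2,
                               padding=2, output_padding=1), nn.ReLU(),
            nn.ConvTranspose2d(channels, 3, kernel_size=5, stride=2,
                               padding=2, output_padding=1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        y = self.analysis(x)
        # Hard rounding stands in for quantization; practical learned codecs
        # pair this with an entropy model that estimates the bit rate and use
        # a differentiable proxy (e.g., additive noise) during training.
        y_hat = torch.round(y)
        return self.synthesis(y_hat)

if __name__ == "__main__":
    codec = TinyLearnedCodec()
    image = torch.rand(1, 3, 64, 64)    # dummy RGB image in [0, 1]
    reconstruction = codec(image)
    print(reconstruction.shape)         # torch.Size([1, 3, 64, 64])

In a full rate-distortion-optimized learned codec, the training loss would combine a distortion term on the reconstruction with an estimated bit rate from the entropy model; this sketch omits both for brevity.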

Cite

APA: Ma, S., Gao, J., Wang, R., Chang, J., Mao, Q., Huang, Z., & Jia, C. (2023). Overview of intelligent video coding: from model-based to learning-based approaches. Visual Intelligence. Springer. https://doi.org/10.1007/s44267-023-00018-7
