Early CTU Termination and Three-steps Mode Decision Method for Fast Versatile Video Coding

Sang Quang Nguyen; Tien Huu Vu; Duong Dinh Trieu; Minh Dinh Bao; Minh Do Ngoc; Xiem Hoang Van

doi:10.25073/2588-1086/vnucsce.375

Sang Quang Nguyen, Mr., Tien Huu Vu, Dr., Duong Dinh Trieu, Dr., Minh Dinh Bao, Mr., Minh Do Ngoc, Mr., Xiem Hoang Van

PDF

Published Dec 16, 2022

DOI: https://doi.org/10.25073/2588-1086/vnucsce.375

How to Cite

NGUYEN, Sang Quang et al. Early CTU Termination and Three-steps Mode Decision Method for Fast Versatile Video Coding. VNU Journal of Science: Computer Science and Communication Engineering, [S.l.], v. 39, n. 2, dec. 2022. ISSN 2588-1086. Available at: <//jcsce.vnu.edu.vn/index.php/jcsce/article/view/375>. Date accessed: 15 july 2025. doi: https://doi.org/10.25073/2588-1086/vnucsce.375.

ABNT APA BibTeX CBE EndNote - EndNote format (Macintosh & Windows) MLA ProCite - RIS format (Macintosh & Windows) RefWorks Reference Manager - RIS format (Windows only) Turabian

Issue

Vol 39 No 2

Section

Original Articles

Abstract

Versatile Video Coding (VVC) has been recently becoming popular in coding video data due to its compression efficiency. To reach this performance, Joint Video Experts Team (JVET) has introduced a number of coding improvement techniques to VVC model. Among them, VVC Intra coding proposed a new concept of quad-tree nested multi-type tree (QTMT) and extended the predicted modes with up to 67 options. As a result, the complexity of the VVC Intra encoding also greatly increases. To make VVC Intra coding more feasible in real time applications, we propose in this paper a novel fast mode decision method together with a deep learning based fast QTMT. At the first stage, we use a learned convolutional neural network (CNN) to predict the coding unit map and then fed into the VVC encoder to early terminate the block partitioning process. After that, we design a statistical model to predict a list of most probable modes (MPM) for each selected Coding Unit (CU) size. Finally, we introduce a novel three-steps mode decision algorithm to estimate the optimal directional mode without sacrificing the compression performance. The proposed early CU splitting and fast intra prediction are integrated into the latest VTM reference software. Experimental results show that the proposed method can save 50.2% encoding time with a negligible BD-Rate increase.

Keywords: VVC Intra coding, Early-Terminate Hierarchical, CNN, Most probable mode (MPM).

Article Sidebar

Article Details

Main Article Content

Abstract