Efficient inter mode prediction based on model selection and rate feedback for H.264/AVC

Kuan I. Lee, An Chao Tsai, Jhing Fa Wang, Jar-Ferr Yang

Research output: Contribution to journal, Article

5 Citations (Scopus)

Abstract

H.264/AVC is a standard developed for various low-complexity video applications and high-definition television. To improve coding performance, H.264/AVC may optionally adopt the rate-distortion optimization (RDO) method to find the best encoding mode among various inter and intra modes. However, the exhaustive RDO search among different modes increases the H.264/AVC encoder complexity and limits its application. In this paper, we propose an inter mode prediction algorithm for P slices based on spatial and temporal consistency analysis to reduce the complexity of the RDO computation. We apply the stochastic method to analyze the spatial consistency and use rate information for temporal consistency. The experimental results show a 0.03 peak signal-to-noise ratio loss, a 0.87% bit rate increase, and a 58.39% encoding time reduction on average.

Original language: English
Article number: 5734814
Pages (from-to): 708-716
Number of pages: 9
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Volume: 21
Issue number: 6
DOI: 10.1109/TCSVT.2011.2129730
Publication status: Published - 2011 Jun 1

Fingerprint

Feedback
High definition television
Signal to noise ratio

All Science Journal Classification (ASJC) codes

  • Media Technology
  • Electrical and Electronic Engineering

Cite this

@article{b4ccaaf4b3204f21836805117bc71efd,
title = "Efficient inter mode prediction based on model selection and rate feedback for H.264/AVC",
abstract = "H.264/AVC is a standard developed for various low-complexity video applications and high-definition television. To improve coding performance, H.264/AVC may optionally adopt the rate-distortion optimization (RDO) method to find the best encoding mode among various inter and intra modes. However, the exhaustive RDO search among different modes increases the H.264/AVC encoder complexity and limits its application. In this paper, we propose an inter mode prediction algorithm for P slices based on spatial and temporal consistency analysis to reduce the complexity of the RDO computation. We apply the stochastic method to analyze the spatial consistency and use rate information for temporal consistency. The experimental results show a 0.03 peak signal-to-noise ratio loss, a 0.87{\%} bit rate increase, and a 58.39{\%} encoding time reduction on average.",
author = "Lee, {Kuan I.} and Tsai, {An Chao} and Wang, {Jhing Fa} and Yang, Jar-Ferr",
year = "2011",
month = "6",
day = "1",
doi = "10.1109/TCSVT.2011.2129730",
language = "English",
volume = "21",
pages = "708--716",
journal = "IEEE Transactions on Circuits and Systems for Video Technology",
issn = "1051-8215",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "6",
}

Efficient inter mode prediction based on model selection and rate feedback for H.264/AVC. / Lee, Kuan I.; Tsai, An Chao; Wang, Jhing Fa; Yang, Jar-Ferr.

In: IEEE Transactions on Circuits and Systems for Video Technology, Vol. 21, No. 6, 5734814, 01.06.2011, p. 708-716.

TY - JOUR
T1 - Efficient inter mode prediction based on model selection and rate feedback for H.264/AVC
AU - Lee, Kuan I.
AU - Tsai, An Chao
AU - Wang, Jhing Fa
AU - Yang, Jar-Ferr
PY - 2011/6/1
Y1 - 2011/6/1
N2 - H.264/AVC is a standard developed for various low-complexity video applications and high-definition television. To improve coding performance, H.264/AVC may optionally adopt the rate-distortion optimization (RDO) method to find the best encoding mode among various inter and intra modes. However, the exhaustive RDO search among different modes increases the H.264/AVC encoder complexity and limits its application. In this paper, we propose an inter mode prediction algorithm for P slices based on spatial and temporal consistency analysis to reduce the complexity of the RDO computation. We apply the stochastic method to analyze the spatial consistency and use rate information for temporal consistency. The experimental results show a 0.03 peak signal-to-noise ratio loss, a 0.87% bit rate increase, and a 58.39% encoding time reduction on average.
AB - H.264/AVC is a standard developed for various low-complexity video applications and high-definition television. To improve coding performance, H.264/AVC may optionally adopt the rate-distortion optimization (RDO) method to find the best encoding mode among various inter and intra modes. However, the exhaustive RDO search among different modes increases the H.264/AVC encoder complexity and limits its application. In this paper, we propose an inter mode prediction algorithm for P slices based on spatial and temporal consistency analysis to reduce the complexity of the RDO computation. We apply the stochastic method to analyze the spatial consistency and use rate information for temporal consistency. The experimental results show a 0.03 peak signal-to-noise ratio loss, a 0.87% bit rate increase, and a 58.39% encoding time reduction on average.
UR - http://www.scopus.com/inward/record.url?scp=79957985689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79957985689&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2011.2129730
DO - 10.1109/TCSVT.2011.2129730
M3 - Article
VL - 21
SP - 708
EP - 716
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
SN - 1051-8215
IS - 6
M1 - 5734814
ER -