TY - JOUR
T1 - Efficient inter mode prediction based on model selection and rate feedback for H.264/AVC
AU - Lee, Kuan I.
AU - Tsai, An Chao
AU - Wang, Jhing Fa
AU - Yang, Jar Ferr
N1 - Funding Information:
Manuscript received April 19, 2009; revised September 17, 2009, December 30, 2009, April 18, 2010 and July 30, 2010; accepted October 24, 2010. Date of publication March 17, 2011; date of current version June 3, 2011. This work was supported in part by the National Science Council, Taiwan, under Grant NSC 99-2218-E-006-001. This paper was recommended by Associate Editor G. Wen.
PY - 2011/6
Y1 - 2011/6
N2 - H.264/AVC is a standard developed for various low-complexity video applications and high-definition television. To improve coding performance, H.264/AVC may optionally adopt the rate-distortion optimization (RDO) method to find the best encoding mode among various inter and intra modes. However, the exhaustive RDO search among different modes increases the H.264/AVC encoder complexity and limits its application. In this paper, we propose an inter mode prediction algorithm for P slices based on spatial and temporal consistency analysis to reduce the complexity of the RDO computation. We apply the stochastic method to analyze the spatial consistency and use rate information for temporal consistency. The experimental results show a 0.03 dB peak signal-to-noise ratio loss, a 0.87% bit rate increase, and a 58.39% encoding time reduction on average.
AB - H.264/AVC is a standard developed for various low-complexity video applications and high-definition television. To improve coding performance, H.264/AVC may optionally adopt the rate-distortion optimization (RDO) method to find the best encoding mode among various inter and intra modes. However, the exhaustive RDO search among different modes increases the H.264/AVC encoder complexity and limits its application. In this paper, we propose an inter mode prediction algorithm for P slices based on spatial and temporal consistency analysis to reduce the complexity of the RDO computation. We apply the stochastic method to analyze the spatial consistency and use rate information for temporal consistency. The experimental results show a 0.03 dB peak signal-to-noise ratio loss, a 0.87% bit rate increase, and a 58.39% encoding time reduction on average.
UR - http://www.scopus.com/inward/record.url?scp=79957985689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79957985689&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2011.2129730
DO - 10.1109/TCSVT.2011.2129730
M3 - Article
AN - SCOPUS:79957985689
SN - 1051-8215
VL - 21
SP - 708
EP - 716
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 6
M1 - 5734814
ER -