用户名: 密码: 验证码:
H.264/AVC中帧内编码和帧间编码技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
在过去的十几年中,国际标准化组织制定了一系列的国际视频编码标准并广泛应用到各种领域。其中,ITU-T制定了满足实时视频通信应用的标准,例如H.261、H.263、H.263+、H.263++;ISO制定了满足视频存储、广播视频和视频流需要的MPEG标准,包括MPEG-1、MPEG-2及MPEG-4等。而H.264/AVC是ITU-T和ISO联合推出的新标准,采用了近几年视频编码方面的先进技术,以较高编码效率和网络友好性成为新一代国际视频编码标准。
     H.264/AVC设计的最初目标是在相同解码质量下,能够提供比MPEG-4和H.263高一倍的压缩性能。无论从客观还是主观度量,H.264都可以提供比现有的视频压缩编码标准高的多的解码质量。
     为了实现高效压缩,H.264/AVC中采用了大量的新技术。这些技术包括:帧内预测;可变块大小的运动补偿;1/4像素精度的运动矢量;整数4×4变换;切换条带;上下文自适应二进制算术编码。采用了这些先进的编码技术后,压缩性能和解码质量都获得了很大的提高。一些研究的实验数据表明,H.264/AVC在信噪比,压缩效率和视觉效果方面,都远远好于现有的其它视频编码标准。
     H.264/AVC中采用的这些先进编码技术既增加了基本模块的复杂度,也成倍增加算法的复杂度,这就限制了H.264的应用,尤其是在实时视频通信和一些资源或功耗受限的嵌入式应用中。基于此,本文进行了一下研究:
     首先,为了解决H.264/AVC帧内预测模式多,计算复杂和预测不精确的问题,本文提出了一种帧内预测算法。该算法利用边缘检测算子和空间相邻块的相关性快速选出所需的候选预测模式,对于每种候选预测模式又改进了预测算法。试验结果表明:新的预测算法可以提高帧内预测编码所需要的时间。
     其次,H.264/AVC采用了多参考帧运动估计来提高编码效率,增加的计算量与参考帧数目成正比。为了避免搜索多余的参考帧,本文提出了一种低复杂度参考帧确定算法来快速确定当前宏块所需的参考帧。试验结果表明,本算法在保持率失真性能的前提下有效地减少了多参考帧预测的计算复杂度并节省了时间。
In the past about ten years, a series of international video coding standards are developed and widely used in a variety of domains. Some standards for real-time telecommunication applications have been established by ITU-T, including H.261, H.263, H.263+, H.263++. Others for video storage, broadcast and stream application by ISO/IEC, such as MPEG-1/2/4.H.264/AVC is the newest standard approved by the ITU-T and ISO/IEC organizations, which represents a number of advances in video coding technology and becomes the newest international video coding standard because of both high coding efficiency and network friendliness.
     It can provide both objective and subjective image quality superior to existing standards. The main goals of this standardization are to develop a simple and straightforward video coding design with enhanced compression performance. Compared to MPEG-4 advanced simple profile, up to 50% of bit-rate reduction can be achieved.
     Some new techniques, such as spatial prediction in intra coding, adaptive block size motion compensation with small block sizes, Quarter-pel accurate motion compensation, 4x4 integer transformation, synchronization/switching pictures, multiples reference pictures and context adaptive binary arithmetic coding (CABAC), are used in this standard.With the use of these techniques, both compression efficiency and encoding quality are greatly improved. The testing results show that H.264 can greatly outperform existing video coding standards in PSNR, visual quality and compression efficiency.
     The new features of H.264/AVC increase not only the complexity of coding basic modules, but also the one of algorithms by times. That makes it very difficult for practical applications such as in real time video communication application or in some embedded system. So major works are as follows:
     First, in order to solve the problem that there are many prediction modes, the complexity of computation and which can't predict pixels located at the boundary of moving objects accurately in H.264/AVC intra prediction, an intra prediction algorithm is presented. The test result indicates that the adjacent pixels-based intra prediction algorithm can improve intra coding efficiency remarkably.
     Second, H.264/AVC uses multi-reference frames to enhance the coding efficiency. the quantity of increased computation is linearly proportional to the number of reference frames. On the study of the multi-macro-block motion estimation of H.264 standard, a low-complexity and fast multi-frame selection method is proposed. The experimental results show that the speed of the proposed method is over two times faster than that of the original scheme adopted in JVT reference software JM12.2 and JM10.2 while keeping the similar video quality and bit-rate.
引文
[1]ITU-T Recommendation H.263,Video coding for low bit rate communication,May,1996.
    [2]ITU-T Draft for H.263++.Annexes U,V,and W to Recommendation H.263.2000
    [3]ISO/IEC 14496-2,Amendment 1,Information technology-coding of audio-visual objects-Part 2:Visual,2001.
    [4]Jorn Ostermann,Jan Bormans,Peter List,Detlev Marpe,Matthias Narroschke,Fernando Pereira,Thomas Stockhammer,and Thomas Wedi,Video coding with H.264/AVC:Tools,Performance,and Complexity,IEEE CIRCUITS AND SYSTEMS MAGAZINE,2004,7-28.
    [5]Yao Wang,Jorn Ostermann,Ya-Qin Zhang,Video processing and communications,北京,Tsinghua University Press,2003
    [6]M.Tekalp,"Digital Video Processing" Englewood Cliffs:Printice Hall,1995.
    [7]Iain E.G.Richardson,Video Codec Design,John Wiley & Sons,2002.
    [8]余兆明,数字电视和高清晰度电视,北京:人民邮电出版社,1996
    [9]"Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification(ITU-T Rec.H.264 | ISO/IEC 14496-10 ARC)",Joint Video Team(JVT)of ISO/IEC MPEG and ITU-T VCEG,JVT-G050,March 2003
    [10]M.Ravasi,M.Mattavelli and C.Clerc,"A Computational Complexity Comparison of MPEG4 and JVT Codecs",Jiont Video Team(JVT)of ISO/IEC MPEG&ITU-T VCEG,JVT-D 153rl-L,July 2002
    [11]Detlev Marpe,Heiko Schwarz,Thomas Wiegand.Context-Based Adaptive Binary Arithmetic Coding in the H.264/AVC Video Compression Standard.IEEE Trans.Circuits Syst.Video Technol.2003,vol.13(7):598-603
    [12]M.C.Hong,H.S.Hahn.A Loop/Post Filter to Suppress Blocking and Ringing Artifacts for H.26L Video Codec.In Proc.2002,IEEE Int.Conf.Image Processing.2002 vol.1:940-947
    [13]VideoLocus Inc.AVC Real-Time SD Encoder Demo.ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 4th Meeting,JVT-D023.Klagenfurt,Austria,2002
    [14]Gary Sullivan,JVT IPR Status Report.ISO/IEC/JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 3rd Meeting,JVT-C110.Fairfax,Virginia,USA,2002
    [15]Girod B,Farber N.Feedback-Based Error Control for Mobile Video Transmission.Proc.Of IEEE.1999,87(10):1707-1723
    [16]J.Barrileaux,R.Hinkle et at."Efficient Vector Quantization for Color Image Encoding",Pro.ICASSP 1987:740-743
    [17]T.Wiegand,G.Sullivan,G.Bjontegaard,A.Luthra,"Overview of the H.264/AVC Video Coding Standard ",IEEE Trans.Circuits Syst.Video Technol,vol.13,pp.560-576,July 2003.
    [18]Iain E.G.Richardson,H.264 and MPEG-4 Video Compression,Wiley,West Sussex,England,2003.
    [19]Malvar Henrique S.Hallapuro Antti,Karczewicz.Marta,Kerofsky Louis,"Low-complexity Transform and Quantization in H.264/AVC ",IEEE Trans.Circuits Syst.Video Teclmol,vol.13.pp.598-603,July 2003
    [20]K.R.Rao and P.Yip,Discrete Cosine Transform,Academic Press,1990.
    [21]Mathias Wien,"Variable Block-Size Transforms for H.264/AVC",IEEE Trans.Circuits Syst.Video Technol,vol.13.pp.604-613,July 2003.
    [22]T.Wedi,"Multi-Frame Motion-Compensated Video Compression for the Digital Set-Top Box",ICIP 2002,Sept.2002
    [23]G.Bjontegaard and K.Lillevold,Context-adaptive VLC coding of coefficients,JVT document JVT-C028,Fairfax,May 2002.
    [24]Detlev Marpe,Heiko Schwarz,Thomas Wiegand,Context Based Adaptive Binary Arithmetic Coding in the H.264/AVC Video Compression Standard[J],IEEE Trans Circuits Syst,Video Technol,2003,13(7):560-576
    [25]D.Huffman,A method for the construction of minimum redundancy codes,Proc.of the IRE,40,pp.1098-1101,1952.
    [26]J.Rissanen and G.G.Landgon,Jr.,"Arithmetic coding," IBM J.Res.Dev.,vol.23,no.2,pp.149-162,Mar.1979.
    [27]S.W.Golomb,Run-length encoding,IEEE Trans.on Inf.Theory,IT-12,1966:399-401.
    [28]S.Wenger,"H.264/AVC over IP",IEEE Trans.Circuits Syst.Video Technol, vol.13,pp.645-656,July 2003
    [29]T.Stockhammer,M.M.Hannuksela,T.Wigand,"H.264/AVC in Wireless Environments",IEEE Trans.Circuits Syst.Video Technol,vol.13,pp.657-673,July 2003
    [30]Cliff Reader.History of MPEG Video Compression.2000:87-92
    [31]Draft ITU-T Recommendation H.263 version 2.ITU-T,1997-09
    [32]向友君,郭宝龙。基于起点预测的快速运动估计算法。西安电子科技大学学报。2003,30(3):386-390。
    [33]Gary Sullivan JVT IPR Status Report.ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q6,JVT 3rd Meeting,JVT-C110,Fairfax,Virginia,USA,2002
    [34]Joch,Anthony,Kossentini.Faouzi,Hallapuro.Antti,"H.264/AVC Baseline Profile Decoder Complexity Analysis",IEEE Trans.Circuits Syst.Video Technol,vol.13,pp.704-716,July 2003.
    [35]Fukunaga S,Nakai T,Inoue H.Error Resilient Video Coding by Dynamic Replacing of Reference Pictures.Proc.of IEEE Global Telecommunications Conference New York USA,1996,3(11):1503-1508
    [36]Sullivan G..J,Wiegand T.Rate-Distortion Optimization for Video Compression.IEEE Signal Processing Mag.1998,15(11):74-90
    [37]Choi J,Park D.A Stable Feedback Control of the Buffer State Using the Controlled Lagrange Multiplier Method.IEEE Trans.Image Processing.1994,3(9):546-588
    [38]Thomas wiegand.Joint Final Committee Draft(JFCD)of Joint Video Specification(ITU-T Rec.H.264|ISO/IEC 14496-10 AVC).ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 4th Meeting,JVT-D157.Klagenfurt,Austria,2002
    [39]尹宝才,孙磊,孔德慧,季鹏飞,基于相邻像素的帧内预测算法。北京工业大学学报(自然科学版)。
    [40]Greg Conklin.New Intra Prediction Modes.ITU-T telecommunications Standardization Sector STUDY GROUP 16 Question 6,VPEG 14th Meeting,VCEG-N54 Santa Barbara,24-27 September,2001
    [41]Siu-Leong Yu,Christos Chrysafis.New Intra Prediction Using Intra-Macroblock Motion Compensation.ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 3rd Meeting,JVT-C151.Fairfax,Virginia,USA,6-10 March,2002
    [42]Zhou Minhua.Scalable Intra Prediction.ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 3rd Meeting,JVT-D026.Klagenfurt,Austria,22-26 July,2002
    [43]Zhang Nan,Yin Baocai,Kong Dehui,Yue Wenying,Spatila Prediction Based Intra-Coding,IEEE International Conference on Multimedia and Expo(ICME 2004),Taiwan,2004,TP1-4:97-100
    [44]Peng Zhang,Debin Zhao,Siwei Ma,Yan Lu,Wen Gao.Multiple Modes Intra-Prediction in Intra Coding.IEEE International Conference on Multimedia and Expo(ICME 2004),Taipei,Taiwan,2004,TP1-4:419-422
    [45]Greg Conklin,Low Complexity Diagonal Mode for Intra Prediction.ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 4th Meeting,JVT-D059.Klagenfurt,Austria,22-26 July,2002
    [46]Zhou Minhua.More Results on Intra Prediction with Simplified Prediction Modes.ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 5th Meeting,JVT-E040.Geneva,Switzerland,9-17 October,2002
    [47]Woo-Shik Kim,Dae-Sung Cho,Kyung-Wook Kim.Intra Prediction Enhancements.ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 5th Meeting,JVT-E051.Geneva,Switzerland,9-17 October,2002
    [48]Greg Conklin.Reducing Pixel Dependencies for Intra-Prediction.ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6,JVT 5th Meeting,JVT-E111.Geneva,Switzerland,9-17 October,2002
    [49]Joint Video Team(JVT)of ISO/IEC MPEG and ITU-T VCEG,"Draft ITU-T Recommendation and Final Draft International Standard Joint Video Specification(ITU-T Rec.H.264(E)ISO/IEC 14496-10 AVC)."March 2005.
    [50]B.Girod,M.Fliter,"Multi-Frame Motion-Compensated Video Compression for the Digital Set-Top Box",ICIP 2002,Sept.2002
    [51]T.Wiegand,X.Zhang,and B.Girod,"Long-term Memory Motion-compensated Prediction",IEEE Trans.Circuits Syst.Video Technol,vol.9,pp.70-84,Fed.1999.
    [53]Yu-Wen Huang,Bing-Yu Hsieh,Tu-Chih Wang,etc."Analysis and reduction of reference frames for motion estimation MPEG-4 AVC/JVT/H.264 ".IEEE International Conference on Acoustics Speech and Signal Processing,2003,2(3):145-148.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700