用户名: 密码: 验证码:
基于DCT和小波变换三维视频编码的研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
数字视频压缩编码技术是实现多媒体通信的关键技术之一。图象序列不仅表现在帧内象素之间的相关性,而且帧间象素之间也同样具有高度的相关性。本文对运动图像从三维的角度将图象序列视为立体图象,通过三维离散余弦变换(3DDCT)或三维小波变换(3DWT)变换后进行变换域的低码率压缩展开了较深入的研究。本文的主要内容包括:
     1.对3DDCT算法进行深入研究,结合XYZ编码的特点,提出部分3DDCT算法及相对应的编解码流程,减少了运算量,提高了编码效率。
     2.针对XYZ中量化策略进行了研究和实验,结合人眼视觉特性通过进化算法得出一个改进的量化表,该量化表可以有效提高编码效率。
     3.针对XYZ编码中仍然使用2DDCT的RLE+Huffman的编码策略问题进行了研究,提出改进的RLE+Huffman编码策略,新的编码策略能较适应3DDCT的特点,实验表明,可以提高编码效率。
     4.针对运动剧烈的视频序列XYZ编码效率下降的问题,提出结合运动补偿的3D MCDCT编码算法,将视频序列沿运动轨迹进行XYZ编码,使编码效率得到显著提高,但编码的复杂度加大,不易于实时应用。
     5.数字视频序列可以看作是从模拟视频信号中抽样得到,帧间相关性过强是由于采样过密造成的;而帧问相关性太弱是由于采样频率太小造成的。本文从采样定理的角度提出变速XYZ视频编码,对帧间相关性太强的序列可以采用帧间跳帧的方法提高编码效率:而对帧间相关性太弱的视频序列可以采用帧间插帧的方法提高帧间相关性从而提高帧间编码效率。本文给出了定时长变速XYZ编码和变时长变速XYZ编码。
     6.针对零树结构在多分辩图象编码中的有效作用,本文提出基于3DDCT的嵌入式三维零树结构视频编码,将3DDCT系数分解成多分辩率塔式结构,采用零树结构及算术编码,得到很好的编码效果。
     7.对基于三维小波变换的视频编码进行了研究,结合三维零树结构及算术
Digital video coding is one of the key technologies of multimedia communication The relationship not only exsits intrafiame , but also exsits interframe of video sequences . This thesis study low bitrate video coding through three dimension space: the video sequence regarded as three dimensional image is transformed by three dimensional diecrete cosine transform (3DDCT) or three dimensional wavelet transform(3DWT), then coded in the transformed space. The main work includes:
    1. The 3DDCT algorithm is deeply studied in this thesis . Combined the characteristic of XYZ coding, a video coding program based on part 3DDCT algorithm is proposed, which can raise code and decode efficiency by reducing the caculation of transform.
    2. The quantize policy of XYZ video coding is researched . Combined the visual characteristic of human eye, a new quantization matrix on XYZ video coding is given through evolutionary algorithms. The simulation results show that this method can give high compression ratio and good reconstruct picture qanlity.
    3. The code policy RLE+Huffman using in the XYZ video coding is still the policy based on 2DDCT, which is not suited to XYZ coding based on 3DDCT. A new code policy is proposed in this thesis which is more suitable for XYZ coding. Experiment results show this new method has higher performance than previous ones.
    4. The XYZ compression efficiency decrease in case of video sequence with violent motion. To counter video sequences with violent motion, a new video coding scheme based on three dimensional cosine transformation with motion compensation is presented. Every eight oringal sequence pictures are compensated through motion estimation, and then use XYZ method to code. The simulations demonstrate that this method is very efficient.
    5. Digital video sequence can be regarded as sampling of analogue video signal, sampling too intensive can make the intraframe relationship higher, and sampling too sparse can make the intraframe relationship lower. Based on sampling theory, a variant rate XYZ video coding method is proposed in this thesis, which can improve the coding efficiency.
    6. Zerotree coding is very efficient in multiresolution image coding. With reorganize the
引文
1.[美]R.C.冈萨雷斯,P.温茨著(李叔梁等译),数字图像处理,科学出版社,1981
    2.[美]A.罗申菲尔德,A.C.卡克著(余英林等译),“数字图像处理”,人民邮电出版社,1982.
    3.荆仁杰,叶秀清等,计算机图像处理,浙江大学出版社,1992
    4.赵荣椿,赵忠明等,数字图像处理导论,西北工业大学出版社,1995,西安
    5.徐建华,图像处理与分析,科学出版社,1992
    6.刘政凯,翟建雄,数字图像恢复与重建,中国科技大学出版社,1989
    7.余英林著,数字图像处理与模式识别,华南理工大学出版社,1990
    8. Aggelos K.Katsaggelos, Recent-Trends in Image Restoration and Enhancement Techniques,Proceedings of IEEE Asia Pacific Conference on Circuits and Systems'96, pp.458-459,1996.
    9. Pratt, Digital Image Processing, NewYork: John Wiley&Sons, 1978
    10. ITU-T SGXV, Video codec for audiovisual services at Px64kbit/s, ITU-T Recommendation H.261, July 1990.
    11.郑伟国,数字视频压缩编码关键技术的研究,中山大学博士论文,1998.
    12. Karel Rijkse, ITU Standardization of very Low Bit Rate Video Coding algorithms, Signal Processing:Image Communication, pp.553~565,Jul. 1995.
    13.徐孟侠,数字电视的进展,通信学报,Vol.16,No.5,pp.60-68,Sep.1995.
    14. ISO/IEC, Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s, ISO/IEC CD 11172-2, MPEG-1, Nov. 1991
    15.张宗念,图像、视频压缩编码方法以及网络实时视频监控的研究,华南理工大学博士学位论文,2000
    16. ISO/IEC, Coding of Moving Pictures and Associated Audio:MPEG-2 System, ISO/IEC 13818-1, Nov. 1994
    17. ITU-T Standardization Sector of ITU, Video coding for very low bitrate??communication, ITU-T Recommendation H.263, Mar. 1996
    18.张旭东等,甚低码率视频压缩编码算法研究和标准化进展,电子科学学刊,Vol.19No.6pp836-842Nov.1997.
    19. ISO/IEC,MPEG-4 Project Description, N1177, Munich MPEG meeting, January 1996.
    20.刘占平,董士海,MPEG-4标准及相关进展,中国图象图形学报,Vol.4(A),No.6 pp.514-518 June 1999
    21. Koenen R. MPEG-4 multimedia for our time. IEEE Spectrum. 1999, 2:26-33
    22.徐孟侠,图像编码的进展,通信学报,Vol.14 No,2 pp.40-47 March 1993
    23. H.G. Musmann, EPirseh,and HJ.Grallert, Advances in picture coding, Proe. IEEE, Vol. PROC-73, pp.523-548, Apr. 1985
    24. k.-H.Tzou, H.G.Musmann, and K.Aizawa, Special Issue on Very Low Bit Rate Video Coding, IEEE Trans. On Circuits and Systems for Video Technology, Vol.4, No.3, pp.213-367, June 1994
    25. W.Li, Y.-Q.Zhang, and M.L.Liou, Special Issue on Advances in Image and Video Compression, Proc. Of the IEEE, Vol.83, No.2, pp. 135-340, Feb. 1995
    26.徐孟侠,“数字电视的进展”,通信学报,Vol.16,No.5,pp.60-68,Sep.1995.
    27. Karel Rijkse, "ITU Standardization of Very Low Bit Rate Video Coding algorithms",Signal Processing:Image Communication, pp.553~565,Jul. 1995.
    28. J.H.Snyder, et al., "Tools for Real-Tune Signal-Processing Research", IEEE Communication Magazine, pp.64-74, Nov. 1993.
    29. HaiBo Li, et al., "Image Sequence Coding at Very Low Bitratew:A Review", Trans on Image Processing, Vol.3 No.5.,pp.589-608, Sep. 1994.
    30.姚庆栋,毕厚杰,王兆华,徐孟侠,“图像编码基础”,浙江大学出版社,1993.
    31.吴乐南编著,徐孟侠审,“数据压缩的原理与应用”,电子工业出版社,1995.
    32. B.B.Mandebrot, "The Fractal Geometry of Nature",W.H. Freeman and Company, New York, 1977.
    33.Kenneth Falconer著,曾文曲,刘世耀等译,“分形几何——学基础及其应用”,东北工学院出版社,1991.
    34. M.F. Barnsley, "Fractal Everywhere", Academic Press New York, 1988.35. M.F. Barnsley and L.P. Hurd, "Fractal Image Compression", AK Peters, Ltd., 1992.
    36. A. Jacquin, Fractal Image Coding:A Review, Proc. IEEE, Vol. 81, No. 10, 1993, pp. 1451-1465.
    37. A. Jacquin, Image coding based on a fractal theory of iterated contractive image transformations, IEEE Trans. Image Process. 1(1) (January 1992) 18-30
    38.邓华秋,菱形分形图象编码及脉冲多普勒气象雷达基本问题的研究,华南理工大学博士学位论文,1997.
    39.王舟,基于远程相关性的图像处理,华南理工大学博士学位论文,1998.
    40.曾文曲,王向阳,王福龙等著,分形理论与分形的计算机模拟,东北大学出版社,1993.
    41. Y. Fisher Ed., Fractal Image Compression:Theory and Application, Springer-Verlag, New York, 1994.
    42. G. Davis, Self-quantization of wavelet subtrees, Proc. SPIE Wavelet Applications Ⅱ, Orlando, Vol. 2491, April 1995, pp. 141-152
    43. G. Davis, A wavelet-based analysis of fractal image compression, IEEE Trans. Image Process. Vol. 7, No. 2, February 1998, pp141-154
    44. Ying Zhang, Lai-Man Po, Variable tree size fractal compression for wavelet pyramid image coding, Signal Processing: Image Communication 14 (1999) 195-208
    45.冉启文著,“小波分析方法及其应用”,哈尔滨工业大学出版社,1995.
    46.Y.迈耶著,尤众译,小波与算子,第一卷,世界图书出版公司北京分公司出版,1992
    47. I. Daubechies, Orthonorman based of compactly supported wavelet, Comm. Pure. Appl. Math. 41(1988)909-998
    48. I. Daubechies, Ten Lectures on Wavelets,' Ed. Society for Industrial and Applied Mathematics, Philadelphia, Pennsylvania, 1992
    49. S.G. Mallat, A theory for multiresolution signal decomposition: The wavelet representation, IEEE Trans. Pattern Anal. Machine Intell.??PAMI-11(1989) 647-693
    
    50. A. S. Lewis, G. Knowles, Image compression using the 2D wavelet transform, IEEE Trans. Image Process 1(2)(1992) 244-250
    
    51. A. S. Lewis, G. Knowles, Video compression using 3D wavelet transforms, Electron. Lett. 26(6)(1990) 396-397
    
    52. G. Knowles, VLSI architecture for the discrete wavelet transform, Electron. Lett. 26(15)(1990) 1184-1185
    
    53. S. Mallat, S. Zhong, Compact image coding from edges with wavelets, IEEE Proc. ICASSP, 1991, pp. 2745-2748
    
    54. G. Beylkin, R. Coifman, V. Rokhlin, Fast wavelet transforms and numerical algorithms I, CPAM,XLIV(1991) 141-183
    
    55. D.Marr, E. Hildreth, Theory of edge detection, Proc. Roy. Soc. London 207(1980)187-217
    
    56. T.Chang, C.-C.Jay Kuo, Texture analysis and classification with tree-structured wavelet transform, IEEE Trans. Image Process. 2(4) (1993) 429-441
    
    57. Wen-Chuang Huang, Long-Wen Chang, Predictive subband image coding with wavelet transform, Signal Processing:Image Communication 13 (1998)171-181
    
    58. R. Rinaldo and G.Calvagno, Image coding by block prediction of multiresolution subimages, IEEE Trans. Image Processing, Vol.4, pp. 909-920, July 1995
    
    59. M.Antonini,M.Barlaud,P.Mathieu,I.Daubechies, Image coding using wavelet transform, IEEE Trans. Image Process. 1(2)(April 1992) 205-220
    
    60. J.Shapiro, Embedded image coding usong zerotrees of wavelet coefficients, IEEE Trans. Sognal Process. 41 (12)(December 1993) 3445-3462
    
    61. A. Said, W. A. Pearlman, A new, fast, and efficient image codec based on set partitioning in hierarchical trees, IEEE Trans. Circuits Syst. Video Technol., Vol. 6, No. 3,pp. 243-250, June 1996
    62.周建鹏,杨义先,一种基于小波变换的低比特率混合图像编码方法,电子学报,Vol.27,No.2,pp.126-128,Feb.1999
    63. Raymond Westwater, Borko Furht, Real-time Video Compression, Kluwer Academic Plulishers, Boston, 1997
    64.狄红卫,低码率图像压缩研究,华南理工大学博士学位论文,1999
    65. W.H. Chen, C. H. Smith, and S.C. Fralick, A fast computational Algorithm for the discrete cosine transform, IEEE Trans. Commun., Vol. COM-25, pp. 1004-1009, Sept. 1977
    66. B.G. Lee, A new algorithm to compute the discrete cosine transform, IEEE Trans. Acoust., Speech, and Signal Process., Vol. ASSP-32, No. 6, pp. 1243-1245, Dec. 1984
    67. J. Makhoul, A fast cosine transform in one and two dimensions, IEEE Trans. Acoust., Speech, Signal Processing, Vol. ASSP-28, pp. 27-34, Dec. 1980
    68.韦岗,邱伟著,现代信号处理理论与技术,华南理工大学出版社,1994
    69.黎洪松著,数字视频技术及其应用,清华大学出版社,1998
    70.胡广书著,数字信号处理—理论、算法与实现,清华大学出版社,1998
    71.李振辉,李仁和著,探索图象文件的奥妙,清华大学出版社,1996
    72.张春田,张劲松,运动补偿视频编码中DCT编码效率的研究,电子学报,Vol.24,No.1,pp.1-5,Jan.1996
    73. Z. Xiong, O. Gulerguz, and M. T. Orchard, A DCT-based embedded image coder, IEEE Signal Proc. Letters, 3(11), Nov. 1996
    74. David B.Fogel, An Introduction to Simulated Evolutionary Optimization, IEEE Trans.on Neural Networks,Vol. 5,No. 1, pp. 3-14, Jan., 1994.
    75. Thomas Back and Hans-Paul Schwefel, An Overview of Evolutionary Algorithms for Parameter Optimization, Evolutionary Computation, Vol. 1,No. 1,pp. 1-23,1993
    76. Thomas Back, and Hans-Paul Schwefel, Evolutionary Computation: An overview, Proceedings of IEEE conference on Evolutionary Computation, pp.20-29,1996.
    77. David B.Fogel, Evolutionary Computation——Toward a New Philosophy of Machine Intelligence, IEEE Press, 1992
    78.姚新,陈国良,等,进化算法研究进展,计算机学报,第18卷,第9期,??pp.694-706,1995年9月。
    79. Holland,J.H., Adaptation in Natural and Artificial Systems, Ann Arbor:. The University of Michigan Press, 1975.
    80. Schwefel, H.-P., Numeriscbe Optimuerung von Computer-Modellen mittels der Evolutionsstrategie, Vol. 26 of Interdisciplinary Systems research Basel:Birkhauser, 1977
    81. Fogel,L. J., et al, Artifical Intelligence through Simulated Evolution, Newyork: John wiley, 1966.
    82. Z. Michalewicz, Genetic Algorithms+Data Structures=Evolutionary Programs, 3rd ed., Springer, 1996.
    83. Thomas Back, Book Review: Proceedings of the Fifth International Conference on Genetic Algorithms, Evolutionary Computation, Vol.2,No.2,pp. 181-191.
    84.王哲,进化算法在图像处理中的应用,华南理工大学博士学位论文,1998.
    85. Chengjian Wei, Susu Yao and Zhenya He, A Modified Evolutionary Programming, Proceedings of IEEE international conference on Evolutionary Computation, pp.135-138,1996.
    86. Robert Hinterding, Gaussian Mutation and Self-adaption for Numeric Genetic Algorithms, Proceedings of IEEE international conference on Evolutionary Computation, pp.384-389,1995.
    87. Frank Kursawe, Towards Self-adapting Evolution Strategies, Proceedings of IEEE International Conference on Evolutionary Computation, pp.283-288, 1995.
    88. I.De Falco,R.Del Balio and Tarantiono, Testing Parallel Evolution Strategies on the Quadratic Assignment Problem, Proceedings ofIEEE SMC'93,pp.254-259.
    89. Thomas Back, Ulrich Hammel, Evolution Strategies Applied to Perturbed Objective Functions, Proceedings of the first IEEE Conference on Evolutionary Computation, 1994, pp. 40-45.
    90. Hmhlenbein, et al, Parallel Genetic Algorithms, Population Genetics and Combinatorial Optimization, in Proceedings of the 3th Conf on Gas, J D Schaffer, EdSanMateo, CA:Morgan Kaufmann, 1989, pp. 416-421.
    91.许洁斌,“运动估值技术的研究及其在视频图象压缩编码中的应用”,华南理??工大学博士学位论文,1998.
    92.余松煜,图象通信中的运动估值,通信学报,3.1993
    93.朱斌,张春田,运动补偿视频编码的一种非DCT编码方法,电子学报,Vol.27,No.2,pp.124-125.Feb.1999
    94. R.Rajagopalan, E.Feig, and M.T.Orchard, Motion Optimization of Ordered Blocks for Overlapped Block Motion Compensation, IEEE Trans. on Circuits and Systems for Video Techno., Vol.8, No.2, pp119-123, April 1998
    95. S.C.Pei, C.W.Ko, M.S. Su, Global motion estimation in model-base image coding by tracking three-dimensional contour feature points, IEEE Trans. on Circuits and Systems for Video Techno., Vol.8, No.2, pp 181-190, April 1998
    96. M.Ghanbari, The cross-search algorithm for motion estimation, IEEE Trans. Commun., Vol.38, No.7, pp.950-953, Jul. 1970
    97. A.Netravali,J.D.Robbins, Motion compensated television coding part Ⅰ, Bell Syst. "Teeh.J., Vol.58, No.3, pp.629-668, Apr. 1979
    98. S.Kappagantula,et.al, Motion Compensated Interframe Image Prediction, IEEE Trans. Comm., Vol.com33, No.9, pp. 128-138, Sep. 1985
    99. K.Y.Yoo, J.K.Kim, A combined estimation and compensation method.of global and local motions for video compression, Proceeding of PCS'97, pp.337-342, 1997
    100. H.Jozawa, K.Kamikura, et al., Two-stage motion compensation using adaptive global MC and local affine MC, IEEE Trans. on Circuits and System for Video Teehno., Vol.7, No. 1, pp.75-85, Feb. 1997
    101. M.Vetterli, Fast 2-D discrete cosine transform, Proc.IEEE ICASSP, pp.1538-1541 1985
    102. E.feig, S.Winograd, Fast algorithms for the discrete cosine transform, IEEE Trans. Signal Processing, Vol.40, pp.2174-2193, Sept. 1992
    103. H.R.Wu, D.Tan, Implementation of Cho and Lee's 2D DCT algorithm using LLM 1D DCT algorithm, Proc. 1997 Int. Workshop on Intelligent Signal Processing and Communication Systems, Nov. 1997, Sect.24, pp.2.1-2.5
    104. N.I.Cho and S.U.Lee, A fast 4x4 DCT algorithm for the recursive 2-D DCT, IEEE Trans. Signal Processing, Vol.40, pp.2166-2173, Sept. 1992105. H.R.Wu and Z.Man, Comments on "Fast algorithm and implementation of 2D discrete cosine transform, IEEE Trans. on Circuits and Systems for Video Teehno., Vol.8, No2, pp128-129, April 1998
    106. Y.L.Chan and W.C.Siu, Variable temporal-length 3-D discrete cosine transform coding, IEEE Trans. on Image Processing, Vol.6, No.5, pp.758-763, May 1997
    107. J.A.Roese and W.K.Pratt, Interframe cosine transform image coding, IEEE Trans. Commun., Vol.25, pp. 1329-1338, Nov. 1977
    108. P.G.Howard and J.S.Vitter, Arithmetic coding for data compression, Proc.IEEE, Vol.82, pp.857-865, June 1994
    109. Y.L.Chan and W.C.Siu, A new adaptive interframe transform coding using directional classification, Proc. IEEE Int. Conf. Image Processing, No.2, pp.977-981, Nov.1995
    110. Y.L.Chan and W.C.Siu, Fast interframe transform coding based on characteristic of transform coefficients and frame difference, Proc. IEEE Int. Syrup., Circuits and Systems, pp.449-452, Apr.1995
    111. B.Olstad, Noise reduction in ultrasound images using multiple linear regression in a temporal context, Proc. SPIE, Vol.1451, pp.269-281, 1992
    112. K.W.Cheung, C.H.Cheung and L.M.Po, Embedded zerotree image coding based on integer cosine Iransform, http://www.image.cityu.edu.hk
    113. I.H.Witten, R.M.Neal, J.G.Cleary, Arithmetic coding for data compression, Communications of the ACM, Vol.30, No.6, pp.520-540, June 1987
    114. Z.Xiong, O.Gulerguz, and M.T.Orchard, A DCT-based embedded image coder, IEEE Signal Proc. Letters, 3(11), Nov. 1996
    115. F.J.Hampson, J.C.Pesquet, M-band nonlinear subband decomposition with perfect reconstruction, IEEE Trans. on Image Processing, Vol.7, No. 11, Nov. 1998
    116. O.J.Kwon, R.Chellappa, Region adaptive subband image coding, IEEE Trans. on Image Processing, Vol.7, No.5, May.1998
    117. T.R.Fischer, "A pyramid vector quantizer, IEEE Trans. Inform. Theory, Vol. IT-32, pp.568-583, July 1986
    118. D.Hargreaves and J.Vaisey, Bayesian motion estimation and interpolation in??interlaced video sequences, IEEE Trans. on Image Processing, Vol.6, No.5, pp.764- 769 May 1997
    119. T.Sikora, The MPEG-4 video standard verification model, IEEE Trans. CSVT, Vol.7, No. 1, Feb. 1997
    120. ISO/IEC, ISO/IEC international standard 14496-2, MPEG-4 Visual, 1999.9

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700