用户名: 密码: 验证码:
图像及视频超分辨率重建技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
高分辨率图像能提供目标场景更多的细节信息,对于图像的分析和处理有重要作用。随着视频和图像处理技术应用的普及和快速增长,人们对视频和图像序列的高分辨率要求越来越高,这对感光器制造技术提出了新的挑战。然而,当前通过减小单位像素尺寸或增大感光器芯片尺寸的硬件方案都存在技术上的瓶颈,同时价格昂贵的高精密感光器不适合于普及应用。因此,在现有条件限制下,如何提高图像和视频序列的空间分辨率,就成为人们研究的重点。超分辨率重建技术就在这种背景下应运而生。该项技术通过信号处理的软件方法实现图像空间分辨率的提升,它利用多幅关于同一场景的具有亚像素级位移的低分辨率图像来重建高分辨率图像。超分辨率重建可突破图像采集设备的分辨率限制,充分利用多帧低分辨率图像之间的互补信息,实现亚像素级的图像信息融合。利用此项技术能够在不升级现有采集设备的情况下,提高采集图像的分辨率;也能够在不增加传输信号带宽的情况下,改善图像或视频的画面质量。因此,超分辨率重建技术在遥感、军事、医学成像等领域有很广泛的应用前景,当前,超分辨率重建技术己成为图像处理领域最热门的研究内容。随着研究的加深,超分辨率研究范围已经从传统的原始图像序列拓展到压缩视频,从空间分辨率拓展到时间分辨率,从单视点视频拓展到多视点视频。
     本文对超分辨率重建技术中的若干关键问题进行了研究,主要集中在图像配准、立体图像视差估计、立体视频的空域/时域超分辨率重建等方面。具体工作和研究成果如下:
     1、超分辨率重建通常包含图像配准、插值、复原及后处理等操作,图像配准是其中最为基本的一步,它的性能直接影响到图像超分辨率重建的效果。本文提出一种结合小波多分辨率分解、NEDI插值和三步法块匹配的亚像素配准方法,在保证配准精度的同时,可大大减少搜索量,提高配准速度。
     2、基于传统香农熵的互信息是一个定量测度,它以图像中各种灰度值出现的概率为出发点,只考虑了两幅图像中对应像素的关系,忽略了水平面上像素之间的空间相关性,使得配准的峰值不够尖锐,配准的最佳位置难于确定。本文提出一种结合图像梯度、方差和归一化互信息的相似性测度,有效融合图像定性的空间位置关系和定量的灰度统计信息,缓解了基于单一互信息测度配准容易出现的局部极值问题,同时对噪声图像具有一定的鲁棒性。
     3、提出一种结合自适应加权和互信息准则的局部立体匹配方法,将前者在代价聚合中的优势,以及后者在代价计算中的优势联合起来,有效提高了立体匹配的抗辐射干扰性能力。实验结果表明,本文所提出的立体匹配算法获得的视差精度与当前优秀的局部算法精度相当,尤其能够很好地处理现有算法难以解决的各类光照差异问题,具有较鲁棒的抗干扰的能力,对于噪声等辐射畸变也有较高的准确性和适应性。
     4、结合分布式视频编码架构,提出了一种空时超分辨率重建方法,通过挖掘立体视频在时间/视点间的冗余信息,实现了对低帧率序列的帧率提升和低空间分辨率序列的分辨率增强,实验结果表明,所提出的超分辨率重建方法在主观视觉和客观性能方面都取得了较好的效果。
A high quality image always contains further detailed information of targets, and it is of great value for analysis and post-process. Along with the rapid improvements of video and image processing technologies in recent years, the demand for high resolution video and image sequences grows fast, which gradually raises new challenges for manufacturing technology on image sensors. At present, however, it is impossible or hard to break the bottleneck by hardware schemes, such as reducing the pixel size or increasing the chip size. In the meanwhile, prices of expensive high precision sensor are not applicable to wider applications. So, how to enhance the spatial resolution of video and image sequences under these limitations becomes an active research topic. The technique of super-resolution (SR) reconstruction is developed under this circumstance. Super-resolution reconstruction refers to a software technique that enhances the resolution of images or videos by using digital signal processing technology. It reconstructs high resolution and high quality image(s) from a group of degraded low resolution images with sub-pixel shifts from the same scene. It breaks though the resolution limit of image acquisition equipment and can achieve data fusion on sub-pixel level by wealthy complementary information. Utilizing SR techniques can improve the resolution of images without updating the existing low-resolution sampling devices; and also can enjoy high-quality videos without increasing the signal transmission bandwidth. SR reconstruction processing has proved to be useful in many practical applications, such as remote sensing, military detection, medical image, etc. Currently, image super-resolution reconstruction has become one of the hottest areas of image processing. With the deepening of the research, the range of this technique has expanded from traditional original image sequence to compressed video, from spatial resolution to temporal resolution, from mono-view video to multi-view video.
     The dissertation investigates several key issues of super-resolution of video and image sequences including image registration, stereo matching and spatio-temporal super-resolution for stereo video. The main contributions and innovation points of the dissertation are as follows:
     1. Super-resolution reconstruction usually consists of three steps:registration, reconstruction and restoration. As the necessary step of super-resolution reconstruction, the performance of the registration algorithm greatly affects the quality of the reconstructed images. This paper proposed a sub-pixel registration method using a combination of wavelet-based multi-resolution decomposition, NEDI interpolation and three-step search block-matching algorithm. The proposed approach can greatly reduce the search volume and accelerate the speed of the registration, while maintaining registration accuracy.
     2. Mutual information is a quantitative criteria based on the traditional Shannon entropy. It takes into account only the relationships between corresponding individual pixels and not those of each pixel's neighborhood, such that the alignment of the peak is not sharp, the best alignment position is difficult to locate. We propose a new similarity metric, which combines mutual information and a weighting function based on image gradient and image variance. This method effectively joints qualitative spatial positional relationship and quantitative statistical information of grayscale. The proposed similarity measurement alleviates the local minima problem and is more robust to noise than only with mutual information.
     3. We propose a new local stereo matching approach of combined adaptive weight and mutual information, which incorporating the advantage of the former in cost aggregation, as well as the advantage of the latter in cost computation. The experimental results show that the accuracy obtained by our proposed stereo matching algorithm is almost as excellent as state of the art local algorithm. Especially, this method can effectively improve the performance with kinds of radiometric differences, which is difficult to solve by other existing algorithms.
     4. We propose a spatio-temporal super-resolution reconstruction method for a distributed multi-view video coding architecture. The super-resolution scheme achieves temporal resolution enhancement and spatial resolution enhancement for mixed resolution video by exploiting inter-view/temporal correlation. Simulation results indicate that the super-resolution method has achieved good results in subjective visual and objective performance.
引文
[1]张广军.机器视觉.北京:科学出版社,2005.
    [2]马颂德,张正友.计算机视觉一计算理论与算法基础.北京:科学出版社,2003.
    [3]S.C. Park, M. K. Park, and M. G. Kang, "Super-resolution image reconstruction:a technical overview," IEEE Signal Processing Magazine, vol.20, no.3, pp.21-36, 2003.
    [4]S. Farsiu, "A fast and robust framework for image fusion and enhancement," Ph.D. dissertation, Electrical Engineeting, UC Santa Cruz, December 2005.
    [5]S. Chaudhuri, Super-Resolution Imaging, Kluwer Academic Publishers, Boston MA,2001.
    [6]A.K. Katsaggelos, R. Molina and J. Mateos, Super Resolution of Images and Video. Morgan & Claypool, San Rafael,2007.
    [7]V. Bannore, Iterative-Interpolation Super-Resolution Image Reconstruction:A Computationally Efficient Technique. Springer, Berlin,2009.
    [8]P. Milanfar, Super-Resolution Imaging. CRC Press, Boca Raton,2010.
    [9]M.G. Kang and S. Chaudhuri, "Super-resolution image reconstruction," IEEE Signal Processing Magazine, vol.20, pp.19-20,2003.
    [10]N. K. Bose, R. H. Chan, and M. K. Ng, Eds., "Special issue on high-resolution image reconstruction," International journal of imaging systems and technology, vol.14,2004.
    [11]M. Ng, T. Chan, M. G. Kang, and P. Milanfar, "Special Issue on Superresolution Imaging:Analysis, Algorithms, and Applications," EURASIP Journal on Applied Signal Processing, 2006.
    [12]A. K. Katsaggelos and R. Molina, "Special issue on super resolution," Computer Journal, vol.52, pp.1-167,2008.
    [13]B.R. Hunt, "Super resolution of images: algorithms, principles, performance," International Journal of Imaging Systems and Technology, vol.6, no.4, pp. 297-304,1995.
    [14]S. Borman and R.L. Stevenson, "Spatial resolution enhancement of low-resolution image sequences:a comprehensive review with directions for future research," University of Notre Dame, Technical Report,1998.
    [15]S. Borman and R.L. Stevenson, "Super-resolution from image sequences-a review," In Proceedings of the IEEE Midwest Symposium on Circuits and Systems, pp.374-378,1998.
    [16]M.K. Ng and N.K. Bose, "Mathematical analysis of super-resolution methodology," IEEE Signal Processing Magazine, vol.20, pp.62-74,2003.
    [17]D. Capel and A. Zisserman, "Computer vision applied to super resolution," IEEE Signal Processing Magazine, vol.20, pp.75-86,2003.
    [18]S. Farsiu, D. Robinson, M. Elad and P. Milanfar, "Advances and challenges in super-resolution," International Journal of Imaging Systems and Technology, vol. 14, pp.47-57,2004.
    [19]J. D. Ouwerkerk, "Image super-resolution survey," Image and Vision Computing, vol.24, pp.1039-1052,2006.
    [20]J. Tian and K. K. Ma, "A survey on super-resolution imaging," Springer, Signal, Image and Video Processing, vol.5, pp.329-342,2011.
    [21]S. Farsiu, D. Robinson, M. Elad and P. Milanfar, "Fast and robust multi-frame super-resolution," IEEE Transactions on Image Processing, vol.13, no.10, pp. 1327-1344,2004.
    [22]S. Baker and T. Kanade, "Limits on super-resolution and how to break them," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, no.9, pp.1167-1183,2002.
    [23]D. Capel, Image Mosaicing and Super-resolution. Springer,2004.
    [24]W. T. Freeman, T. R. Jones, and E. C. Pasztor, "Example-based super resolution," IEEE Computer Graphics and Applications, vol.22, no.2, pp.56-65,2002.
    [25]R. Hardie, "A fast image super-resolution algorithm using an adaptive Wiener filter," IEEE Transactions on Image Processing, vol.16, no.12. pp.2953-2964, 2007.
    [26]J. Yang, J. Wright, T. S. Huang and Y. Ma, "Image super-resolution via sparse representation," IEEE Transactions on Image Processing, vol.19, no.11, pp. 2861-2873,2010.
    [27]S. Borman, "Topics in multiframe superresolution restoration," Ph.D. dissertation. University of Notre Dame, Notre Dame, IN, May 2004
    [28]M. R. Banham and A. K. Katsaggelos, "Digital Image Restoration," IEEE Signal Processing Magazine, vol.14, no.2, pp.24-41,1997.
    [29]T. S. Huang and R. Y. Tsai, "Multi-frame image restoration and registration," Advances in Computer Vision and Image Processing, vol.1, pp.317-339,1984.
    [30]S. P. Kim, N. K. Bose and H. M. Valenzuela, "Recursive reconstruction of high resolution image from noisy under sampled multi frames," IEEE Transactions on Acoustics, Speech, Signal Processing, vol.38, pp.1013-1027,1990.
    [31]S. P. Kim, W. Y. Su, "Recursive high-resolution reconstruction of blurred multiframe images," IEEE Transactions on Image Processing, vol.2, pp.534-539, 1993.
    [32]N. K. Bose, H. C. Kim and H. M. Valenzuela, "Recursive implementation of total least squares algorithm for image reconstruction from noisy, undersampled multiframes," IEEE International Conference on Acousics, Speech and Signal Processing, vol.5, pp.269-272,1993.
    [33]S. H. Rhee and M. G. Kang, "Discrete cosine transform based regularized high-solution image reconstruction algorithm," Optical Engineering, vol.38, no.8, pp.1348-1356,1999.
    [34]D. Keren, S. Peleg and R. Brada, "Image sequence enhancement using subpixel displacements," Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol.5, no.1, pp.742-746,1988.
    [35]K. Aizawa, T. Komatsu, and T. Saito, "Acquisition of very high resolution images using stereo cameras," In SPIE Visual Communications and Image Processing, vol.1, pp.318-328,1991.
    [36]T. Komatsu, K. Aizawa, T. Igarashi, and T. Saito, "Signal-processing based method for acquiring very high resolution image with multiple cameras and its theoretical analysis," Proc. Inst. Elec. Eng., vol.140, pp.19-25,1993.
    [37]N. Nguyen and P. Milanfar, "An efficient wavelet-based algorithm for image super resolution," IEEE International Conference on Image Processing, 2000, vol. 2, pp.351-354.
    [38]M. Irani and S. Peleg, "Improving resolution by image registration," CVGIP: Graphical Models and Image Processing, 1991, vol.53, pp.231-239.
    [39]H. Stark and P. Oskoui, "High resolution image recovery from image-plane arrays, using convex projections," Journal of the Optical Society of America A, vol.6, pp. 1715-1726,1989.
    [40]A. M. Tekalp, M. K. Ozkan, and M. I. Sezan, "High-resolution image reconstruction from lower-resolution image sequences and space-varying image restoration," IEEE International Conference on Acousics, Speech and Signal Processing, pp.169-172,1992.
    [41]R. R. Schultz and R. L. Stevenson, "A bayesian approach to image expansion for improved definition," IEEE Transactions on Image Processing, vol.3, no.2, pp. 233-242,1994.
    [42]R. Schultz and R. Stevenson, "Extraction of High-Resolution Frames from Video Sequences," IEEE Transactions on Image Processing, vol.5, no.6, pp.996-1011, 1996.
    [43]M. Elad and A. Feuer, "Restoration of a single super resolution image from several blurred, noisy and undersampled measured images," IEEE Transactions on Image Processing, vol.6, no.12, pp.1646-1658,1997.
    [44]M. Elad and A. Feuer, "Super-Resolution Restoration of an Image Sequence: Adaptive Filtering Approach," IEEE Transactions on Image Processing, vol.8, no. 3, pp.387-395,1999.
    [45]C. A. Segall, R. Molina, A. K. Katsaggelos, and J. Mateos, "Bayesian resolution enhancement of compressed video," IEEE Transactions on Image Processing, vol. 13, no.7, pp.898-911,2004.
    [46]C. A. Segall, R. Molina, and A.K. Katsaggelos, "High-resolution images from low-resolution compressed video," IEEE Signal Processing Magazine, vol.20, pp. 37-48,2003.
    [47]R. Molina, A. K. Katsaggelos, L.D. Alvarez, and J. Mateos, "Towards a new video compression scheme using super-resolution," In Proceedings of SPIE-The International Society for Optical Engineering,2006, pp 1-13.
    [48]D. Barreto, L. Alvarez, R. Molina, et al., "Region-based super-resolution for compression," Multidimensional Systems and Signal Processing, vol.18 (2-3), pp 59-81,2007.
    [49]J. Wang, S. Zhu and Y. Gong, "Resolution enhancement based on learning the sparse association of image patches," Pattern Recognition Letters, vol.31, pp. 1-10,2010.
    [50]M. V. Joshi, S. Chaudhuri and R. Panuganti, "Super-Resolution Imaging:Use of Zoom as a Cue," Image and Vision Computing, vol.22, no 14, pp.1185-1196, 2004.
    [51]L. G. Brown, "A survey of image registration techniques," ACM Computing Surveys, vol.24, no.4, pp.325-376,1992.
    [52]B. Zitova, and J. Flusser, "Image Registration Methods:A Survey," Image and Vision Computing, vol.21, pp.977-1000,2003.
    [53]F. Maes, A. Collignon, D. Vandermeulen, G. Marchal, and P. Suetens, "Multi-modality image registration by maximization of mutual information," IEEE Transactions on Medical Imaging, vol.16, no.2, pp.187-198,1997.
    [54]J. P. W. Pluim, J. B. A. Maintz, and M. A. Viergever, "Mutual-information-based registration of medical images:a survey," IEEE Transactions on Medical Imaging, vol.22, no.8, pp.986-1004,2003.
    [55]J. P. W. Pluim, J. B. A. Maintz, and M. A. Viergever, "Interpolation artefacts in mutual information-based image registration," Computer Vision and Image Understanding, vol.77, no.2, pp.211-232,2000.
    [56]叶波,陆雪松,张素,陈亚珠.基于互信息图像配准中的局部极值问题研究.计算机工程与应用,vol.43, no.6, pp.58-61,2007.
    [57]D. Plattard, M. Soret, J. Troccaz, P. Vassal, J.-Y. Giraud, G. Champleboux, X. Artignan and M. Bolla, "Patient set-up using portal images:2D/2D image registration using mutual information," Computer Aided Surgery, vol.5, no.4, pp. 246-262,2000.
    [58]M. Jenkinson and S.M. Smith, "A global optimization method for robust affine registration of brain images," Medical Image Analysis, vol.5, no.2, pp.143-156, 2001.
    [59]M. J. D. Powell, "An efficient method for finding the minimum of a function of several variables without calculating derivatives," Computer Journal, vol.7, pp. 155-162,1964.
    [60]P. J. Burt and E. H. Adelson, "The Laplacian pyramid as a compact image code," IEEE Transactions on Communications, vol.9, no.4, pp.532-540,1983.
    [61]E. H. Adelson, E. P. Simoncelli, and W. T. Freeman, "Pyramids and multiscale representations," in Proceedings of the 13th European Conference on Visual Perception, Paris,1990.
    [62]A. Rosenfeld, Ed., "Multiresolution Image Processing and Analysis," Springer Verlag,1984.
    [63]R. L. Allen, F. A. Kamangar and E. M. Stokely, "Laplacian and orthogonal wavelet pyramid decompositions in coarse-to-fine registration," IEEE Transactions on Signal Processing, vol.41, no.12, pp.3536-3541,1993.
    [64]Q. Tian and M.N. Huhns, "Algorithms for subpixel registration," Computer Vision, Graphics, and Image Processing, vol.35, pp.220-233,1986.
    [65]R. W. Frischholz and K. P. Spinnler, "A class of Algorithms for Real-Time Subpixel Registration," Europto Conference, Munich,1993.
    [66]I. G. Karybali, E. Z. Psarakis, K. Berberidis and G. D. Evangelidis, "An Efficient Spatial Domain Technique for Subpixel Image Registration," Signal Processing: Image Communication, Elsevier, vol.23, no.9, pp.711-724,2008.
    [67]P. Thevenaz and M. Unser, "A pyramid approach to sub-pixel image fusion based on mutual linformation," in Proceedings of the IEEE International Conference on Image Proccssing(ICIP"96), vol.1, pp.265-268,1996.
    [68]P. Thevenaz, U.E. Ruttimann and M. Unser. "A Pyramid Approach to Subpixel Registration Based on Intensity," IEEE Transactions on Image Processing, vol.7, no.1, pp.27-41,1998.
    [69]黎俊,彭启民,范植华.亚像素级图像配准算法研究.中国图象图形学报,vol. 13, no.11, pp.2071-207,2008.
    [70]J. Tsao, "Interpolation artifacts in multimodality image registration based on maximization of mutual information," IEEE Transactions on Medical Imaging, vol.22, pp.854-964,2003.
    [71]G. K. Rohde, A. Aldroubi and D. M. Healy, Jr., "Interpolation artifacts in sub-pixel image registration," IEEE. Trans. Image Processing, vol.18, no.2, pp.333-345, 2009.
    [72]P. Viola and W. M. Wells, "Alignment by maximization of mutual information," in Proceedings of the 5th International Conference on Computer Vision, pp.16-23, 1995.
    [73]P. Viola and W. M. Wells, "Alignment by maximization of mutual information,' International Journal of Computer Vision, vol.24, no.2, pp.137-154,1997.
    [74]A. Collignon, F. Maes, D. Delaere, D. Vandermeulen, P. Suetens, and G. Marchal, "Automated multimodality medical image registration using information theory," in Proceedings of the 14th International Conference on Information Processing in Medical Imaging (IPMI95), Computational Imaging and Vision, vol.3, pp. 263-274,1995.
    [75]C. Studholme, D.L.GHill, and D.J.Hawkes, "An overlap invariant entropy measure of 3D medical image alignment," Pattern Recognition, vol.32, no.1, pp. 71-86,1999.
    [76]X. Li and M. T. Orchard, "New edge-directed interpolation," IEEE Transactions on Image Processing, vol.10, no.10, pp.1521-1527,2001.
    [77]T. Koga, K. Iinuma, A. Hirano, Y. Iijima and T. Ishiguro, "Motion compensated interframe image coding for video conference," in Proceedings of NTC81, pp.G5.3.1,1981.
    [78]Y. S. Kim, J. H. Lee and J. B. Ra, "Multi-sensor image registration based on intensity and edge orientation information," Pattern Recognition, vol.41, no.11, pp.3356-3365,2008.
    [79]A. Anthony, O. Lofffeld, "Image registration using a combination of mutual information and spatial information," IEEE International Conference on Geoscience and Remote Sensing Symposium, pp.4012-4016,2006.
    [80]J. P. W. Pluim, J. B. A. Maintz and M. A. Viergever, "Image registration by maximization of combined mutual information and gradient information," IEEE Transactions on Medical Imaging, vol.19, no.8, pp.809-814,2000.
    [81]M. Belis and S. Guiasu, "A Quantitative-Qualitative Measure Of Information In Cybernetic Systems," IEEE Transactions on Information Theory, vol.14, pp.593-594,1968.
    [82]Y. He, A. B. Hamza and H. Krim, "A generalized divergence measure for rubust image registration," IEEE Transactions on Signal Processing, vol.51, no.5, pp. 1211-1220,2003.
    [83]X. M. Cao, Q. Q. Ruan, "A survey on evaluation methods for medical image registration," IEEE/ICME International Conference on Complex Medical Engineering, pp.718-721,2007.
    [84]J. Tsao and P. C. Lauterbur, "Generalized clustering-based image registration for multi-modality images," in Proceedings of the IEEE Engineering in Medicine and Biology Society, pp.667-670,1998.
    [85]D. Robinson and P. Milanfar, "Fundamental performance limits in image registration," IEEE Transactions on Image Processing, vol.13, pp.1185-1199, 2004.
    [86]A. Barjatya, "Block Matching Algorithms for Motion Estimation," DIP 6620 Spring 2004 Final Project Paper.
    [87]M. Santamaria and M. Trujillo, "A comparison of block-matching motion estimation algorithms," in Proceedings of the 7th Colombian Computing Congress, 2012, pp.1-6.
    [88]R. Li, B. Zeng, and M. L. Liou, "A new three step search algorithm for block motion estimation," IEEE Transactions on Circuits and Systems for Video Technology, vol.4, no.4, pp.438-442,1994
    [89]S. Zhu and K. K. Ma, "A new diamond search algorithm for fast block-matching motion estimation," IEEE Transactions on Image Processing, vol.9, no. pp. 287-290,2000.
    [90]S. Zhu, X. Shen, J. Tian and K. Belloulata, "A new cross-diamond search algorithm for fast block motion estimation," in Proceedings of the 16th IEEE International Conference on Image Processing,2008, pp.1581-1584.
    [91]D. Marr, Vision:A Computational Investigation into the Human Representation and Processing of Visual Information, W. H. Freeman, San Francisco,1983.
    [92]D. Scharstein and R. Szeliski, "A Taxonomy and evaluation of dense two-frame stereo correspondence algorithms," International Journal of Computer Vision, vol. 47, no.1-3, pp.7-42, Apr.2002.
    [93]R. Szeliski, Computer Vision:Algorithms and Applications. Springer. Berlin, Germany,1st edition,2011.
    [94]Y. Boykov, O. Veksler, and R. Zabih, "Fast approximate energy minimization via graph cuts," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no.11, pp.1222-1239, Nov.2001.
    [95]C. Lei, J. Selzer, and Y.H. Yang, "Region-tree based stereo using dynamic programming optimization," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.2378-2385,2006.
    [96]P. Felzenszwalb and D. Huttenlocher, "Efficient belief propagation for early vision," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.1, pp.261-268,2004.
    [97]T. Kanade and M. Okutomi, "A stereo matching algorithm with an adaptive window:Theory and experiment," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.16, no.9, pp.920-930, 1994.
    [98]A. F. Bobick and S. S. Intille, "Large occlusion stereo," International Journal of Computer Vision, vol.33, no.3, pp.181-200, 1999.
    [99]H. Hirschmueller, "Improvements in Real-Time Correlation-Based Stereo Vision," IEEE Workshop on Stereo and Multi-Baseline Vision, 2001.
    [100]K. J. Yoon and I. S. Kweon, "Adaptive support weight approach for correspondence search," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, no.4, pp.650-656, 2006.
    [101]J. Kopf, M. F. Cohen, D. Lischinski and M. Uyttendaele, "Joint bilateral upsampling," In ACM Transactions on Graphics (Proc. SIGGRAPH 07), pp.96, 2007.
    [102]F. Tombari, S. Mattoccia and L. D. Stefano, "Segmentation-based adaptive support for accurate stereo correspondence," in Proceedings of the 2nd Pacific-Rim Symposium on Image and Video Technology, pp.427-438,2007.
    [103]N. Y. C. Chang, T. H. Tsai, B. H. Hsu, Y. C. Chen, and T. S. Chang, "Algorithm and architecture of disparity estimation with mini-census adaptive support weight," IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no.6, pp.792-805,2010.
    [104]Z. Gu, X. Su, Y. Liu, and Q. Zhang, "Local stereo matching with adaptive support-weight, rank transform and disparity calibration," Pattern Recognition Letters, vol.29, no.9, pp.1230-1235,2008.
    [105]L. De-Maeztu, A. Villanueva and R. Cabeza, "Stereo matching using gradient similarity and locally adaptive support-weight," Pattern Recognition Letters, vol. 32, no.13, pp.1643-1651,2011.
    [106]江静,张雪松.基于计算机视觉的深度估计方法.光电技术应用,vol.26,no.1, pp.51-55,2011.
    [107]M. Herbert, "Active and passive range sensing for robotics," in Proceedings of the IEEE International Conference on Robotics and Automation, vol.1, pp.102-110, 2000.
    [108]郑南宁.计算机视觉与模式识别.北京:国防工业出版社,1998.
    [109]G. Egnal and R. P. Wildes, "Detecting binocular half-occlusions:empirical comparisons of five approaches," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, no.8, pp.1127-1133,2002.
    [110]S. Birchfield and C.Tomasi, "Depth discontinuities by pixel-to-pixel stereo," International Journal of Computer Vision, vol.35, no.3, pp.269-293,1999.
    [111]白明,庄严,王伟.双目立体匹配算法的研究与进展.控制与决策,vol.23, no. 7, pp.722-723,2008.
    [112]C. Zitnick and T. Kanade, "A Cooperative Algorithm for Stereo Matching and Occlusion Detection," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, no.7, pp.675-684,2000.
    [113]D. Scharstein and R. Szeliski. Middlebury stereo evaluation-version 2, http://vision.middlebury.edu/stereo/
    [114]Y. Boykov, O. Veksler, and R. Zabih. "A variable window approach to early vision," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, no.12, pp.1283-1294,1998.
    [115]O. Veksler, "Stereo correspondence with compact windows via minimum ratio cycle," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, no.12, pp.1654-1660, 2002.
    [116]O. Veksler, "Fast variable window for stereo correspondence using integral images," in Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, vol.1, pp.556-561,2003.
    [117]A. Fusiello, V. Roberto, and E. Trucco, "Efficient stereo with multiple windowing," in Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition,1997, pp.858-863.
    [118]S. B. Kang, R. Szeliski, and C. Jinxjang. "Handling occlusions in dense multi-view stereo," in Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition,2001. vol.1. pp.103-110.
    [119]C. Geogoulas, L. Kotoulas, G. Sirakoulis, Ch. I. Andreadis, and A. Gasteratos, "Real-time disparity map computation module," Microprocessors and Microsystems, vol.32, no.3, pp.159-170,2008.
    [120]H. Hirschmuller and D. Scharstein, "Evaluation of stereo matching costs on images with radiometric differences," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, no.9, pp.1582-1599,2009.
    [121]R. Zabih and J. Woodfill, "Non-parametric local transforms for computing visual correspondance," in Proceedings of the European Conference of Computer Vision, pp.151-158,1994.
    [122]R. Chrastek and J. Jan, "Mutual information as a matching criterion for stereo pairs of images," Analysis of Biomedical Signals and Images, vol.14, pp. 101-103,1998.
    [123]G. Egnal, "Mutual information as a stereo correspondence measure," Dept. Comput. Inf. Sci., Univ. Pennsylvania, Philadelphia, Tech. Rep. MS-CIS-00-20, 2000.
    [124]C. Fookes, A. Lamanna, and M. Bennamoun, "A new stereo image matching technique using mutual information," in Proceedings of the International Conference on Computer Graphics and Imaging, pp.168-173,2001.
    [125]C. Fookes, M. Bennamoun and A. Lamanna, "Improved stereo image matching using mutual information and hierarchical prior probabilities," in Proceedings of the 16th International Conference on Pattern Recognition, pp.937-940,2002.
    [126]J. Kim, V. Kolmogorov, and R. Zabih, "Visual correspondence using energy minimization and mutual information," in Proceedings of the 9th IEEE International Conference on Computer Vision, vol.2, pp.1033-1040,2003.
    [127]H. Hirschmuller, "Accurate and efficient stereo processing by semi-global matching and mutual information," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.2, pp.807-814,2005.
    [128]H. Hirschmuller, "Stereo processing by semi-global matching and mutual information," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30, no.2, Feb.2008.
    [129]C. L. Zitnick, S. B. Kang, M. Uyttendaele, S. Winder, and R. Szeliski, "High-quality video view interpolation using a layered representation," in SIGGRAPH,2004.
    [130]I. Sarkar and M. Bansal, "A wavelet-based multiresolution approach to solve the stereo correspondence problem using mutual information," IEEE Transactions on Systems, Man, and Cybernetics-Part B:Cybernetics, vol.37, no.4,2007.
    [131]Y. S. Heo, K. M. Lee, and S. U. Lee, "Mutual information-based stereo matching combined with SIFT descriptor in Log-chromaticity color space," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 445-452,2009.
    [132]H. Hirschmuller and D. Scharstein, "Evaluation of cost functions for stereo matching," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.1, pp.1-8,2007.
    [133]Y. S. Heo, K.M. Lee, and S. U. Lee, "Illumination and camera invariant stereo matching," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2008.
    [134]Y. S. Heo, K.M. Lee, and S. U. Lee, "Robust stereo matching using adaptive normalized cross-correlation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33. pp.807-822,2011.
    [135]D. W. Scott, "On optimal and data-based histograms," Biometrika, vol.66, no.3, pp.605-610,1979.
    [136]P. Legg, P. L. Rosin, D. Marshall and J. E. Morgan, "Improving accuracy and efficiency of registration by mutual information using Sturges histogram rule,' Proceedings of Medical Image Understanding and Analysis, pp.26-30,2007.
    [137]D. Freedman and P. Diaconis, "On the histogram as a density estimator," Probability Theory and Related Fields, vol.57, no.4, pp.453-476,1981.
    [138]D. Scharstein and C. Pal, "Learning conditional random fields for stereo," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2007.
    [139]P. Merkle, K. Muller, A. Smolic and T. Wiegand, "Efficient compression of multi-view video exploiting inter-view dependences based on H.264/MPEG4-AVC," in Proceedings of the International Conference on Multimedia and Expo (ICME), pp.1717-1720,2006.
    [140]ISO/IEC JTC1/SC29/WG11, "CE10:Multi-view video coding using view interpolation method," JVT-V080, Marrakech, Morocco, January 2007.
    [141]ISO/IEC JTC1/SC29/WG11, "Disparity vector prediction methods in MVC,' JVTV040, Hangzhou, China, October 2006.
    [142]B. Girod, A. Aaron, S. Rane and D. Rebollo-Monedero, "Distributed video coding," Proceedings of the IEEE Special Issue on Advances in Video Coding and Delivery, vol.93, no.1, pp.71-83,2005.
    [143]J. D. Areia, J. Ascenso, C. Brites and F. Pereira, "Wyner-Ziv stereo video coding using a side information fusion approach," in Proceedings of IEEE Multimedia Signal Processing (MMSP), pp.453-456,2007.
    [144]G. Petrazzuoli, M. Cagnazzo and B. Pesquet-Popescu, "High order motion interpolation for side information improvement in DVC," in Acoustics, Speech and Signal Processing (ICASSP), pp.2342-2345,2010.
    [145]C. Guillemot, F. Pereira, L. Torres, T. Ebrahimi and R. Leonardi et al., "Distributed monoview and multiview video coding," IEEE Signal Processing Magazine, vol.24, no.5, pp.67-76,2007.
    [146]T. Maugey, W. Miled, M. Cagnazzo and B. Pesquet-Popescu, "Fusion schemes for multiview distributed video coding," In European Signal Processing Conference, vol. 1,pp.559-563,2009.
    [147]X. Guo, Y. Lu, F. Wu, W. Gao and S. Li, "Distributed multiview video coding," in SPIE Visual Communications and Image Processing, vol.6077,2006.
    [148]M. Quaret, F. Dufaux, and T. Ebrahimi, "Fusion-based multiview video coding," ACM International Workshop on Video Surveillance and Sensor Networks,2006.
    [149]T. Sikora, "Trends and perspectives in image and video coding," Proceedings of the IEEE, vol.93, no.1, pp.6-17,2005.
    [150]G. Sullivan and T. Wiegand, "Video Compression-From Concepts to the H.264/AVC Standard," in Proceedings of the IEEE, vol.93, no.1, pp.18-31, 2005.
    [151]J. Dong and K.N. Ngan, "Present and Future Video Coding Standards," in Intelligent Multimedia Communication:Techniques and Applications, Springer, pp.75-124,2010.
    [152]H. Schwarz, D. Marpe, and T. Wiegand, "Overview of the scalable video coding extension of the H.264/AVC standard," IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.9, pp.1103-1120,2007.
    [153]A. Vetro, T. Wiegand, and G. J. Sullivan, "Overview of the stereo and multiview video coding extensions of the H.264/AVC standard," Proceedings of the IEEE, Special Issue on 3D Media and Displays, vol.99, no.4, pp.626-642,2011.
    [154]T. Wiegand, G.J. Sullivan, G. Bjontegaard and A. Luthra,, "Overview of the H.264/AVC video coding standard," IEEE Transactions on Circuits and Systems for Video Technology, vol.13, no.7, pp.560-576,2003.
    [155]G. J. Sullivan, J. R. Ohm, W. J. Han, T. Wiegand and T. Wiegand, "Overview of the high efficiency video coding (HEVC) standard," IEEE Transactions on Circuits and Systems for Video Technology, vol.22, no.12, pp.1649-1668,2012.
    [156]L. Yu, F. Yi, J. Dong and C. Zhang, "Overview of AVS-Video:tools, performance and complexity," in Proceedings of SPIE International Conference on Visual Communications and Image Processing (VCIP), pp.679-690,2005.
    [157]A. Kubota, A. Smolic, M. Magnor, M. Tanimoto, T. Chen and C. Zhang, "Multiview Imaging and 3DTV," IEEE Signal Processing Magazine, vol.24, no.6, pp.10-21, Nov.2007.
    [158]A. Smolic, "3D video and free viewpoint video—from capture to display," Pattern Recognition, vol.44, no.9, pp.1958-1968,2011.
    [159]S. B. Kang, R. Szeliski, and P. Anandan, "The geometry-image representation tradeoff for rendering," in Proceedings of the IEEE International Conference on Image Processing,2000.
    [160]A. Smolic, P. Merkle, K. Muller, C. Fehn, P. Kauff, and T. Wiegand, "Compression of Multi-View Video and Associated Data," in Three-Dimensional Television:Capture, Transmission, and Display, Springer,2007.
    [161]A. Smolic and P. Kauff, "Interactive 3-D Video Representation and Coding Technologies," Proceedings of the IEEE, vol.93, no.1, pp.98-110,2005.
    [162]C. Buehler, M. Bosse, L. McMillan, S. Gortler, and M. Cohen, "Unstructured lumigraph rendering," in Proceedings of ACM SIGGRAPH, pp.425-432,2001.
    [163]K. Mueller, A. Smolic, K. Dix, P. Merkle, P. Kauff, and T. Wiegand, "Reliability-based Generation and View Synthesis in Layered Depth Video," in Proceedings of the IEEE International Workshop on Multimedia Signal Processing, 2008.
    [164]A. Smolic, et al., "Coding algorithms for 3DTV--A survey," IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.11, pp.1606-1621, 2007.
    [165]J.-R. Ohm, "Stereo/multiview video encoding using the mpeg family of standards," in Proceedings of SPIE, Stereoscopic Displays and Virtual Reality Systems VI, vol.3639, pp.242-253,1999.
    [166]M. Flierl and B. Girod, "Multiview Video Compression," IEEE Signal Processing Magazine, vol.24, no.6, pp.66-76,2007.
    [167]X. San, H. Cai, J. G. Lou and J. Li, "Multiview image coding based on geometric prediction," IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.11, pp.1536-1548,2007.
    [168]P. Merkle, A. Smolic, K. Muller and T. Wiegand, "Efficient prediction structures for multiview video coding," IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.11, pp.1461-1473,2007.
    [169]Y. Kim, J. Kim, and K. Sohn, "Fast disparity and motion estimation for mufti-view video coding," IEEE Transactions on Consumer Electronics, vol.53, no.2, pp.712-719,2007.
    [170]K. Muller, P. Merkle, and T. Wiegand, "Compressing time varying visual content," IEEE Signal Processing Magazine, vol.24, pp.58-65,2007.
    [171]G. J. Sullivan, "Standards-based approaches to 3D and multiview video coding," in Proceedings of SPIE Conference on Applications of Digital Image Processing XXXII,2009.
    [172]X. Guo, Y. Lu, F. Wu and W. Gao, "Inter-view direct mode for multiview video coding", IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no.12, pp.1527-1532, 2006.
    [173]ITU-T and ISO/IEC JTC 1, "Advanced video coding for generic audiovisual services," ITU-T Recommendation H.264 and ISO/IEC 14496-10 (MPEG-4 AVC), 2010.
    [174]M. Flierl, A. Mavlankar, and B. Girod, "Motion and disparity compensated coding for multi-view video," IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.11, pp.1474-1484,2007.
    [175]K. Yamamoto, et al., "Multiview video coding using view interpolation and color correction," IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.11, pp.1436-1449,2007.
    [176]P. Merkle, A. Smolic, K. Muller, and T. Wiegand, "Multi-view video plus depth representation and coding," in Proceedings of the IEEE International Conference on Image Processing(ICIP2007), vol.1, pp. Ⅰ.201-1.204,2007.
    [177]K. Muller, P. Merkle, and T. Wiegand, "3-D video representation using depth maps," Proceedings of the IEEE, vol.99, no.4, pp.643-656,2011.
    [178]胡晓飞,朱秀昌.从单视点DVC到多视点DVC的研究进展.中国图象形学报,vol.14, no.10, pp.1925-1934,2009.
    [179]D. Slepian and J. K. Wolf, "Noiseless coding of correlated information sources,' IEEE Transactions on Information Theory, vol.19. no.4, pp.471-480,1973.
    [180]A. D. Wyner and J. Ziv, "The rate-distortion function for source coding with side information at the decoder," IEEE Transactions on Information Theory, vol.22, no. 1,pp.1-10,1976.
    [181]F. Dufaux, W. Gao, S. Tubaro, and A. Vetro, "Distributed video coding:Trends and perspectives," EURASIP Journal on Image and Video Processing, vol.2009, 2009.
    [182]A. Aaron, R. Zhang, and B. Girod, "Wyner-Ziv coding of motion video," in Proceedings of the Asilomar Conference on Signals, Systems, and Computers, 2002.
    [183]A. Aaron, S. Rane, R. Zhang, and B. Girod, "Wyner-Ziv coding for video: Applications to compression and error resilience," In Proceedings of the IEEE Data Compression Conference,2003, pp.93-102.
    [184]R. Purit and K. Ramchandran, "PRISM:A new robust video coding architecture based on distributed compression principles," in Proceedings of the Allerton Conference on Communication, Control and Computing,2002.
    [185]F. Dufaux, M. Ouaret and T. Ebrahimi, "Recent advances in multi-view distributed video coding," in Proceedings of SPIE Mobile Multimedia/Image Processing for Military and Security Applications, vol.6157, pp.1-11.2007.
    [186]X. Artigas, E. Angeli, and L. Torres, "Side information generation for multiview distributed video coding using a fusion approach," in Proceedings of the 7th Nordic Signal Processing Symposium (NORSIG'06),2006, pp.250-253.
    [187]X. Artigas, F. Tarres and L. Torres, "A comparison of different side information generation methods for multiview distributed video coding," in Proceedings of the International Conference on Signal Processing and Multimedia Applications (SIGMAP),2007, pp.450-455.
    [188]Y. Li, X. Ji, D. Zhao and W. Gao, "Region-based Fusion Strategy for Side Information Generation in DMVC," in Proceedings of SPIE-IS&T Electronic Imaging, vol.6822,2008.
    [189]L. Stelmach, W. J. Tam, D. Meegan and A. Vincent, "Stereo image quality:effects of mixed spatio-temporal resolution," IEEE Transactions on Circuits and Systems for Video Technology, vol.10, mo.2, pp.188-193,2000.
    [190]T. Schierl and S. Narasimhan, "Transport and Storage Systems for 3-D Video Using MPEG-2 Systems, RTP, and ISO File Format," Proceedings of the IEEE, vol.99, no.4, pp.671-683,2011.
    [191]A. Vetro, A. M. Tourapis, K. Muller and T. Chen, "3D-TV Content Storage and Transmission," IEEE Transactions on Broadcasting, vol.57, no.2, pp.384-394, 2011.
    [192]F. Brandi, R. L. D. Queiroz, and D. Mukherjee, "Super-resolution of video using key frames and motion estimation," in Proceedings of IEEE International Conference on Image Processing(ICIP), pp.321-324,2008.
    [193]B. C. Song, S. C. Jeong and Y. Choi, "High-resolution image sealer using hierarchical motion estimation and overlapped block motion compensation," IEEE Transactions on Consumer Electronics, vol.56, no.3, pp.1579-1585,2010.
    [194]Y. K. Chen, A. Vetro, H. Sun, and S. Y. Kung, "Frame-Rate Up-conversion using transmitted true motion vectors," in Proceedings of the 2nd IEEE International Workshop on Multimedia Signal Processing, pp.622-627,1998.
    [195]H. Y. Shum and S. B. Kang, "A review of image based rendering techniques," in Proceedings of SPIE International Conference on Visual Communications and Image Processing (VCIP), pp.2-13,2000.
    [196]HHI 3DV ftp site. Depth maps of bookarrival [online]. Available FTP: ftp.hhi.de/HHIMPEG3DV/
    [197]S. Zhu and K. K. Ma, "A new diamond search algorithm for fast block-matching motion estimation," IEEE Transactions on Image Processing, vol.9, no.2, pp. 287-290,2000.
    [198]D. Comaniciu and P. Meer, "Mean shift:a robust approach toward feature space analysis," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no.5, pp.603-619,2002.
    [199]A. Klaus, M. Sormann and K.F. Kamer, "Segment-Based Stereo Matching Using Belief Propagation and a Self-Adapting Dissimilarity Measure," in Proceedings of the International Conference on Pattern Recognitions.2006, vol.3, pp.15-18.
    [200]J. Lu, G. Lafruit and F. Catthoor, "Anisotropic local high-confidence voting for accurate stereo correspondence," in Proceedings of SPIE-IS&T Electronic Imaging, vol.6812,2008, pp.605822-1-605822-10.
    [201]苏秉华,金伟其,牛丽红,刘广荣.超分辨率图象复原及其进展.光学技术.vol.27, no.1, pp.6-9,2001.
    [202]R. Driggers, K. Krapels and S. Young, "The Meaning of Super-Resolution," in Proceedings of the SPIE 5784, pp.103-105,2005.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700