视频点播系统中的视频检索研究

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

视频点播系统中的视频检索研究

详细信息本馆镜像全文| 推荐本文 | | 获取CNKI官网全文

英文题名：Research on Video Retrieval for VOD System
作者：闫君飞
论文级别：博士
学科专业名称：网络传播系统与控制
中文关键词：视频点播 ; 基于内容的视频检索 ; 图像分割 ; 置信域图切分 ; 尺度不变特征变换
英文关键词：video on demand (VOD) ; content based video retrieval (CBVR) ; image segmentation ; trust region graph cuts (TRGC) ; scale-invariant feature transform (SIFT)
学位年度：2008
导师：吴刚
学科代码：081203
学位授予单位：中国科学技术大学
论文提交日期：2008-05-01

摘要

视频点播(Video on Demand,VOD)是一种基于流媒体技术而实现的网络多媒体应用,多年来受到视频领域学者们的广泛关注。在点播系统中,用户希望以自适应的方式消费视频媒体,在任何时间、任何地点、以任意的方式消费视频媒体。基于内容的视频检索(content based video retrieval,CBVR)对于满足点播系统的用户需求有着突出的作用。然而,传统基于内容的视频检索方法存在着一定的缺陷,难以取得理想的效果。其中主要的问题是,以视频片段作为查询输入的查询方式难以普遍满足用户个性化的需求;检索系统中提取的视频特征相关性差,通用性不足,难以准确概括用户所需要的语义信息。
     图像分割是视频语义提取的一项关键技术,本文提出一种用于视频检索的图像分割方法。利用给定的示例图像感兴趣区域的色彩信息,估计待分割图像前景和背景的色彩统计模型,对每个像素计算其与前景/背景的相似性,结合目标前景的直方图匹配以及自然分割的对比度描述,得出图像分割框架进行优化求解。由于对待分割图像的每个像素进行了色彩似然性的刻画,所以能够克服光照变化、前景色块比例变化、前景尺寸变化等因素对分割准确性的影响。试验结果表明,与仅考虑直方图匹配的分割方法相比,本方法具有更好的普适性,能够有效地用于视频检索系统。
     在关键帧分割算法的基础上,提出一种用于视频点播系统的视频检索方法,用户提交单帧图片中的感兴趣部分作为查询输入,服务器据此对存储的视频摘要的关键帧数据进行分割,判断帧数据与查询图片的相似性,并将结果返回给用户选择播放。用户查询输入可以来自影片的海报宣传画、影片花絮镜头等等,由于仅对用户感兴趣的部分进行处理,所以对背景(非感兴趣区域)的色彩无须约束。试验结果表明此方法能够准确定位到用户期望的查询结果,在检索所需源视频的同时,检索到查询输入的相关同类视频,适用于视频点播系统。
     现有视频检索系统中对于视频的特征提取多采用低级特征(如颜色、纹理等),通用性不足,难以得到理想的的检索效果。本文提出一种基于尺度不变特征(scale-invariant feature transform,SIFT)的视频检索方法,用户提交单帧图片中的感兴趣部分作为查询输入,利用区域内部的尺度不变特征点,得到查询输入与目录服务器中视频摘要片断的数据帧之间特征点的匹配,在此基础上提出一种视频排序方法,对用户查询的视频数据进行定位和播放。试验结果表明该方法能够准确的找到查询输入的源视频与相关同类视频,不受前景目标旋转、尺度变换的影响,对光照条件的变化不敏感,可以有效应用于视频点播系统。
Video on demand (VOD) is an important multimedia application based on streaming media technique. It attracts wide attention from the scholars in the field of video research. Users in video system want to consume video media in a self-adaptive way, and consume it anytime, anywhere and with any format. Content based video retrieval (CBVR) exerts outstanding effects in satisfying consumers' demands in VOD system. Unfortunately, traditional CBVR methods have some shortcomings. The main shortcoming is that the query form of video segment can not satisfy the universal and individual demands of users, and the video character extracted from the retrieval system can not represent the semantic information of users because of its lack of relativity and universality. This problem cumbers CBVR to achieve ideal performance. In this dissertation, all proposed methods are to solve this problem.
     Image segmentation is a key technique of the extraction of video semantics. In chapter 3, a novel image segmentation method for video retrieval is proposed. In this method, we first use the color information of the interesting region in example image to estimate the color distribution model of the to-be-segmented image's foreground and background. Then for each pixel, we estimate its similarity to the foreground and background. By integrating the pixel estimation with the histogram matching and contrast description of the objective foreground, we construct a novel graph-cut optimization framework. Compared with other algorithms, due to the depiction of each pixel's color likelihood, our method is more robust for the variety of illumination and the alteration of scale or color proportion of foreground. Experiments show that our method is more effective than the traditional histogram matching algorithm, and more competent for video retrieval.
     In chapter 4, we first point out the shortcomings of the traditional video retrieval format of video segment. Then, based on the algorithm to segment key frame proposed in chapter 4, we propose a novel type for video-on-demand (VOD). The user uses the interested part of single image as the query. The system server, which stores the video summary in the whole system, segments the frames in video summaries according to the query, and computes the distance between the image and the query. Furthermore, the server can localize and play the video that the user requested. The query can come from the poster or from the frame itself, since it only handles the region of interest (ROI), no constraints are required for the color of the background (out of ROI). Experiments show that it is a novel and effective style for video retrieval which can localize the source video and the congeneric video of the query accurately. This method can be applied in the VOD system.
     Existing video retrieval system always extract low level character (such as color, texture etc.) of video system. It results in incompetent effect of retrieval system. In chapter 5, we proposed a video retrieval method using scale-invariant feature transform (SIFT) based on the query format of interested part of single image that proposed in chapter 4. The user uses the interested part of a single image as the query. The system server, which stores all the video summaries, uses the scale-invariant feature transform (SIFT) to match the input query with the ones in the frames of video summaries. Then, the server can localize and retrieve the video that the user requested. Experiments show that this method can localize the source video and the congeneric video of the query accurately which is invariant to foreground scaling and rotation, and partially illumination invariant. This method can be applied in VOD system effectively.

引文

1. Xiang Z, Zhang Q, Zhu W, et al. 2004. Peer-to-peer based multi-media distribution service [J]. IEEE Transactions on Multimedia, 6(4): 343-355.

    2. Dan A, Sitaram D, Shahabuddin P. 1994. Scheduling policies for an on-demand video server with batching. Proc. of ACM Multimedia, San Francisco, California. 15-23.

    3. Chan S H G, Tobagi F. 2001. Distributed servers architecture for networked video services [J]. IEEE/ACM Transactions on Networking, 9(2): 125-136.

    4. Almeida J, Eager D, FERRIS M. 2002. Provisioning content distribution networks for streaming media [A]. IEEE INFOCOM 2002, Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies [C]. 1746-1755.

    5. Eager D, Ferris M, Vernon M. 2000. Optimized caching in systems with heterogeneous client populations [J]. Performance Evaluation, Special Issue on Internet Performance Modeling, 42(2):163-185.

    6. Choi D, Biersack E, Keller U G. 2002. Cost-optimal dimensioning of a large scale video on demand system [A]. Proceedings of the 4th International Workshop on Networked Group Communication[C]. 29-36.

    7. Rejaie R, Handley M, Yu H, et al. 1999. Proxy caching mechanism for multimedia playback streams in the internet [A]. In: Proceedings of the 4th International Web Caching Workshop[C]. San Diego. CA: [s. n.].

    8. Padmanabhan V, Wang H, Chou P, Sripanidkulchai K. 2002. Distributing streaming media content using cooperative networking [C]. Proc. of NOSS2 DAV.

    9. Gadde S, Chase J, Rabinovich M. 2000. Web caching and content distribution: a view from the interior [A]. In: Proc. of the 5th international web caching and content delivery workshop [C]. Lisbon, Portugal: [s. n.].

    10. Wu Jiangxing. 2002. China's High Performance Broadband Information Network [J]. Communication world.

    11. Little T, Venkatesh D. 1994. Prospects for interactive video-on-demand [J]. IEEE Multimedia Mag. 14-24.

    12. Tobagi F. A.. 1995. Distance learning with digital video [J]. IEEE Multimedia Mag. 90-94.
    13. Sutherland J., Litteral L.. 1992. Residential video services [J]. IEEE Commun. Mag. 36-41.

    14. Interactive TV (ITV). Hong Kong: CWHKT.

    15. Deloddere D., Verbiest W., and Verhille H.. 1994. Interactive video on demand [J]. IEEE Commun. Mag. 32(5): 82-88.

    16. Golubchik L, Muntz R R, Chou C F, et al. 2001. Design of fault-tolerant large-scale VOD servers: With emphasis on high-performance and low-cost [J]. IEEE Transactions on Parallel and Distributed Systems, 12(4): 363 -386.

    17. Zhao Yanping, Eager D. L. Vernon, M.K.. 2007. Scalable On-Demand Streaming of Nonlinear Media [J] IEEE/ACM Transactions on Networking! 15(5):1149-1162.

    18. Choe Y. R., Douglas C, Pai V. S.. 2007. A Model and Prototype of a Resource-Efficient Storage Server for High-Bitrate Video-on-Demand [C]. IEEE International conference on Parallel and Distributed Processing Symposium. California, USA. 1-7.

    19. Colby L. 1995. Overview of interactive TV from the viewpoint of the cable TV settop converter's RF modem [C]. Compcon '95. 'Technologies for the Information Superhighway' San Francisco, CA. 200-202.

    20. Lan Xuguang, Zheng Nanning, Xue Jianru, Gao Bin, Wu Xiaoguang. 2007. Adaptive VoD Architecture for Heterogeneous Networks Based on Scalable Wavelet Video Coding [J]. IEEE Transactions on Consumer_Electronics, 53(4): 1401-1409.

    21. Thouin F., Coates M..2007 Video-on-Demand Networks: Design Approaches and Future Challenges[J]. IEEE on Network. 21(2): 42-48.

    22. Wang Daoyi, Liu Yuanan. 2007. A New Architecture of Merged On-Demand Application Network [C]. 2nd International Conference on Pervasive Computing and Applications. Birmingham, UK. 652-655.

    23. Songan Yang, Hua Yang, Yuhan Yang. 2003. Architecture of high capacity VOD server and the implementation of its prototype. IEEE Trans. on Consumer electronics. 49(4): 1169-1177.

    24. Deloddere D., Verbiest W. and Verhille H.. 1994. Interactive video on demand [J]. IEEE Trans. on Communicmions, 32(5): 90-100.

    25. Makoto Nishio, Makio Yoshida and Sbo-ichiro Nakai. 1994. VideoExpress: A Meta-Service system for video on demand [C]. Proceedings of the 1st International Workshop on., vol. 13-14: 87-90.
    26.章毓晋.1995.多媒体技术发展和市场分析[J].计算机世界.5:3-8.
    27.Wactlar H D,Kanade T,et al.1996.Intelligent access to digital video:information project.IEEE computer,1996,29(5):46-52.
    28.章毓晋.2003.基于内容的视觉信息检索[M].科学出版社.北京.
    29.周洞汝等.2000.视频数据库管理系统导论[M].科学出版社.北京.
    30.庄越挺,潘云鹤,吴飞.2002.网上多媒体信息分析与检索[M]清华大学出版社.北京.
    31.Zhuang Y,Liu X,Pan Y.2000.Webscope-CBVR:A custom-based Search Engine for video on WWW[C].Proceeding of IS&T and SPIE Image and Video Communications and Processing
    32.M.Stricker and M.Orengo.1995.Similarity ofcotor images[C].Proc.of the SPIE Conference on storage and retrieval for image and video database.381-392.
    33.Smith J.R.and Chang S.F..1995.Single color extraction and image query[C].Proc.Of IEEE International Conference on Image Processing.80-88.
    34.Faloutsos C.,Flickner M.,Niblack W.,Petkovic D.,Equitz W.and Barber R..1993.Efficient and effective querying by image content.Technical RePort.IBM Research Report.
    35.Lu H.,Ooi B.and Tan K..1994.Efficient image retrieval by color contents[C].Proc.of the 1994 International Conference on Applications of Databases.
    36.Haralick R.,Shanmugan K.and Dinstein I..1973.Texture features for image classification [J].IEEE Transactions on System Management and Cybernetics,SMC-3(6).610-621.
    37.Tamura H.,Mori S.,and Yamawaki T..1978.Texture features corresponding to visual perception[J].IEEE Transactions on System Management and Cybemetics,SMC 8(6).
    38.Rui Y.,Shi A.C.and Huang T.S..1996.Modified fourier descriptors for shape representation:A practical approach[C].Proc.of the First International Workshop on Image Database and Multimedia Seareh.456-461.
    39.Hu M.K..1962.Visual pattern recognition by moment invariants[J].IEEE Transactions on Information Theory,vol 8.459-464.
    40.Lian N.X.,Tan Y.P.and Chan K.L..2003.Efficient Video Retrieval Using Shot Clustering and Alignment[C].Proc.of the 4~(th)International Conference on Information,Communications & Signal Processing and Fourth Pacific-Rim Conference on Multimedia:1801-1805.
    41.Snoek C.G.M.and Worring M..2005.Multimodal Video Indexing:A Review of the State-of-the-art [J]. Multimedia Tools and Applications, 25(1): 5-35.

    42. Choudhary T., Clarkson B., Jebara T. and Pentland A.. 1998. Multimodal person recognition using unconstrained audio and video [C]. In Proc. Int. Conf. Audiovisual Biometic Person Authentication.

    43. Rehg J., Murphy K. and Fieguth P.. 1999. Vision-based speaker detection using Bayesian networks [C]. Proc. ComPut. Vision Pattern Recognition, Fort Collins, CO, 2:110-116.

    44. NaPhade M. R.and Huang T. S.. 2002. Extracting Semantics From Audiovisual Content: The Final Frontier in Multimedia Retrieval [J]. IEEE Trans on Neural Networks, 13(4): 793-810.

    45. Furui S., Kikuchi T, Shinnaka Y. and Hori C. 2004. Speeeh-to-text and speech-to-Speech summarization of spontaneous speech [J]. IEEE Transactions on Speech and Audio processing, 12(4): pp.401-408,2004.

    46. Sato T., Kanade T, Hughes E., Smith M. A. and Satoh S.. 1998. Video OCR: Indexing digital news libraries by recognition of superimposed caption [J]. ACM Multimedia Systems Special Issue on Video Libraries. 385-395.

    47. Hauptmann A. G. and Wei-Hao Lin. 2001. Beyond the Informedia digital video library: video and audio analysis for remembering conversations [C]. IEEE Workshop on Automatic Speech Recognition and Understanding. 296-300.

    48. Chang S. F., Chen W., Meng H., Sundaram H. and Zhong D. 1997. VideoQ: an automated content-based video search system using visual cues [C]. ACM 5th Multimedia Conference, Seattle, WA. 313-324.

    49. Christel M., Kanade T., Mauldi M., et al. 1995. Informedia digital video library [J]. Communications of the ACM, 38(4): 23-34.

    50. http://www.informedia.cs.cmu.edu

    51. Meng J., Chang S F. 1996. A compressed video editing and Parsing system [C]. In: Proceedings of ACM Multimedia 96 Conference, Boston, MA. 43-53.

    52. http://www.ctr.colunibia.edu/VideoQ

    53. http://www.ctr.columbia.edu/webseek

    54. Smith J R, Chang S F. 1997. Visually searching the web for content [J]. IEEE Multimedia Magazine, Summer, 1997, 4(3): 12-20.

    55. Flickner M, Sawhney H S, et al. 1995. Query by image and video content: the QBIC system [J]. IEEE computer, 28(9):23-32.
    56. Marco L, Edoardo A. 1996. JACOB: just a content-based query system for video databases [C]. Proc. ICASSP-96, Atlanta, GA.

    57. Multimedia Description Schemes group. 2001. Text of 15938-5 FCD Information Technology-Multimedia Content Description Interface-Part 5 Multimedia Description Schemes. ISO/IEC JTC 1/SC29/WG11/N3966. SingaPore.

    58. Servetto S, RuiY. 1998. A Region-based representation of images in MARS [C]. Special Issue on Multimedia Signal Processing. Journal on VLSI Signal Proeessing.

    59. Li Ying, Jay Kuo. 2002. Unsupervised real-time speaker identification for daily movies [C]. In Proceedings of SPIE, the International Society for Optical Engineering, 4862: 151-162.

    60. Kittler J., Messer K.. 2001. Generation of semantic cues for sports video annotation [C]. IEEE International Conference on Image Processing, 3: 26-29.

    61. Alexander Hauptmann, Rong Jin. 2001. Video retrieval with the informedia digital video library system [C]. In Proceedings of the 10th text retrieval confence, Gaithersburg, Maryland: 78-84.

    62. 黄知义,周宁. 2005. 基于内容视频检索的关键技术研究[J]. 现代情报. 10:126-129.

    63. Masihi Z.G., Charkari N.M.. 2005. Content based Video Retrieval based on Approximate String Matching [C]. The International Conference on Computer as a Tool(EUROCON), 2: 1300-1303.

    64. Lee A.J.T., Ruey-Wen Hong, Meng-Fang Chang. 2004. An approach to content-based video retrieval [C]. IEEE International Conference on Multimedia and Expo (ICME), 1: 273-276.

    65. Lu Hong, Ooi Beng Chin, Shen Heng Tao, et al. 2006. Hierarchical Indexing Structure for Efficient Similarity Search in Video Retrieval [J]. IEEE Transactions on Knowledge and Data Engineering, 18(11): 1544-1559.

    66. Yuan Junsong, Tian Qi, Ranganath Surendra. 2004. Fast and robust search method for short video clips from large video collection. In: Proceedings of the 17th International Conference on Pattern Recognition (ICPR'04).

    67. Lienhart R., et al. 1997. On the detection and recognition of television commercials [C]. In: Proceedings of IEEE Conference on Multimedia Computing and Systems. 509-516.

    68. Juan M. S. et al. 2002. Shot partitioning based recognition of TV commercials [J]. IEEE Transactions on Multimedia Tools and Applications, 18: 233-247.

    69. Anil K. Jain, Aditya Vailaya Xiong Wei. 1999. Query by video clip [J]. Multimedia system, 7(5):369-384.
    70.刘阳,许松涛,吴志美等.2003.一种分级检索MPEG视频的方法[J].软件学报,14(3):675-681.
    71.Kim Y.T.,Chua T.S..2005.Retrieval of news video using video sequence matching[C].In:Proceedings of the11th International Multimedia Modelling Conference(MMM'05):68-75.
    72.Li Y,Sun J,Tang C -K,et al.2004.Lazy Snapping[J].In ACM Proceedings of SIGGRAPH '04,also ACM Transactions on Graphics,22(3):303-308.
    73.Rother C,Kolmogorov V,Minka T,et al.2006.Cosegmentation of Image Pairs by Histogram Matching—Incorporating a Global Constraint into MRFs[C]//IEEE Conference on Computer Vision and Pattern Recognition,1:993-1000.
    74.Lowe D G.2004.Distinctive Image Features from Scale-Invariant Keypoints[J].International Journal of Computer Vision,60(2):91-110,code available at http://www.cs.ubc.ca/～lowe/keypoints.
    75.彭宇新,NgoChong-wah,董庆杰等.2003.一种通过视频片段进行视频检索的方法[J].软件学报,14(8):1409-1417.
    76.Rui Y.,Hunag T.S.,Mehrotra S..1999.Constructing Table-of-Content for Videos[J].ACM Journal of Multimedia Systems,7(5).
    77.刘桂清.2004.视频摘要技术的研究与实现(D):[博士].长沙:国防科学技术大学.
    78.朱曦,林行刚.2004.视频镜头时域分割方法的研究[J].计算机学报,27(8):1027-1034.
    79.Choubey S.K.,Raghavan V.V..1997.Generic and fully automatic content-based image retrieval using color[J].Pattern Recognition Letters,1(8):1233-1240.
    80.Srinivasan M.V.,Venkatesh S.,Hosie R.1997.Qualitative estimation of camera motion parameters from video sequence[J].Pattern Recognition 1997,30(4):593-606.
    81.Fernando W.A.C.,Canagarajah C.N.,Bull D.R..1999.Video segmentation of classification for content based storage and retrieval using motion vectors[J].SPIE,3656:687-698.
    82.Zhang H J.1995.Video Parsing,retrieval and browsing:An integrated and content based solution[C].In:Proceedings of ACM Multimedia'95,SanFranciseo,CA.15-24.
    83.Zabin R,Miller J and Mai K.1995.Feature-based algorithms for detecting and Classifying scene breaks[C].In:Proceedings of 4ICM.97-103.
    84.Zhang HJ,Karkanhalli,Smoliar S.1993.Automatic partitioning of video[J].Multimedia Systems,1(1):10-28.
    85.Zhang HJ,Wu JH,Zhong D,et al.1997.An integrated system for content-based video retrieval and browsing[J].Pattern Recognition,30(4):643-657.
    86.Meng JH,Juan YJ,ChangSF.1995.Scene change detection in a MPEG compressed video sequence[C].In:Proceedings of IS&T/SPIE,Conference on Multimedia Computing and Networking.San Jose,CA,2417:180-191.
    87.Truong B.T.,Dorai C.,Venkatesh S.2000.Improved fade and dissolve detection for reliable video segmentation[C].In:Proceedings of IEEE International Conference on Image Processing(ICIP2000),Vancouver,BC,Canada,3:961-964.
    88.曹莉华.1998.视频媒体的基于内容处理和检索的研究与实现(D):[博士].长沙:国防科学技术大学七系.
    89.Ngo CW,Pong TC,Chin RT.2001.Video Partitioning by temporal slice coherency[J].IEEE Transactions on Circuits and Systems for Video Technology,11(8):941-953.
    90.John R,Kender and Boon Lock Yeo.1998.Video Scene Segmentation Via Continuous Coherence[C].1998 Conference On Computer Vision And Pattern Recognition,Santa Babara.
    91.Lin Tong and Zhang Hong Jiang,2000.Automatic Scene Extraction By Shot Grouping[C].15~(th)International Conference On Pattern Recognition,Spain.2-8.
    92.Pferffer Silvia,Lienhart Rainer and Effelsberg Wolfgang.1998.Scene Determination Based On Video And Zudio Features[C].Proeeeding Of The IEEE Conference On Multimedia Computing And System,1554:53-58.
    93.Wang Jihua,Chua Tat Seng and Chen Liping.2001.Cinematic Based Model For Scene Boundary Detection[C].Proc.of Multimedia Modeling Conference,Amstersam,Netherlands.
    94.Swanberg D,Shu C F,Jain R.1993.Knowledge guided Parsing in video databases[C].In:Proceedings of SPIE Storage and Retrieval for Image and Video Databases(1908),San Jose,CA,USA.13-21.
    95.Boykin S,Merlino A.2000.Machine learning of event segmentation for news on demand[J].Communications of the ACM,43(2):35-41.
    96.徐骏,周晓峥,于俊清等.2003.基于事件流的新闻视频场景分割方法[J].计算机辅助设计与图形学学报,15(2):228-232.
    97.Lu Hong,Tan Ya-peng.2003.An unsupervised approach to dominant video scene clustering [C].In:Proeeedings of IEEE International Symposium on Circuits and Systems(ISCAS'03),Bangkok,Thailand.680-683.
    98.程文刚,须德,郎从妍.2004.一种有效的视频场景检测方法.中国图像图形学报,9(8):984-990.
    99.Yeung M M,Yeo B L,Liu B.1998.Segmentation of video by clustering and graph analysis [J].ComPuter Vision and Image Understanding,71(1):94-109.
    100.Ngo Chong-Wah,Ma Yu-Fei,Zhang Hong-Jiang.2005.Video Summarization and Scene Detection by Graph Modeling[J].IEEE Transactions on Circuits and Systems for Video Technology.15(2):296-305.
    101.王东辉,朱森良,吴春明.2001.基于时序结构图的视频流描述方法[J].计算机学报.24(9):944-950.
    102.Zhu X Q,Fan J P,KE Ahmed.2003.Hierachical video content description and summarization using unified semantic and visual similarity[J].Multimedia Systems.9:31-53.
    103.Boreczky JS,Wilcox LD.1997.A hidden Markov model framework for video segmentation using audio and image features[C].In:Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing,Seattle,WA,USA.3741-3744.
    104.Iyengar G,Lippman A.1998.Models for automatic classification of video sequences[C].In:Proceedings of SPIE Storage and Retrieval for Image and Video Databases,SanJose,CA,USA.3312-3334.
    105.朱映映,周洞汝.2004.一种基于视频聚类的关键帧提取方法[J].计算机工程.30(4):12-13.
    106.Wolf W.1996.Key frame selection by motion analysis[C].In:Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing.
    107.Ngo C.W.,Pong T.C.,Zhang H.J.,et al.2000.Motion-based video representation for scene change detection[C].In:Proceedings of the ICPR 2000.Barcelona,Spain.
    108.Zhuang Y T,Rui Y,HuangTS,et al.1998.Adaptive key frame extraction using unsupervised clustering[C].In:Proceedings of IEEE International Conference on Image Processing.
    109.Gresle P O,Huang T S.1997.Gisting of video documents:A key frames selection algorithm using relative activity measure.[C]In:The 2~(nd)International Conference on Visual Information Systems.
    110.陈剑赟,老松扬,吴玲达.2003.视频摘要[J].中国图像图形学报.07.
    111.Lienhart R,Pfeiffer S,Effelsberg W.1997.Video abstracting[J].Communications of ACM [J].40(12):55-62.
    112.Li Ying,Zhang Tong,Daniel Tretter.2001.An overview of video abstraction techniques [R/OL].In:HP Corp Technology Report,HPL-2001-191,20010809,External.http://www.hplhp.Com/techreports/2001/HPL-2001-191.html.
    113.Lienhart R.2000.Dynamic video summarization of home video[A].In:Proceedings of SPIE Storage and Retrieval for Media Database 2000[C],San Jose,CA,USA.3972:378-389.
    114.Russell D M.2000.A design pattern-based video summarization technique:moving from low-level signals to high-level structure[A].In:Proceedings of the 33rd Hawaii International Conference on System Sciences[C],Maui,Hawaii,USA.3:3048.
    115.Smith J.R.,and Chang S.F..1995.Tools and Techniques for Color Image Retrieval[C].Proceedings of SPIE Storage and Retrieval for Image and Video Data.
    116.Pass G.,Zabih R.and Miller J..1996.Comparing Images Using Color Coherence Vectors [C].Proceedings of ACM International Conference on Multimedia,Boston,MA.65-73.
    117.Huang J.,Kumar S.,Mitra M.,Zhu W.J.and Zabin R..1997.Image Indexing Using Color Correlogram[C].Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition.
    118.Chang S.F.,Puri A.,Sikora T.and Zhang H.J..2001.Introduction to the Special Issue on MPEG-7[J]IEEE Transactions on Circuits and Systems for Video Technology.11(6).
    119.Martinez J.M..2002.Overview of the MPEG-7 Standard(v8.0).ISO/IECJTC1/SC29/WG11,N4980.
    120.Manjunath B.S.,Ohm J.R.,Vasudevan V.V.and Yamada A..2001.Color and Texture Descriptors[J].IEEE Transactions On Circuits and Systems for Video Technology.11(6):703-715.
    121.Smith J.R.and Chang S.F..1996.Automated Binary Texture Feature Sets for Image Retrieval[C].Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing.
    122.Ma W.Y.and Manjunath B.S..1995.A Comparison of Wavelet Transform Features for Texture Image Annotation[C].Proceedings of IEEE International Conference on Image Processing.
    123.Canny J.1986.A computational approach to edge detection[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,18(8):679-698.
    124.Pentland A.,Picard R.W.and Sclaroff S..1996.Photobook:Content-based Manipulation of Image Databases[J].International Journal of Computer Vision.
    125.Arkin E.M.,Chew L.,Huttenlocher D.,Kedem K.and Mitchell J..1991.An Effciently Computable Metric for Comparing Polygonal Shpaes[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,13(3).
    126.Chuang G.C.H.and Kuo C.C.J..1996.Wavelet Descriptor of Planar Curves:Theory and Applications[J].IEEE Transactions on Image Processing,5(1):56-70.
    127.于跃龙.2005.视频语义信息提取关键技术研究(D):[博士].长沙:国防科学技术大学.
    128.Marichal X.and Villegas P..2000.Objective evaluation of segmentation masks in video sequences[C].In Proceedings of X European Signal Processing Conference(EUSIPCO),(Tampere,Finlnad).2193-2196.
    129.Gu C.and Lee M.C..1998.Semiautomatic segmentation and tracking of semantic video objects[J].IEEE Transactions on Circuits and Systems for Video Technology.8(5):572-584.
    130.Castagno R.,Ebrahimi T.and Kunt M..1998.Video segmentation based on multiple features for interactive multimedia applications[J].IEEE Transactions on Circuits and System for VideoTechnology.8:562-571.
    131.Chalom E.and Bove V..1996.Segmentation of an image sequence using multi-dimensional image attributes[C].In Proceedings of International Conference on Image Processing.525-528.
    132.Marcotegui B.,Zanoguera F.,Correia P.,Rosa R.,Marques F.,Mech R.and Wollborn M..1999.A video object generation tool allowing friendly user interaction[C].In Proceedings of International Conference on Image Processing.391-395.
    133.Gu L.and Bone D..1999.Skin colour region detection in MPEG video sequences[C].In Proc.of 10~(th)International Conference on Image Analysis and Processing,(Venice,Italy).898-903.
    134.Mech R,Wollborn M.1998.Automatic segmentation of moving objects(partial results of core experiment N2) .ISO/ IEC JTC1/SC29/WG11 MPEG98. 31-87.

    135. Mech R, Wollborn M. 1998. A noise robust method of 2D shape estimation of moving objects in video sequences considering a moving camera [J]. Signal Processing. 66(2): 203-217.

    136. Chang M M, Sezan M I, Tekalp A M. 1994. An algorithm for simultaneous motion estimation and scene segmentation [C]. Proc of the IEEE Int'l Conf on Acoust, Speech, Signal, Processing, Adelaide. 221-224.

    137. Stiller C. 1993. A statistical image model for motion estimation [C]. In: Proc of the IEEE Int'l Conf on Acoust, Speech, Signal Processing, Minneapolis, MN. 193-196.

    138. Choi JG, Lee S W, Kim S D. 1997. Spatio-temporal video segmentation using a joint similarity measure [J]. IEEE Trans on Circuits Systems Video Technology, 7(2): 279-286.

    139. Adiv G. 1985. Determining three-dimensional motion and structure form optical flow generated by several moving objects [J]. IEEE Trans on Pattern Analysis Machine Intelligence, 7(4): 384-401.

    140. Murray D W, Buxton B F. 1987. Scene segmentation form visual motion using global optimization [J]. IEEE Trans on Pattern Analysis and Machine Intelligence. 9(2): 220-228.

    141. Hotter M, Thoma R. 1988. Image segmentation based on object oriented mapping parameter estimation [J]. Signal Processing, 15(3): 315-334.

    142. Musmann H G, Hotter M, Ostermann J. 1989. Object-oriented analysis-synthesis coding of moving images[J]. Signal Processing, Image Commun, 1(2): 117-138.

    143. Diehl N. 1991. Object-oriented motion estimation and segmentation in image sequences [J]. Signal Processing, Image Communication, 3(1): 23-56.

    144. Kim M, Choi JG, Lee M H et al. 1997. Performance analysis of an ETRI's global motion compensation and scene cut detection algorithms for automatic segmentation [C]. ISO/IEC JTC1/SC29/WG11 MPEG97. 23-87.

    145. Daniilidis K., Krauss C, Hansen M. and Sommer G. 1998. Real time tracking of moving objects with an active camera [J]. J.Real-Time Imaging 1998,4: 3-20.

    146. Khan S. and Shah M.. 2000. Tracking people in presence of occlusion [C]. In Porc. Asian Conf. on Computer Vision. 1132-1137.

    147. Dockstader S. and Tekalp A.. 2001. On the tracking of articulated and occluded video object motion [J]. J.Real-Time Imaging, 7: 415-432.
    148.Wang D..1998.Unsupervised video segmentation based on watersheds and temporal tracking[J].IEEE Transactions on Circuits and Systems for Video Technology,8(5):539-546.
    149.Paragios N.and Deriche R..1999.Geodesic active regions for motion estimation and tracking[C].In Proceedings of 7~(th)International Conference on Computer Vision(ICCV).688-694.
    150.Peterfreund N..1998.Robust tracking of position and velocity with Kalman snakes[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,21(6):564-569.
    151.Meier T.and Ngan K..1998.Automatic segmentation of moving objects for video object Plane generation[J].IEEE Transactions on Circuits and Systems for Video Technology,8(5):525-538.
    152.Beymer D.,McLauchlan P.,Coifman B.and Malik J..1997.A real-time computer vision system for measuring traffic parameters,in Proceedings of Computer Vision and Pattern Recognition(CVPR).495-501.
    153.Gil S.,Milanese R.and Pun T..1994.Feature selection for object tracking in traffic scenes [C].In Proc.SPIE Int.Symposium on Smart Highways,23(44):253-266.
    154.章毓晋.2001.图像分割[M].科学出版社.北京.
    155.林开颜,吴军辉,徐立鸿.2005.彩色图像分割方法综述[J].中国图像图形学报.(01).
    156.Power W,Clist R.1996.Comparison of supervised learning techniques applied to color segmentation of fruit image[A].In:Proceeding of SPIE,Intelligent Roberts and Computer Vision.XV:Algorithms,Techniques,Active Vision,and Material Handing[C].Boston,MA,USA.370-381.
    157.Hance G A,Umbaugh S E,Moss R H,et al.1996.Unsupervised color image segmentation with application to skin tumor borders[J].IEEE Engineering in Medicine and Biology,15(1):104-111.
    158.Pal N R,Pal S K.1993.A review on image segmentation techniques[J].Pattern Recognition,26(9):1277-1294.
    159.Ohlander R,Price K,Reddy D R.1978.Picture segmentation using a recursive region splitting method[J].Computer Graphics and Image Processing,8(3):313-333.
    160.Guo G D,Yu S,Ma S D.1998.Unsupervised segmentation of color images[A].In:Proceeding of 1998 IEEE International Conference on Image Processing[C].Chicago,IL, USA. 299-302.

    161. Underwood SA, Aggarwal J K. 1977. Interactive computer analysis of aerial color infrared photographs [J]. Computer Graphics and Image Processing, 6(1): 1-24.

    162. Kurugollu F, Sankur B, Harmanci A E. 2001. Color image segmentation using histogram multithresholding and fusion [J]. Image and Vision Computing, 19(13): 915-928.

    163. CelenkM. 1990. A color clustering technique for image segmentation [J]. Computer Vision, Graphics, and Image Processing, 52(2): 145-170.

    164. Carevic D, Caelli T. 1997. Region-based coding of color image using Karhunen-Loeve Transform [J]. Graphics Models and Image Processing, 59 (1): 27-38.

    165. Bezdek J C. 1981.Pattern recognition with fuzzy objective function algorithms [M]. New York: Plenum Press

    166. Xie X L, Beni G. 1991. A validitymeasure for fuzzy clustering [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13 (8): 841-847.

    167. Lim Y W, Lee S U. 1990. On the color image segmentation algorithm based on the thresholding and the fuzzy c-means techniques [J]. Pattern Recognition, 23(9): 935-952.

    168. Liu J Z, Xie W X. 1993. An efficient pyramidal color image segmentation method with fuzzy clustering [J]. Journal of Xidian University, 20 (1): 40-46.

    169. Lin K Y, Xu L H, Wu J H. 2004. A fast fuzzy C-means clustering for color image segmentation [J]. Journal of Image and Graphics, 9(2): 159-163.

    170. Chen T Q, Lu Y. 2002. Color image segmentation--An innovative approach [J]. Pattern Recognition, 35 (2): 395-405.

    171. Michael T U, Arbib A. 1994. Color image segmentation using competitive learning [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(12): 1197-1206.

    172. Tremeau A, Borel N. 1997. A region growing and merging algorithm to color segmentation [J]. Pattern Recognition, 30 (7): 1191-1203.

    173. Cheng H D, Sun Y. 2000. A hierarchical approach to color image segmentation using homogeneity [J]. IEEE Transactions on Image Processing, 9(12): 2071-2082.

    174. Vincent L, Soille P. 1991. Watersheds in digital spaces: an efficient algorithm based on immersion simulations [J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 13 (6): 583-598.

    175. Shafarenko L, PetrouM, Kittler J. 1997. Automatic watershed segmentation of randomly textured color images [J]. IEEE Transactions on Image Processing, 6(11): 1530-1544.

    176. Shiji A, Hamada N. 1999. Color image segmentation method using watershed algorithm and contour information [A]. In: Proceeding of 1999 IEEE International Conference on Image Processing[C]. Kobe, Japan. 305-309.

    177. Geman S, Geman D. 1984. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6 (11): 721-741.

    178. Huang C L, Cheng T Y, Chen C C. 1992. Color images segmentation using scale space filter and Markov random field [J]. Pattern Recognition, 25 (10): 1217-1229.

    179. Liu J Q, Yang Y H. 1994. Multiresolution color image segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(7): 689-700.

    180. Nevatia. 1977. A color edge detector and its use in scene segmentation [J]. IEEE Transactions on Systems, Man and Cybernetics, 7(11): 820-826.

    181. Hueckel M H. 1973. A local visual operator which recognizes edges and lines [J]. Journal of Association for Computing Machinery, 20(4): 634-647.

    182. Trahanias P E, Venetsanopoulos A N. 1993. Color edge detection using vector order statistics [J]. IEEE Transactions on Image Processing, 2(2): 259-265.

    183. Tsai P, Chang C C, Hu Y C. 2002. An adaptive two-stage edge detection scheme for digital color images [J]. Real-Time Imageing, 8(4): 329-343.

    184. Carron T, Lambert P. 1995. Fuzzy color edge extraction by inference rules quantitative study and evaluation of performances [A]. In: Proceedings of the 1995 International Conference on Image Processing [C]. Washington DC, USA. 2:181-184.

    185. Pal N R, Pal S K. 1993. A review on image segmentation techniques [J]. Pattern Recognition, 26(9): 1277-1294.

    186. Sugeno M. 1977. Fuzzy measures and fuzzy integrals--a survey [A]. In: Fuzzy Automata and Decision Processes [M]. New York: North-Holland. 89-102.

    187. Pham TD, Yan H. 1999. Color image segmentation using fuzzy integral and mountain clustering [J]. Fuzzy Sets and Systems, 107(2): 121-130.

    188. Geng B Y, Lu J F, Yang J Y. 2000. An approach to color image segmentation based on fuzzy domain color and road detection [J]. Journal of Nanjing University of Science and Technology, 24(4): 353-358.
    189. Chien B C, Cheng M C. 2002. A color image segmentation approach based on fuzzy similaritymeasure [A], In: Proceedings of the 2002 IEEE International Conference on Fuzzy Systems [C]. Honolulu, H I, USA. 1: 449-454.

    190. Wang W J. 1997. New similarity measures on fuzzy sets and on elements [J]. Fuzzy Sets and Systems, 85(3): 305-309.

    191. Ito N, Kamekura R, Shimazu Y. 1996. The combination of edge detection and region extraction in nonparametric color image segmentation [J]. Information Science, 92 (2): 277-294.

    192. Huang C L. 1993. Parallel image segmentation using modified Hopfield model [J]. Pattern Recognition Letters, 13(5): 345-353.

    193. Campadelli P, Medici D, Schettini R. 1997. Color image segmentation using Hopfield networks [J]. Image and Vision Computing, 15(3): 161-166.

    194. Shafer S A. 1985. Using color to separate reflection components [J]. Color Research Application, 10(4): 210-218.

    195. D. Greig, B. Porteous, and A. Seheult. 1989. Exact maximum a posteriori estimation for binary images [J]. Journal of the Royal Statistical Society, Series B, 51 (2):271-279.

    196. Boykov Y, Kolmogorov V. 2004. An experimental comparison of mincut /max-flow algorithms for energy minimization in vision [J], IEEE Transaction on Pattern Analysis and Machine Intelligence, 26(9):1124-1137. Code available at http://www.cs.cornell.edu/People/vnk/

    197. Trust-Region Methods for Nonlinear Minimization, http://www.mathworks.com/access/helpdesk/help/toolbox/optim/index.html?/access/helpdesk/help/toolbox/optim/ug/f3137.html&http://www.google.cn/search?complete-l&hl=zh-CN &newwindow=1&rlz=1T4XNLA_zh-CNCN243CN244&q=trust+region&meta=&aq=f

    198. More, J.J. and Sorensen D.C.. 1983. Computing a Trust Region Step [J]. SIAM Journal on Scientific and Statistical Computing. 3: 553-572.

    199. Byrd, R.H., Schnabel R.B. and Shultz G.A.. 1988. Approximate Solution of the Trust Region Problem by Minimization over Two-Dimensional Subspaces [J]. Mathematical Programming. 40: 247-263.

    200. Steihaug T.. 1983. The Conjugate Gradient Method and Trust Regions in Large Scale Optimization [J]. SIAM Journal on Numerical Analysis. 20: 626-637.
    201. Coleman T.F. and Verma A.. A Preconditioned Conjugate Gradient Approach to Linear Equality Constrained Minimization, submitted to Computational Optimization and Applications.

    202. Sorensen D.C.. 1994. Minimization of a Large Scale Quadratic Function Subject to an Ellipsoidal Constraint. Department of Computational and Applied Mathematics, Rice University, Technical Report TR94-27.

    203. Rother C, Kolmogorov V. and Blake A.. 2004. "GrabCut"- Interactive Foreground Extraction using Iterated Graph Cuts [J]. In ACM Proceedings of SIGGRAPH '04, also ACM Transactions on Graphics, 23(3): 309-314.

    204. Wang Chao, Yang Qiong, Chen Mo, Tang Xiaoou and Ye Zhongfu. 2006. Progressive Cut [J]. ACM Multimedia. 251-260.

    205. Swain M, Ballard D. 1991. Color indexing [J]. International Journal of Computer Vision, 7(1):11-32.

    206. Mikolajczyk K. 2002. Detection of local features invariant to affine transformations, PhD Thesis, Institut National Polytechnique de Grenoble France.

    207. Harris C, Stephens M. 1988. A combined corner and edge detector [C]. In 4th Alvey Vision Conference, Manchester, UK, 147-151.

    208. M. Welling. 2005. Robust higher order statistics[C]. AISTATS.

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700