Research on Key Technologies for Tracking News Video Stories (新闻视频故事单元跟踪关键技术研究)

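English title: Research on Key Technologies for Tracking News Video Stories
Author: 文军 (Wen Jun)
Degree level: Doctoral
Discipline: Control Science and Engineering
Keywords: News Video; Story Segmentation; Story Correlation Analysis; Story Tracking
Degree year: 2008
Advisor: 吴玲达 (Wu Lingda)
Discipline code: 081103
Degree-granting institution: 国防科学技术大学 (National University of Defense Technology)
Submission date: 2008-09-01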

Abstract

News reports are an important carrier of information. Users typically care about a small number of specific news events, so there is an urgent need for services that automatically track news reports by event. To date, topic detection and tracking has been studied mainly for text. Compared with text, news video has a complex structure and multiple media modalities, which makes it difficult to track the reports of a news event across videos from different sources.
     News video can be represented as a four-level hierarchy: frame, shot, story, and video. The story is the level most closely tied to news events, so techniques for identifying and tracking stories that report the same event in a news video database have become a frontier topic in news video research. This thesis explores that topic, which has both theoretical significance and broad application prospects. It aims to solve some of the key technical problems in news video story tracking and to provide a feasible route to event-based analysis and use of the information in news video databases.
     The thesis first establishes a framework for news video story tracking. On that basis it focuses on three key techniques: story segmentation, story correlation analysis, and threaded story tracking. Experiments validate the feasibility and efficiency of the proposed algorithms. The main contributions are as follows:
     1. A technical framework for news video story tracking is proposed. The thesis first defines the relevant concepts and key terms, then studies description models for news video files and stories, and proposes a “story space” representation of a news video database. Together these provide the theoretical basis for story tracking. On this basis the thesis proposes the technical framework, discusses the approaches and key techniques needed to realize it, and identifies the main research tasks.
     2. News video story segmentation methods are proposed and improved. Based on an analysis of the editing patterns of news video stories, the thesis presents a strategy for selecting candidate segmentation points from video and audio features, with particular emphasis on an adaptive anchorperson shot detection method. It also studies different set operations for fusing the video-feature and audio-feature candidate points, so that stories can be segmented effectively in news video from different sources.
     3. A news video story correlation analysis method is proposed. The thesis analyzes the intrinsic relationship between near-duplicate keyframes and story correlation analysis, together with the relevant domain knowledge. It studies strategies for constructing sub-databases for correlation analysis and for pruning local keypoints, both of which speed up keyframe matching substantially. It then proposes a hierarchical filtering method based on local keypoint matching that identifies near-duplicate keyframes quickly and effectively. Finally, it presents a story correlation analysis technique based on near-duplicate keyframes and the transitivity of correlations.
     4. A “multithreaded” tracking method for news video stories is proposed. To reflect the “multithreaded” nature of news event reporting, the thesis first presents a story similarity measure that fuses information from all semantic levels and all media modalities. Building on the description models of news video and stories, it focuses on the similarity of local features at the low visual level, the similarity of keyframe scene classes at the mid-level concept layer, the similarity of text at the high semantic level, and the strategy for fusing these similarities. It then examines the applicability of graph theory to story tracking and proposes an approach that uses directed graphs to track the similarity relations among stories in a “multithreaded” way.
     5. A news video story tracking system is designed and implemented. The design and the functional modules of the NStoryThread system are described in detail, and the implementation of a prototype is presented, which provides a basis for applying the framework and the related methods.
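
Contribution 2 above describes fusing video-feature and audio-feature candidate segmentation points with set operations. The abstract does not say which operations or tolerances the thesis uses, so the following is only a minimal sketch under stated assumptions: candidate points are timestamps in seconds, two points within a hypothetical tolerance `tol` count as the same boundary, and the function names `union_points` and `intersect_points` are illustrative.

```python
from typing import Iterable, List


def union_points(video_pts: Iterable[float], audio_pts: Iterable[float],
                 tol: float = 1.0) -> List[float]:
    """Union-style fusion: keep candidates from both streams,
    dropping any point within `tol` seconds of the last kept point."""
    merged: List[float] = []
    for t in sorted(list(video_pts) + list(audio_pts)):
        if merged and t - merged[-1] <= tol:
            continue  # same boundary as the previously kept point
        merged.append(t)
    return merged


def intersect_points(video_pts: Iterable[float], audio_pts: Iterable[float],
                     tol: float = 1.0) -> List[float]:
    """Intersection-style fusion: keep a video candidate only when
    some audio candidate lies within `tol` seconds of it."""
    audio = list(audio_pts)
    return [v for v in sorted(video_pts)
            if any(abs(v - a) <= tol for a in audio)]


# Hypothetical candidates: anchorperson-shot starts (video) and long pauses (audio).
video_candidates = [0.0, 12.4, 30.1, 58.0]
audio_candidates = [12.0, 31.5, 58.3, 75.0]
print(intersect_points(video_candidates, audio_candidates))
print(union_points(video_candidates, audio_candidates))
```

Intersection favors precision (a boundary needs support from both modalities), while union favors recall; which trade-off suits a given news source is an empirical question.
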
     In summary, the thesis concentrates on the key techniques of news video story tracking, namely story segmentation, story correlation analysis, and story tracking, and validates each technique experimentally. The results benefit news video analysis and data mining and have theoretical and practical significance for multimedia information analysis.
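
The correlation analysis in contribution 3 relies on near-duplicate keyframes and on the transitivity of correlations: if story A is correlated with B, and B with C, then A and C are treated as correlated as well. A common way to compute such a transitive closure is a union-find (disjoint-set) structure. The sketch below assumes that near-duplicate keyframe matching has already produced pairs of linked story identifiers; the function name `correlate_stories` and the input format are hypothetical, not taken from the thesis.

```python
from typing import Dict, Iterable, List, Tuple


def correlate_stories(story_ids: Iterable[str],
                      ndk_links: Iterable[Tuple[str, str]]) -> List[List[str]]:
    """Group stories into correlated sets.

    `ndk_links` holds pairs of stories that share at least one
    near-duplicate keyframe (assumed to be computed elsewhere).
    Transitivity is applied by merging sets with union-find.
    """
    ids = list(story_ids)
    parent: Dict[str, str] = {s: s for s in ids}

    def find(s: str) -> str:
        # Walk up to the set representative, halving the path on the way.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for a, b in ndk_links:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra  # merge the two correlated sets

    groups: Dict[str, List[str]] = {}
    for s in ids:
        groups.setdefault(find(s), []).append(s)
    return sorted(sorted(g) for g in groups.values())


# s1-s2 and s2-s3 share keyframes, so s1 and s3 end up in one group.
print(correlate_stories(["s1", "s2", "s3", "s4", "s5"],
                        [("s1", "s2"), ("s2", "s3")]))
```

Unrestricted transitivity can chain unrelated stories together through a single spurious match, so a practical system would likely constrain it, for example by time window or match confidence.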

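Contribution 4 combines a fused story similarity with a directed graph to follow several lines of development of one event. The abstract gives neither the fusion formula nor the graph construction rules, so the sketch below makes its own assumptions: a weighted linear sum over three per-level similarities (`visual`, `concept`, `text`), a fixed `threshold`, edges directed from the earlier story to the later one, and each root-to-leaf path read as one thread. All names, weights, and values are illustrative.

```python
from typing import Dict, List, Tuple

Sims = Dict[str, float]


def fuse(sims: Sims, weights: Dict[str, float]) -> float:
    """Weighted linear fusion of per-level similarities (an assumed scheme)."""
    return sum(weights[k] * sims[k] for k in weights)


def build_digraph(times: Dict[str, float],
                  pair_sims: Dict[Tuple[str, str], Sims],
                  weights: Dict[str, float],
                  threshold: float) -> Dict[str, List[str]]:
    """Add an edge earlier -> later for each pair whose fused similarity
    reaches the threshold. Ties in time are broken by story id, so the
    resulting graph is acyclic."""
    edges: Dict[str, List[str]] = {s: [] for s in times}
    for (a, b), sims in pair_sims.items():
        if fuse(sims, weights) >= threshold:
            src, dst = sorted((a, b), key=lambda s: (times[s], s))
            edges[src].append(dst)
    return edges


def threads(edges: Dict[str, List[str]]) -> List[List[str]]:
    """Enumerate every root-to-leaf path; each path is one thread."""
    has_parent = {d for dsts in edges.values() for d in dsts}
    roots = sorted(s for s in edges if s not in has_parent)
    paths: List[List[str]] = []

    def walk(node: str, path: List[str]) -> None:
        if not edges[node]:
            paths.append(path)
            return
        for nxt in edges[node]:
            walk(nxt, path + [nxt])

    for r in roots:
        walk(r, [r])
    return paths


times = {"A": 1, "B": 2, "C": 3, "D": 4}  # broadcast order
weights = {"visual": 0.4, "concept": 0.2, "text": 0.4}
pair_sims = {
    ("A", "B"): {"visual": 0.9, "concept": 0.5, "text": 0.8},
    ("B", "C"): {"visual": 0.7, "concept": 0.6, "text": 0.9},
    ("B", "D"): {"visual": 0.8, "concept": 0.4, "text": 0.7},
    ("A", "D"): {"visual": 0.1, "concept": 0.2, "text": 0.1},
}
graph = build_digraph(times, pair_sims, weights, threshold=0.5)
print(threads(graph))
```

Here story B branches into C and D, so the event yields two threads that share the prefix A, B; this branching is what a single linear chain of stories cannot express.
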