互联网环境下大规模图像的内容分析、检索和自动标注的研究 (Large Scale Image Content Analysis, Retrieval, and Automatic Annotation in Web Environment)

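English title: Large Scale Image Content Analysis, Retrieval, and Automatic Annotation in Web Environment
Author: Changhu Wang (王长虎)
Degree level: Doctoral
Discipline: Signal and Information Processing
Keywords: Image Retrieval (图像检索); Image Annotation (图像标注); Image Annotation Refinement (图像标注改善); Semantic Gap (语义鸿沟); Sparse Coding (稀疏编码); Distance Metric Learning (距离度量学习); Multiple-Instance Learning (多实例学习)
Degree year: 2009
Advisors: Hong-Jiang Zhang (张宏江); Mingjing Li (李明镜)
Discipline code: 081002
Degree-granting institution: University of Science and Technology of China (中国科学技术大学)
Submission date: 2009-04-17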

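Abstract (translated from the Chinese)

With the spread and development of the Internet and digital photography equipment, the number of images on the Internet is growing rapidly. On the one hand, this mass of images attracts more and more users; on the other hand, the ever richer image resources make it hard for users to find the information they actually need in a vast sea of data. As a result, fast and effective image retrieval has become an important research direction in both industry and academia.
     Internet image retrieval currently falls into two main categories: text-based image retrieval (TBIR) and content-based image retrieval (CBIR). TBIR is widely used in commercial image search engines. In a TBIR system, the textual information of Web images is used to index and search them, so the quality of the images' textual annotations becomes an important issue. CBIR is a very popular direction in academia. In a CBIR system, the visual content of images is used for indexing. Its main difficulty is the semantic gap: low-level content features of an image (such as color) cannot effectively describe high-level semantics (such as "dog").
     In this dissertation, we try to make full use of the rich textual and visual information of Web images to address the problems above. We study in depth automatic image annotation, image annotation refinement, reducing the semantic gap in Web image retrieval, and object-based image retrieval. In addition, to better process and exploit the massive data on the Internet and to support users' online retrieval more effectively, we paid particular attention, when designing the algorithms and implementing the retrieval systems, to their ability to handle large-scale image collections and to run in real time. The main results and contributions of the dissertation are as follows:
     1. We discuss and analyze the automatic image annotation problem and propose a multi-label sparse coding framework for feature extraction and classification, which we apply to automatic image annotation. We argue that the semantic similarity between two images whose labels partially overlap should be measured by reconstruction rather than by one-to-one comparison. In this framework, therefore, the semantic similarity between image label vectors and the semantic similarity between image feature vectors are both measured by one-to-many ℓ1 sparse reconstruction/coding.
     2. We discuss and analyze large-scale automatic image annotation and propose a search-based image annotation framework. Under this framework we provide users with an online image annotation service that annotates any submitted image in real time. We collected a large-scale image database from the Internet and use it as the training set for annotating an arbitrary image. The use of fast retrieval techniques and of a large-scale image database ensures that the proposed framework can handle large numbers of images and run in real time.
     3. We discuss and analyze image annotation refinement. We formulate it as a Markov process and explain existing annotation refinement work within this framework. To address the problems of existing work, we propose a content-based image annotation refinement algorithm. The effectiveness of the Markov process formulation, together with full use of the content information of both the image to be annotated and the training images, allows the proposed algorithm to largely remedy several problems of existing algorithms.
     4. We discuss and analyze the semantic gap in content-based image retrieval on the Internet and propose a ranking-based distance metric learning algorithm. Guided by the rich textual information of Web images, we try to learn a new distance metric in the visual space so that, given a query image, we can retrieve from the image database images that are semantically more relevant to it. Based on this algorithm we propose a large-scale content-based image retrieval (CBIR) framework and implement a real-time CBIR system on a collection of 2.4 million Web images.
     5. We discuss and analyze the use of multiple-instance semi-supervised learning (MISSL) to solve object-based image retrieval. We propose a new regularization framework for the MISSL problem and, based on it, a graph-based multiple-instance learning (GMIL) algorithm to solve MISSL. Within the same framework, GMIL reduces to a new standard multiple-instance algorithm (GMIL-M) and to a standard semi-supervised learning algorithm (GMIL-S). We prove theoretically that GMIL-S has a closed-form solution and that the iterative solutions of GMIL and GMIL-M converge. We apply GMIL to object-based image retrieval, and the experimental results confirm its effectiveness.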
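
The search-based annotation idea in contribution 2 can be illustrated with a minimal sketch: retrieve the tagged images most similar to the query and let them vote on labels. Everything specific here is an assumption made for illustration, not the dissertation's method: the function names `cosine` and `annotate`, the cosine similarity, the similarity-weighted vote, the toy feature vectors, and the linear scan, which merely stands in for the fast retrieval techniques the abstract mentions without naming.

```python
import math
from collections import defaultdict


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def annotate(query, database, k=3, top_tags=2):
    """Label `query` with the tags voted by its k most similar database images.

    `database` is a list of (feature_vector, tags) pairs. This is a
    hypothetical layout; a real system would use an index, not a full scan.
    """
    ranked = sorted(database, key=lambda item: cosine(query, item[0]), reverse=True)
    votes = defaultdict(float)
    for features, tags in ranked[:k]:
        weight = cosine(query, features)  # closer neighbours vote more strongly
        for tag in tags:
            votes[tag] += weight
    # Highest vote first; ties are broken alphabetically for determinism.
    best = sorted(votes.items(), key=lambda kv: (-kv[1], kv[0]))
    return [tag for tag, _ in best[:top_tags]]


# Toy stand-in for a Web-crawled, tagged image collection (invented values).
DATABASE = [
    ([0.9, 0.1, 0.0], ["sky", "cloud"]),
    ([0.8, 0.2, 0.1], ["sky", "sea"]),
    ([0.1, 0.9, 0.2], ["grass", "dog"]),
    ([0.0, 0.8, 0.3], ["grass", "tree"]),
    ([0.7, 0.1, 0.2], ["sky", "cloud"]),
]

print(annotate([1.0, 0.1, 0.05], DATABASE))  # ['sky', 'cloud']
```

Because the labels come from retrieved neighbours rather than from per-label classifiers, the vocabulary grows with the collection, which is one reason a large crawled database is attractive as a training set.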
Abstract (original English)

With the prevalence of the Internet and digital cameras, there are more and more digital images on the Web. On the one hand, the increasing number of images attracts more and more users; on the other hand, it is not easy for ordinary users to find what they really need in the sea of images. Therefore, effective and efficient image retrieval techniques have become an important research direction in both commercial and academic circles.
     Currently, there are two main image retrieval frameworks: text-based image retrieval (TBIR), which is widely used in commercial image search engines, and content-based image retrieval (CBIR), which has become a hot research topic in academic communities. In text-based systems, images are indexed and retrieved based on the textual information of Web images, so the quality of image annotations is one of the most important issues. In content-based image retrieval, images are indexed by their visual content, and one key problem is the semantic gap between low-level visual features and high-level semantic concepts.
     In this dissertation, we try to fully utilize the rich textual and visual information of Web images to solve the above-mentioned problems in Web image retrieval. The following key techniques of Web image retrieval are discussed: automatic image annotation, image annotation refinement, reducing the semantic gap in Web image retrieval, and object-based image retrieval. Moreover, to better handle and utilize the large amount of data on the Web, and to make online retrieval more convenient for users, we pay particular attention to the scalability and efficiency of the proposed algorithms and developed systems. The main contributions of the dissertation are as follows:
     1. We present a multi-label sparse coding framework for feature extraction and classification within the context of automatic image annotation. We claim that the semantic similarity of two images with partially overlapping labels should be measured in a reconstruction-based way rather than in a one-to-one way. Going beyond one-to-one similarity, the semantic similarities of label vectors and of image features are both measured by one-to-all ℓ1 sparse reconstruction/coding, as introduced later in the dissertation.
     2. We study the problem of large-scale automatic image annotation and propose a search-based image annotation framework. Under this framework, an online image annotation service has been deployed to annotate arbitrary images submitted by users in real time. A Web-scale image database is crawled from the Web and used as the training set to annotate an arbitrary image. The use of both efficient search technologies and a Web-scale image set guarantees the scalability of the proposed algorithm.
     3. We study the problem of image annotation refinement. We formulate the annotation refinement process as a Markov process and use this formulation to explain some existing annotation refinement algorithms. To solve the problems in existing algorithms, we propose a content-based image annotation refinement algorithm. Owing to the effectiveness of the Markov process formulation and the use of content information from both the query image and the training images, the proposed algorithm resolves the problems in existing algorithms to a large extent.
     4. We study the problem of bridging the semantic gap in content-based image retrieval on the Web and propose a ranking-based distance metric learning algorithm. Guided by the rich textual information of Web images, the proposed framework tries to learn a new distance measure in the visual space, which can be used to retrieve more semantically relevant images for any unseen query image. Based on the ranking-based distance metric learning algorithm, we propose a novel framework for large-scale content-based image retrieval (CBIR). We also implement a real-time CBIR system on a dataset of 2.4 million Web images.
     5. We study the problem of using multiple-instance semi-supervised learning (MISSL) to solve the object-based image retrieval problem. A novel regularization framework for MISSL is presented. Based on this framework, a graph-based multiple-instance learning (GMIL) algorithm is proposed to solve the MISSL problem. Under the proposed framework, GMIL can be reduced to a novel standard MIL algorithm (GMIL-M) and a standard SSL algorithm (GMIL-S). We theoretically prove the existence of a closed-form solution for GMIL-S and the convergence of the iterative solutions for GMIL and GMIL-M. We apply the GMIL algorithm to the object-based image retrieval problem. Experimental results show the superiority of the proposed method.
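
Contribution 3 formulates annotation refinement as a Markov process. One common way to realise such a formulation is a random walk with restarts over the candidate labels, sketched below. The abstract does not give the transition model, so the label-similarity matrix, the restart probability, the fixed iteration count, and the name `refine_scores` are all assumptions for illustration; the content-based variant described above would additionally use image content, which this sketch omits.

```python
def refine_scores(initial, similarity, restart=0.3, iters=100):
    """Re-rank candidate labels with a random walk with restarts.

    initial    -- confidence of each candidate label from a first-pass annotator
    similarity -- square matrix of non-negative label-to-label similarities
    restart    -- probability of jumping back to the initial distribution
    All three are hypothetical inputs chosen for this illustration.
    """
    n = len(initial)
    total = sum(initial)
    v = [x / total for x in initial]  # restart distribution, sums to 1

    # Column-normalise so that P[i][j] is the probability of moving j -> i.
    col_sums = [sum(similarity[i][j] for i in range(n)) for j in range(n)]
    if any(s <= 0 for s in col_sums):
        raise ValueError("every label needs at least one positive similarity")
    P = [[similarity[i][j] / col_sums[j] for j in range(n)] for i in range(n)]

    r = v[:]
    for _ in range(iters):
        # One Markov step, mixed with a restart to the initial confidences.
        r = [
            (1 - restart) * sum(P[i][j] * r[j] for j in range(n)) + restart * v[i]
            for i in range(n)
        ]
    return r


labels = ["sky", "cloud", "car"]
initial = [0.40, 0.25, 0.35]   # invented first-pass confidences
similarity = [                 # invented label similarities
    [0.0, 0.9, 0.1],
    [0.9, 0.0, 0.1],
    [0.1, 0.1, 0.0],
]
refined = refine_scores(initial, similarity)
for label, score in sorted(zip(labels, refined), key=lambda p: -p[1]):
    print(f"{label}: {score:.3f}")
```

In this toy input "car" starts above "cloud" but is only weakly related to the other candidates, so the walk moves probability mass toward the mutually consistent pair "sky" and "cloud" and "car" drops to last place.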

