用户名: 密码: 验证码:
本体映射的若干方法研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着互联网上信息资源数量的快速增长,网络的应用和需求也在不断地扩大。传统的互联网技术并不考虑网络资源间的结构组织,而仅仅是完成了网络资源的连接,致使各种知识无序、零散的分布在成千上万的存贮介质上。如何在庞大的互联网资源中准确、快速地找到用户需要的信息成为亟待解决的问题。为了使不同的Web应用和服务之间的语义互操作性成为可能,本体成为解决语义异构问题的关键。一个本体以机器语言可理解的形式存在,并且以简化和抽象的方式表示我们感兴趣的领域。而最重要的本体应用领域莫过于语义Web,通过本体可以很好地解决语义Web的语义描述和二义性问题。但是语义Web本身具有分布式和异构的特点,这就导致即使是相同领域的本体也存在着异构和不一致性,这些本体所描述的领域知识就不能共享。因此,解决本体之间的异构问题成为关键。本体映射的目的是找到不同本体相似元素之间的语义对应关系。因此,本体映射成为当前的热点研究课题。
     然而,尽管有许多有关映射及匹配的方法,但始终没有一个清晰且成功的方案能够完全适应今后的发展需要,并且能够在没有专家参与的情况下自动完成所有操作。因此,有许多悬而未决的问题依然存在。本文针对本体映射方法中所存在的问题进行了一系列研究,提出了几种相关方法,以解决存在的实际问题。
     首先,当前的许多映射算法都过于依赖本体中实体的字符串信息和本体的结构,这些技术有时能得到很好的映射结果,但有时也会出现映射对发现失败的现象。解决此问题的最直接方法就是通过实例来丰富每个概念结点所拥有的语义信息。基于实例的映射方法使用实例对应的文本中出现的单词及其频率来发现元素之间的映射关系,实例信息的丰富程度决定了此方法的映射效率,本体中的实体可能具有多个实例,每个实例又包含实例名称及与其相关联的属性值。但许多本体在构建时并没有为每个实体添加相应的实例信息,这将直接影响基于实例映射策略的性能。另外,在映射发现时存在的不确定性问题也有待解决。
     本文提出基于扩展信息的本体映射方法,首先,使用基于本体的信息检索方法将网页本体中的文本作为扩展信息对本体中的概念和属性结点进行实例扩展。然后,将本体视为分类树,利用源本体的实例信息作为训练集,使用基于层次的文本分类方法构造本体中概念和属性结点的分类器,而目标本体的实例可以作为测试集向源本体中的结点分类。上述工作的目的是为了得到实体之间等价与包含关系的概率模型。最后使用基于概率论的本体映射方法得到映射集合。这种方法可以为没有实例信息的本体扩展实例集合,提高基于实例的映射方法的有效性,通过结合概率论的映射方法,不仅能得到等价关系的映射,也可以同时得到包含关系的映射,在一定程度上解决了映射发现的不确定性问题。实验结果表明本文提出的方法对缺少实例或没有实例的本体进行映射时有很好的映射结果,并且能得到更为复杂的关系映射。
     其次,随着语义互操作需求的不断增长,为了满足更多的语义应用,出现了规模庞大、结构复杂的本体。传统的映射技术在处理轻量级本体之间的映射时有很好的性能,但对于包含实体较多,关系较为复杂的大规模本体,映射质量和映射效率都不理想。因此,处理大规模本体之间的映射发现任务已成为当前的研究热点。其中一种方法就是避免比较两个本体中的所有实体。从两个本体中选择可能成为映射的实体对作为候选映射对,这就需要对原有的候选映射集合进行压缩。换言之,就是从原有的实体对集合中选择更为准确的实体对作为候选映射集合。
     本文针对此类问题提出了基于AP聚类的候选集压缩及映射方法。该方法将近似值传递的聚类思想应用于本体中实体的聚类,将原有的映射候选集压缩成映射候选子集,属于同一类中的实体为映射候选集合,排除噪音实体,进而提高映射性能。在语义相似度的计算中,同一本体与不同本体中的实体采用不同的计算方法,同时考虑了语义相似信息与结构相似信息对聚类的影响。对规模较小的本体进行聚类时,聚类结果可直接作为M:N映射输出,而大规模本体产生的聚类结果则可作为映射候选集输出,再进行其它针对性策略的映射,以产生更精确的映射结果。
     再次,本体映射策略的开发主要是针对于不同本体中实体的相似度计算而进行的,而这些实体所具有的信息种类又多种多样(例如:语义信息、结构信息),这些信息可以理解为本体的特征,而单一的映射方法却不能获得本体实体中的所有信息,因此,多策略的应用被目前的映射方法普遍使用。
     本文提出了基于匹配空间的多策略匹配方法,该方法先使用基于字符串与基于语义的方法构造策略的匹配空间,其中通过分析各种策略的优劣判断是否加入匹配空间,这种方法可根据不同的策略所计算的实体相似度值得出映射状态,并输出最佳匹配状态。然后,在这种状态下使用基于结构的匹配策略迭代生成映射结果。通过将各种匹配策略进行合理的结合,避免了单一方法不能利用本体全部信息的不足,并且灵活地允许用户选择各种匹配策略,为各种策略的结合提出了一个很好的整合框架,使映射结果更为理想。
     最后,提出基于决策理论的本体映射方法,用快速本体映射方法得出候选映射实体对,然后根据信息理论对其相似信息分析并进行策略预选,利用熵值决策分析方法对所选策略进行结合,得出最终映射结果。此方法避免无用映射策略对映射结果的影响,为用户提供了一个有效选择策略的方式,并通过自动调整域值来提高映射的查准率及查全率。最后通过对所选策略计算得出的相似度值进行分析并调整结合权值。实验表明,通过策略的选择及参数的调整,从整体上提高了映射的性能。
     虽然公用数据集上的实验结果显示了这几种方法的有效性,但它们也存在着一些缺陷和问题。因此,在下一步的工作中,将对这些问题进行有针对性的改进,以进一步提高方法的性能。
As the Internet information resources are increasing rapidly, the network application anddemand is also continually expanding. Traditional Internet technology does not consider theorganization structure among network resources in spite of completing the connection ofnetwork resources, with the various knowledge in disorder and thousands of storage media inscattered distribution. How to find user-needed information accurately and quickly in the vastinternet resources become urgent issues to tackle. In order to make the interoperabilityavailable between different Web applications and services, ontology semantic heterogeneity isthe crux to solve the problem. What falls into our domain of interest is an ontology existingcomprehensibly as the machine language and the simplification and abstraction expressed.And the most important application domain of ontology is the semantic Web and the problemof the semantic description and ambiguity of semantic Web can be well solved by theontology.But distributed patterns and heterogeneity are characteristics of semantic Web, andontology description of the domain knowledge can’t be shared because there also existisomerism and inconsistency even if it is the same field of ontology. Therefore, solving theheterogeneity between ontology becomes a key priority and ontology mapping becomes thehot research topic.
     However, although there are many relevant mapping and matching method, but there hasnot been a clear and successful scheme which can fully adapt to the future development need,and can not complete all the operations automatically without any domain experts. Therefore,many unsolved questions still exist. Based on the problems existing in ontology mappingmethod in a series of research, this article puts forward several related methods in order tosolve the practical problems.
     First of all, many mapping algorithm were too dependent on ontology of the entity in thestring information and body structure. These techniques can sometimes get very goodmapping results, but sometimes mapping fails. Using the example to enrich each conceptnode’s semantic information is the most effective way to deal with this problem. Mappingmethod based on the example uses the example’s corresponding "text" appearing in the wordand its frequency to find out the mapping relationship between elements. The richness in theexample information of this method determines the mapping efficiency. Ontology of the entitymay have multiple examples, each instance contains instance name and its associated attributevalue. But many ontologies under construction do not add some example information for eachentity, which will directly influence the performance of the strategy based on examplesmapping. In addition, uncertainty problems in mapping remain to be solved.
     This paper proposes an ontology mapping method based on extended information. First,use of information retrieval based on ontology methods will expand ontology text as anextension information about ontology concept and attribute node. Then, the ontology as classification tree, the use of source ontology examples of information as a training set, theuse of text classification method based on hierarchy structure in ontology concept andattribute node classifier, and goal ontology examples can be used as a test set to sourceontology of node classification. The purpose of the work is to get the equivalence betweenentities and inclusion relation of probability model. Finally use the ontology mapping methodbased on probability theory for mapping set. This method can extend example set forno-example information ontology; improve the effectiveness of the mapping method based onexamples, by combining theory of probability mapping method. That can get not only theequivalence relation mapping, but also the inclusion relation mapping, to a certain extent,solve the uncertainty problem of found mapping. The experimental results show that theproposed method has a very good mapping result on lack of case or no example of ontologyfor mapping, and can get more complex relationship mapping.
     Secondly, as the demand of semantic interoperability is growing, in order to satisfy moresemantic application, a huge, complicated structure of ontology has come out. The traditionalmapping technology in the treatment of mapping among light ontology shows very goodperformance, but mapping quality and mapping efficiency are not ideal to mappingtechnology which contain more entity relationship and more complex large-scale ontology. Todeal with mapping finding tasks between large scale ontology has become a current researchfocus. One way is to avoid comparison between two ontology of all entities. Choose mappingentity pair, which is likely to be the candidate for mapping pair, from two ontology, thisneeds the original candidate mapping set compression. In other words, set choose the moreaccurate entity from the original entity as candidate mapping set.
     This paper puts forward candidate set compression and mapping method based on the APclustering. This method will apply approximation transfer clustering thought in the ontologyof the entity of clustering, the original mapping candidate set compressed into mappingcandidate subset. Those which belong to the same kind of entity are for mapping thecandidate set, eliminate noise entity, and improve the mapping performance. In the semanticsimilarity calculation, the same body and different entity in the ontology adopt differentcalculation method, and takes into account the semantic similarity information and structuresimilarity information on cluster effect. For smaller ontology clustering, clustering results canbe directly output as m: n mapping and large-scale ontology produces clustering results whichcan be as a mapping candidate set output, and then do other targeted strategy mapping, inorder to generate more accurate mapping results.
     Besides, ontology mapping strategy development is mainly based on different ontologyof the entity in similarity calculation and these entities have various information types (E.g.the semantic information, structure information).These information can be understood as thecharacteristics of ontology, and single mapping method can not acquire ontology entity’s totalinformation. Therefore, many strategy applications can be extensively used by the currentmapping method.
     The paper proposes matching spatial multi strategy matching method, which usesmethod based on the string and semantics to construct strategy of matching space firstly, which depends on the analysis of strategies to determine whether to add matching space.This method mapped from different entity similarity which is calculated by strategies, andoutput the best matching condition. Then, in this state use the strategy iteration based on thestructure matching to map results. With the reasonable combination of various matchingstrategies, it can avoid the insufficiency that a single method can not use all the informationin the ontology, allow the user to select various matching strategy flexibly, and provide a goodframework for a variety of tactics combination, making the mapping result more feasible.
     Finally, the paper sets out the method for ontology mapping based on the theory ofdecision making, using a rapid method for ontology mapping to get the candidate mappingentity, and then preselecting the strategy according to the information theory analysis and itssimilarity information, later using entropy decision analysis method to combine the chosenstrategy, getting the final mapping results. This method avoids the affect of uselessmapping strategy on the mapping results, provides users with an effective way of selectionstrategy, and use the automatic adjusting threshold to improve mapping precision andrecall. By the end of the calculation of the chosen strategy, Analyze and adjust thecombination weights. Experimental results show that using the choice of strategy and theadjustment of parameters can improve the overall performance of the mapping.
引文
[1] Stuckenschmidt, H. and F. v. Harmelen. Information sharing on the semantic web[M].Springer.2005.
    [2] Wiederhold, G. Mediators in the architecture of future information systems[J] IEEEComputer,1992,25(3):38-49.
    [3] Berners-Lee, T.J. Hendler, et al. The Semantic Web[J]. Scientific American.2001,284(5):34-43.
    [4] P. Shvaiko. Iterative Schema-based Semantic Matching[D]. PhD thesis, InternationalDoctorate School in Information and Communication Technology, University of Trento,Trento, Italy, November2006.
    [5] Vanessa Lopez, Michele Pasin, and Enrico Motta. AquaLog. An ontology-portablequestion answering system for the semantic web[C]. In Proceedings of2nd EuropeanSemanticWeb Conference (ESWC), Lecture notes in computer science, Hersounisous(GR),2005,23(3532):546-562.
    [6] Martin Dzbor, John Domingue, and Enrico Motta. Magpie-towards a semantic webbrowser[C]. In Proceedings of2nd International Semantic Web Conference (ISWC),Lecture notes in computer science, Sanibel Island (FL US),2003,22(2870):690-705.
    [7] Martin Dzbor, Enrico Motta, and John Domingue. Opening up Magpie via semanticservices[C]. In Proceedings of3rd International Semantic Web Conference (ISWC),Lecture notes in computer science, Hiroshima (JP),2004,22(3298):635–649.
    [8] Stojanovic,Ljlijana. Methods and Tools for Ontology Evolution[D]. PhD thesis,InstitutAIFB, Universitat Karlsruhe (TH), Karlsruhe, Germany,2004.
    [9] Jér me Euzenat and Pavel Shvaiko[C]. Ontology matching. Springer Verlag, Berlin,2007.
    [10] H.Sofia Pinto, Steffen Staab, Christoph Tempich. DILIGENT: Towards a fine-grainedmethodology for Distributed, Loosely controlled and evolving Engineering ofontologies[C]. In Proceedings of the16th European Conference on Artificial Intelligence(ECAI).2004:393-397.
    [11] V. Richard Benjamins,Jesús Contreras,Mercedes Blázquez,Luís Rodrigo,PompeuCasanovas and Marta Poblet.The SEKT Legal Use Case Components: Ontology andArchitecture[C]. Legal Knowledge and Information Systems. The Seventeenth AnnualConference. Amsterdam: IOS Press,2004:69-77.
    [12] Amit P S, James A L. Federated database systems for managing distributed,heterogeneous, and autonomous databases[J]. ACM Comput. Surv.1990,22(3):183-236.
    [13] Batini C, Lenzerini M, Navathe S B.A comparative analysis of methodologies fordatabase schema integration[J].ACM Comput.Surv.1986,18(4):323-364.
    [14] Agrawal R, Srikant R.On integrating catalogs[C]. In Proceedings of the tenthinternational conference on World Wide Web.2001:603-612.
    [15] Bouquet P, Serafini L, Zanobini S. Semantic coordination: a new approach and anapplication[C]. In Proceedings of ISWC.2003:130–145.
    [16] Chawathe S, Garcia-Molina H, Hammer J, et al. The TSIMMIS project: Integrationof heterogeneous information sources[C].16th Meeting of the Information ProcessingSociety of Japan.1994,202:1-12.
    [17] Draper D, Halevy A Y, Weld D S. The nimble integration engine[C]. Proceedings ofthe2001ACM SIGMOD international conference on Management of data.2001:567-568.
    [18] Halevy A Y,Ashish N,Bitton D,et al. Enterprise information integration: successes,challenges and controversies[C]. In Proceedings of the2005ACM SIGMODinternational conference on Management of data.2005:778-787.
    [19] D Shestako,S Bhowinick,EP Lim. DEQUE: Querying the DeepWeb[J]. Data&Knowledge Engineering,2005,52:273-311.
    [20] Deep Web Technology. Accessible at http://WdeePwebteeh.com/,October2005.
    [21] D M Jones,T J M. Methodologies for Ontology Development[C]. In Proceedings of IT&knows Conference of the15th IFIP World Computer Congress,1998.
    [22] Bouquet P, Serafini L, Zanobini S. Semantic coordination: a new approach and anapplication[C]. In Proceedings of ISWC.2003:130-145.
    [23] McGuinness D L. UNSPSC Ontology in DAML+OIL, Retrieved November.2001,5:2004.
    [24] Leukel J, Schmitz V, Dorloff F D.A modeling approach for product classificationsystems. Database and Expert Systems Applications[C]. In Proceedings of13th.International Workshop on.2002.868-874.
    [25] Volz, Raphael, Oberle, Daniel, Staab, Steffen, and Studer, Rudi. WonderWeb deliverable D11: OntoLiFT Prototype[R]. Technical Report11,2003.
    [26] Rogier van Eijk, Frank de Boer, Wiebe van de Hoek, and John-Jules Meyer. Ondynamically generated ontology translators in agent communication. InternationalJournal of Intelligent Systems,2001,16(5):587-607.
    [27] FlorisWiesman, Nico Roos, and Paul Vogt. Automatic ontology mapping for agentcommunication[C]. In Proceedings of1st International joint Conference on Autonomousagents and multiagent systems (AAMAS), Bologna (IT),2002.563-564.
    [28] Jun Wang and Les Gasser. Mutual online ontology alignment. In Proceedings ofAAMAS Workshop on Ontologies in Agent Systems (OAS), Bologna (IT),2002.
    [29] Sidney Bailin, Walt Truszkowski. Ontology negotiation: How agents can really get toknow each other[C]. In Proceedings1st International Workshop on Radical AgentConcepts (WRAC), Lecture notes in computer science, McLean (VA US),2002,2564:320-334.
    [30] Ilya Zaihrayeu. Towards Peer-to-Peer Information Management Systems[D]. PhD thesis,International Doctorate School in Information and Communication Technology,University of Trento, Trento (IT), March2006.16,17
    [31] Peter Haase, Jeen Broekstra, Marc Ehrig, Maarten Menken, Peter Mika, MariuszOlko,Michal Plechawski,Pawel Pyszlak,Bj rn Schnizler and Ronny Siebes, et al..Bibster-a semantics-based bibliographic peer-to-peer system[J]. Journal of WebSemantics,2004,2(1):99-103.
    [32] Fausto Giunchiglia and Ilya Zaihrayeu. Making peer databases interact-a vision for anarchitecture supporting data coordination. Proceedings6th International Workshop onCooperative Information Agents (CIA), Madrid (ES),2002.
    [33] Pavel Shvaiko, Fausto Giunchiglia, Marco Schorlemmer, Fiona Mc-Neill, AlanBundy, Maurizio Marchese, Mikalai Yatskevich, Ilya Zaihrayeu, Bo Ho, VanessaLopez, Marta Sabou, Joaq’ n Abian, Ronny Siebes, and Spyros Kotoulas. Dynamicontology matching: a survey[R]. Deliverable3.1, Open Knowledge STREP,2006.
    [34] Philip Bernstein, Fausto Giunchiglia, A. Kementsietsidis, John Mylopoulos,Luciano Serafini, and Ilya Zaihrayeu. Data management for peer-to-peer computing: Avision[C]. In Proceedings of5th International Workshop on the Web and Databases(WebDB), Madison (WI US),2002.
    [35] Zachary Ives, Alon Halevy, Peter Mork, and Igor Tatarinov. Piazza: mediation andintegration infrastructure for semantic web data[J]. Journal of Web Semantics,2004,1(2):155–175.
    [36] P. Adjiman,P. Chatalic,F. Goasdoué,M.-C. Rousset and L. Simon. Somewhere in thesemantic web[C]. In Proceedings of32nd International Conference on Current Trends inTheory and Practice of Computer Science (SofSem), Lecture notes in computer science,Merin (CZ),2006,3831:84–99.
    [37] Maurizio Lenzerini. Data integration: A theoretical perspective[C]. Proceedings21stSymposium on Principles of Database Systems (PODS), Madison (WI US),2002,233-246.
    [38] Steffen Staab and Heiner Stuckenschmidt(M), editors. Semantic web and peer-to-peer.Springer, Heidelberg (DE),2006.
    [39] Christoph Bussle, Dieter Fensel, Alexander Madche. A concept architecture forsemantic web enabled web services[J]. ACM SIGMOD Record,2002,31(4):24-29.
    [40] Karl Aberer, Tiziana Catarci, Philippe Cudr’e-Mauroux, Tharam Dillon, StephanGrimm,Mohand-Said Hacid,Arantza Illarramendi,Mustafa Jarrar,Vipul Kashyap,Massimo Mecella, Eduardo Mena, Erich Neuhold, Aris Ouksel, Thomas Risse,Monica Scannapieco, Felix Saltor, Luca De Santis, Stefano Spaccapietra, SteffenStaab, Rudi Studer, and Olga De Troyer. Emergent semantics systems[C].InProceedings1st International Conference on Semantics of a Networked World (ICSNW),Lecture notes in computer science, Paris (FR),2004,3226:14-43.
    [41] Karl Aberer, Philippe Cudre-Mauroux et al. Emergent semantics principles andissues[C]. Proceedings9th International Conference on Database Systems for AdvancedApplications (DASFAA), Lecture notes in computer science, Jeju Island (KR),2004,2973:25-38.
    [42] Anna Zhdanova, Reto Krummenacher, Jan Henke, Dieter Fensel. Community-drivenontology management: DERI case study. Proceedings4th International Conference onWeb Intelligence (WI), pages73-79, Compiegne (FR),2005,73-79.
    [43] F. Giunchiglia, M. Yatskevich, P. Avesani, and P. Shvaiko. A large scale dataset forthe evaluation of ontology matching systems[J]. The Knowledge Engineering Review,2008.
    [44] H. Do and E. Rahm. COMA-a system for flexible combination of schema matchingapproaches[C]. In Proceedings of VLDB,2002.
    [45] J. Madhavan, P. Bernstein, E. Rahm. Generic schema matching with Cupid[C]. InProceedings of VLDB,2001.
    [46] H. Do and E. Rahm. COMA–a system for flexible combination of schema matchingapproaches[C]. In Proceedings of VLDB,2002.
    [47] R. Gligorov, Z. Aleksovski, W. ten Kate, and F. van Harmelen. Using googledistance to weight approximate ontology matches[C]. In Proceedings of WWW,2007.
    [48] Z. Aleksovski. Using background knowledge in ontology matching[D]. PhD thesis,Vrije Universiteit Amsterdam,2008.
    [49] J. Madhavan, P. Bernstein, A. Doan, and A. Halevy. Corpus-based schemamatching[C]. In Proceedings of ICDE,2005.
    [50] S. Zhang and O. Bodenreider. Experience in aligning anatomical ontologies[J].International Journal on Semantic Web and Information Systems,2007,3(2):1-26.
    [51] M. Sabou, M. d’Aquin, and E. Motta. Exploring the semantic web as backgroundknowledge for ontology matching[J]. Journal on Data Semantics,2008.156-190.
    [52] F. Giunchiglia, P. Shvaiko, and M. Yatskevich. Discovering missing backgroundknowledge in ontology matching[C]. In Proceedings of ECAI,2006.
    [53] S. Castano, A. Ferrara, D. Lorusso, T. N¨ath, and R. M¨oller. Mapping validationby probabilistic reasoning[C]. In Proceedings of ESWC,2008.
    [54] X. Dong, A. Halevy, and C. Yu. Data integration with uncertainty[C]. In Proceedingsof VLDB,2007.
    [55] A. Gal. Managing uncertainty in schema matching with top-k schema mappings[J].Journal on Data Semantics,2006,6:90-114.
    [56] A. Gal, A. Anaby-Tavor, A. Trombetta, and D. Montesi. A framework for modelingand evaluating automatic semantic reconciliation[J]. The VLDB Journal,2005,14(1):50-67.
    [57] J. Madhavan,P. Bernstein,P. Domingos,and A. Halevy. Representing and reasoningabout mappings between domain models[C]. In Proceedings of AAAI,2002.
    [58] H. Nottelmann and U. Straccia. Information retrieval and machine learning forprobabilistic schema matching[J]. Information Processing and Management,2007,43(3):552-576.
    [59] X. Dong, A. Halevy, and C. Yu. Data integration with uncertainty[C]. In Proceedingsof VLDB,2007.
    [60] A. Sarma, X. Dong, and A. Halevy. Bootstrapping pay-as-you-go data integrationsystems[C]. In Proceedings of SIGMOD,2008.
    [61] M. Mochol, A. Jentzsch, and J. Euzenat. Applying an analytic method for matchingapproach selection[C]. In Proceedings of the workshop on Ontology Matching,2006.
    [62] M. Huza, M. Harzallah, and F. Trichet. OntoMas: a tutoring system dedicated toontology matching[C]. In Proceedings of the workshop on Ontology Matching,2006.
    [63] Y. Lee,M. Sayyadian, A. Doan,and A. Rosenthal. eTuner: Tuning schema matchingsoftware using synthetic scenarios[J]. The VLDB Journal,2007,16(1):97-122.
    [64] C. Domshlak, A. Gal, and H. Roitman. Rank aggregation for automatic schemamatching[J]. IEEE Transactions on Knowledge and Data Engineering,2007,9(4):538-553.
    [65] S. Falconer and M. Storey. A cognitive support framework for ontology mapping[C]. InProceedings of ISWC/ASWC,2007.
    [66] A. Mocan. Ontology-based data mediation for semantic environments[D]. PhD thesis,National University Ireland Galway,2008.
    [67] A. Mocan, E. Cimpian, and M. Kerrigan. Formal model for ontology mappingcreation[C]. In Proceedings of ISWC,2006.
    [68] Dieter Fensel. Ontologies: a silver bullet for knowledge management and electroniccommerce[M]. Springer, Heidelberg (DE),2nd edition,2004.
    [69] Strawson, Peter F. and Bubner, Riidiger. Semantik und Ontologie[M]. Vandenhoeck&Ruprecht.1975.
    [70] Nicola Guarino, Roberto Poli. Editorial: The role of formal ontology in the informationtechnology[J]. Int. J. Hum.-Comput. Stud.1995,43(5-6):623-624.
    [71] Borst, W. Construction of Engineering Ontologies for Knowledge Sharing andReuse[D]. Ph.D. Dissertation, University of Twente,1997.
    [72] Rudi Studer, V. Richard Benjamins, Dieter Fensel. Knowledge Engineering:Principles and Methods[J]. Data Knowl. Eng,1998,25(1-2):161-197.
    [73] John F. Sowa: Ontology, Metadata, and Semiotics[C]. ICCS2000:55-81.
    [74] Gómez-Pérez, A.Corcho. Ontology languages for the Semantic Web[J]. IEEEIntelligent Systems,2002,17(1):54–60.
    [75] Genesereth et al. Knowledge Interchange Format[R]. Reference Manual Online.1992.
    [76] T.R.Gruber. Ontolingua: A Mechanism to Support Portable Ontologies[R]. KnowledgeSystems Laboratory, Stanford University, CA.1992.
    [77] MacGregor, R. Inside the LOOM classifier[J]. SIGART Bulletin,1991,2(3):70-76.
    [78] S.Luke, J.Heflin. SHOE1.01. Proposed Specification[R]. University of MarylandOnline,2000.
    [79] R. Karp, V. Chaudhri, J.Thomere. XOL: An XML-Based Ontology ExchangeLanguage. SRI International Online,1999. http://www.ai.sri.com/pkarp/xol/.
    [80] F.Manola, E.Miller. RDF Primer[C/OL]. W3C Recommendation Online,2004.http://www.w3.org/TR/rdf-primer/.
    [81] D. Brickley, R.V. Guha, RDF Vocabulary Description Language1.0: RDFSchema[C/OL]. W3C Working Draft Online.2004. http://www.w3.org/TR/PR-rdf-schema.
    [82] D. Fensel, F.van Harmelen et al. OIL: An ontology infrastructure for the SemanticWeb[J]. IEEE Intelligent Systems.2001,16(2):38-45.
    [83] I. Horrocks, F. van Harmelen. Reference Description of the DAML+OIL OntologyMarkup Language Online.2002. http://www.daml.org/2001/03/reference.
    [84] M.K. Smith,C. Welty,D.L. McGuinness. OWL Web Ontology Language Guide.W3COnline,2004. http://www.w3.org/TR/owlguide/.
    [85] Vipul Kashyap, Amit Sheth. Semantic and Schematic Similarities Between DatabaseObjects: A Context-Based Approach[J]. The VLDB Journal,1995,5(4):276-304.
    [86] Erhard Rahm, Philip Bernstein. A survey of approaches to automatic schemamatching[J]. The VLDB Journal,2001,10(4):334-350.
    [87] W. Cohen, P. Ravikumar, S. Fienberg. A comparison of string metries for matchingname sand records[C]. In Proceedings of KDD Workshop on Data Cleaning and ObjectConsolidation, Washington,2003.
    [88] D. Maynard, S. Ananiadou. Term extraction using a similarity-based approach[M]. D.Bourigault, C. Jacquemin, M. Lhomme, eds, Recent advances in computationalterminology. Amsterdam:John Benjamins,2001:261-278.
    [89] E. Rahm, P. Bernstein, A survey of approaches to automatic schema matching[J]. TheVLDB Journal,2001,10(4):334-350.
    [90] J.Euzenat, et al. State of the art on ontology alignment[J]. The Journal of arthroplasty,Deliverable D2.2.3, Knowledge web NoE,2004.
    [91] Euzenat, J. and Valtchev, P. Similarity-based ontology alignment in OWL-lite[C]. InProceedings of15th European Conference on Artificial Intelligence (ECAI), Valencia(ES),2004.
    [92] P. Valtchev, J. Euzenat. Dissimilarity measure for collections of objects and values[C].In Proceedings of2nd Symposium on Intelligent Data Analysis (IDA), London (UK),1997.
    [93] P.Valtchev. Construction automatique de taxonomies pour I'aide a la representation deconnaissances par objets. These d'informatique, Universite Grenoble,Grenoble (FR),1999.
    [94] Z. Wu, M. Palmer. Verb semantics and lexical selection[C]. In Proceedings of32ndAnnual Meeting of the Association for Computational Linguistics (ACL), Las Cruces(NM US),1994.
    [95] P. Mitra, G. Wiederhold, J. Jannink, Semi-automatic integration of knowledgesources[C]. In Proceedings of2nd International Conference on Information Fusion,Sunnyvale (CA US),1999.
    [96] P. Mitra, G. Wiederhold, M. Kersten. A graph oriented model for articulation ofontology interdependencies[C]. In Proceedings of8th Conference on ExtendingDatabase Technology (EDBT), Praha (CZ),2000.
    [97] S. Castano,A. Ferrara,S. Montanelli. Matching ontologies in open networked systems:Techniques and applications[J]. Journal on Data Semantics,2006,3870:25-63.
    [98] S. Castano, A. Ferrara, S. Montanelli. Dynamic knowledge discovery in open,distributed and multi-ontology systems: Techniques and applications[M]. Web semanticsand ontology, Idea Group Publishing, Hershey (PA US),2005, ch.8:226-258.
    [99] N. Natalya, M. Mark. PROMPT: Algorithm and tool for automated ontology mergingand alignment[C]. In Proceedings of the17th National Conference of ArtificialIntelligence (AAAI), Austin (TX US),2000.
    [100] Y. An, A. Borgida, J. Mylopoulos. Discovering the semantics of relational tablesthrough mappings[J]. Journal on Data Semantics,2006,4244:1-32.
    [101] Y. An, A. Borgida, J. Mylopoulos. Constructing complex semantic mappingsbetween XML data and ontologies[C]. In Proceedings of4th International Semantic WebConference (ISWC), Galway (IE),2005.
    [102] Y. An, A. Borgida, J. Mylopoulos, Inferring complex semantic mappings betweenrelational tables and ontologies from simple correspondences[C]. In Proceedings of4thInternational Conference on Ontologies, Databases and Applications of Semantics(ODBASE), Agia Napa (CY),2005.
    [103] P. Bouquet, B. Magnini, L. Serafini, S. Zanobini. A SAT based algorithm forcontext matching[C]. In Proceedings of4th International and InterdisciplinaryConference on Modeling and Using Context (CONTEXT), Stanford (CA US),2003.
    [104] P. Bouquet, L. Serafini, S. Zanobini. Semantic coordination: A new approach and anapplication[C]. In Proceedings of2nd International Semantic Web Conference (ISWC),Sanibel Island (FL US),2003.
    [105] P.Bouquet,L. Serafini,S. Zanobini,S. Sceffer. Bootstrapping semantics on the web:meaning elicitation from schemas[C]. In Proceedings of15th International WWWConference, Edinburgh (UK),2006.
    [106] E. Sirin, B. Parsia, B. Grau, et al. Pellet:a practical OWL-DL reasoned[J]. Journalof Web Semantics,2007,5(2):51-53.
    [107] D. Tsarkov, L. Horrocks. FaCT++description logic reasoner: system description[C].In Proceedings of3rd International Joint Conf. on Automated Reasoning (IJCAR),Springer, Seattle (WA US),2006.
    [108] F. Giunchiglia, P. Shvaiko, Semantic matching[J]. The Knowledge EngineeringReview,2003,18(3):265-280.
    [109] P.Mitra, N. Noy, A. Jaiswal. Ontology mapping discovery with uncertainty[C]. InProceedings4th International Semantic Web Conference (ISWC), Galway (IE),2005.
    [110] J. Euzenat, Brief overview of T-tree: the Tropes taxonomy building tool[C]. InProceedings of4th ASIS SIG/CR Workshop on Classification Research,1994.
    [111] M. Lacher, G. Groh. Facilitating the exchange of explicit knowledge through ontologymappings[C]. In Proceedings of14th International Florida Artificial IntelligenceResearch Society Conference (FLAIRS), Key West (FL US),2001.
    [112] G. Stumme, A. Madche. FCA-Merge: Bottom-up merging of ontologies[C]. InProceedings of17th International Joint Conference on Artificial Intelligence (IJCAI),Seattle (WA US),2001.
    [113] A. Doan, J. Madhavan, P. Domingos, A. Halevy. Ontology matching: a machinelearning approach[M]. Handbook on ontologies,Springer Verlag,Berlin (DE),2004,ch.18:85-404.
    [114] Y. Kalfoglou, M. Schorlemmer. IF-Map: an ontology mapping method based oninformation flow theory[J]. Journal on Data Semantics,2003,1(1):98-127.
    [115] U. Straccia, R. Troncy. oMAP: Combining classifiers for aligning automatically OWLontologies[C]. In Proceedings of6th International Conf. on Web Information SystemsEngineering (WISE), New York (NY US)2005.
    [116] Euzenat, J. and Valtchev, P. Similarity-based ontology alignment in OWL-lite[C]. InProceedings of15th European Conference on Artificial Intelligence (ECAI), Valencia(ES),2004.
    [117] W. Hu, Y. Zhao, et al. The results of Falcon-AO[C]. In Proceedings Internationalworkshop on Ontology Matching (OM), Busan, Korea.2007.
    [118] Y. Li, Q. Zhong, J. Li, J. Tang. Result of ontology alignment with RiMOM atOAEI'06[C]. In Proceedings of International workshop on Ontology Matching (OM),2006.
    [119] Thomas R. Gruber. A translation approach to portable ontology specifications[J].Knowledge Acquisition,1993,5(2):199-220.
    [120] Natalya F. Noy. Semantic Integration: A Survey of Ontology-based Approaches[J]. ACMSIGMOD Recordvol,2004,33(4):65-70.
    [121] S. Pavel, E. Jér me. A Survey of Schema-based Matching Approaches[J]. Journal ofData Semantics,2005,3730(4):146-171.
    [122] Tang Jie, Liang BangYong, Li Juan-Zi. Wang Ke-Hong. Automatic OntologyMapping in Semantic Web[J]. Chinese Journal of Computers,2006,29(11):1956-1957.
    [123] P. Bouquet, L. Serafini, and S. Zanobini. Peer-to-Peer Semantic Coordination[J].Journal of Web Semantics,2005,2(1):1-24.
    [124] Konstantinos Kotis, George A. Vouros, Konstantinos Stergiou. Towards automaticmerging of domain ontologies: The hcone-merge approach[J]. Journal of WebSemantics,2006,4(1):60-79.
    [125] Prateek Jain, Peter Z. Yeh, et al. Contextual Ontology Alignment of LOD with anUpper Ontology: A Case Study with Proton[C]. In Proceedings of ESWC,2011.
    [126] Z. Aleksovski, M. Klein, W. ten Katen, and F. van Harmelen. MatchingUnstructured Vocabularies using a Background Ontology[C]. In Proceedings of EKAW,LNAI. Springer-Verlag,2006.
    [127] H. Stuckenschmidt, F. van Harmelen, et al. Using C-OWL for the Alignment andMerging of Medical Ontologies[C]. In Proceedings of the First Int. WS. on FormalBiomedical K. R.(KRMed),2004.
    [128] W. van Hage, S. Katrenko, and G. Schreiber. A Method to Combine LinguisticOntology-Mapping Techniques[C]. In Proceedings of ISWC,2005.
    [129] K. Minkoo,H. A. Ali,S. D. Jitender,et al. On Modeling of Concept Based Retrievalin Generalized Vector Spaces[C], Foundations of intelligents systems:12thintenationalsymposium, ISMIS,2000.
    [130]张永兴,孙四明,张峰.基于本体的信息检索系统研究[J].微计算机信息,2011,27(8):125-127.
    [131] Mia K.stem,Joseph E Beck, Beverly Park Woolf. Naive Bayes classifiers for usermodeling[C]. Proceedings of the Conference on User Modeling,1999.
    [132] D.D.Lewis.Narve (Bayes). The In Dependenee Assum Ptionin Inofmration Retrieval[C].In Proeeedings of the10thEuro Pean Coneference on Machine Lemaing, NewYork,1998.
    [133] S.Eyhermaendy, D.D.Lewisn, D.Madigna. On the na ve byaes model for textcategorization[M]. Artifieial Intelligenee&Statisties,2003.
    [134] E. Pengnad,D..Sehuumrnas,.Combining na ve byaes and n-gram language models fortext classification. Proceedings of The25thEuropean Conference on InofmrationRertieval Researeh(ECIR03),2003.
    [135] Yiming Yang. An Evaluation of Statistical Approaches to Text Categorization[J].Information retrieval,1999,1(1):76-88.
    [136] W. Cohen and Y. Singer. Context-sensitive learning methods for text categorization[C].In Proceedings of the19thAnnual International ACM SIGIR Conference on Researchand Development in Information Retrieval,1996:307-315.
    [137] D.D.Lewis,R.E.Schpaier,J.P. Callnan,R. Parka. Training algorithms for linear textclassifiers. In Proceedings of the19th Annual International ACM SIGIR Conference onResearch and Development in Information Retrieval,1996:298-306.
    [138] Y. Yang, C.G. Chute. A linear least squares fit mapping method for informationretrieval from natural language texts. In Proceedings of the14th Conference onComputational Linguistics(COLING92),1992.
    [139] C. Hsu, C. Lin. A comparison on methods for multi-class support vector machines,IEEE Transactions on Neural Networks.2002,13:415-425.
    [140] K. Nigam, J. Lafferty, A. McCallum. Using maximum entropy for textclassification[C]. In Proceedings of the IJCAI-99Workshop on Information Filtering,Stockholm, Sweden,1999.
    [141] E. Wiener. Aneural network approach to topic spotting[C]. In Proceedings of the4thAnnual Symposium on Document Analysis and Information Retrieval (SDAIR95), LasVegas, NV,1995.
    [142] C. Apte,P. Damerau,S. Weiss. Text mining with decision rules and decision trees. InProceedings of the Conference on Automated Learning and Discovery Workshop6:Learning from Text and the Web,1998.
    [143] B. Lent, A. Swami, J. Widom. Clustering association rules. In Proceedings of theThirteenth International Conference on Data Engineering (ICDE97), Birming,England,1997.
    [144] B. Liu, W. Hsu, Y. Ma. Integrating classification and association rule mining. InProceedings of the Fourth International Conference on Knowledge Discovery and DataMining(KDD98), New York,1998.
    [145] G. Dong, J. Li. Efficient mining of emerging patterns: Discovery trends anddifferences[C]. In Proceedings of the5th International Conference on KnowledgeDiscovery and Data Mining(KDD99), San Diego, CA,1999.
    [146] P. Stemmer. An algorithmfor suffix strippingprograms[J]. Pro2gram,1980,4(3):130-137.
    [147] G. Salton, C.Buckley. Term-weighting approaches in automatic text retrieval[J].Information Processing&Managemen.1998,24(5):513-523.
    [148] Dunja Mladenic, Marko Grobelnik. Feature Selection for Classification Based onText Hierarchy[C]. In Proceedings of Conference on Automated Learning and DiscoveryCONALD,1998.
    [149] R′emi Tournaire,Jean-Marc Petit,Marie-Christine Rousset,and Alexandre Termier.Discovery of Probabilistic Mappings between Taxonomies: Principles andExperiments[J]. Journal of Data Semantics,2011,15:66-101.
    [150] M.H Degroot. Optimal Statistical Decisions[R] Wiley-Interscience,2004.
    [151] A.Doan, J.Madhavan, P. Domingos, A.Y.Halevy. Learning to map betweenontologies on the Semantic Web[C]. In: WWW.2002,662–673.
    [152] T.H.Cormen, C.E.Leiserson, R.L.Rivest, C.Stein, Introduction to AlgorithmsSecond Edition[C]. The MIT Press,2001
    [153] P.Duchon,P.Flajolet,G.Louchard,G.Schaeffer,Boltzmann samplers for the randomgeneration of combinatorial structures[J]. Comb. Probab. Comput.2004,13(4-5):577-625.
    [154] T. Mitchell. Machine Learning. McGraw-Hill Education (ISE Editions) http://www.amazon.ca/exec/obidos/redirect?tag=citeulike0920&path=ASIN/0071154671.1997.
    [155] G.W.Flake, S.Lawrence. Efficient SVM regression training with SMO. Mach Learn[J].2002,46(1-3):271–290.
    [156] M.Q.Stearns, C.Price, K.A.Spackman, et al. SNOMED clinical terms: overview ofthe development process and project status[C], In Proceedings of AMIA Symp.2001,662(6):503-512.
    [157] A. Rector, J. Rogers. Ontological and practical issues in using a description logic torepresent medical concept systems[R]. Experience from GALEN, Reasoning Web,Second International Summer School, Tutorial Lectures.4126:197-231.
    [158] C. Rosse, L. V. Mejino J. A reference ontology for biomedical informatics: theFoundational Model of Anatomy[J], Journal of Biomedical Informatics.2003,36(6):478-500.
    [159] J.Golbeck, G.Fragoso, F.Hartel, et al. The National Cancer Institute’s ThésaurusandOntology[M], Web Semantics: Science, Services and Agents on the World WideWeb.2003,1(1):75-80.
    [160] Q. Zhong,H. Li,J. Li,G. Xie,J. Tang,L. Zhou. A gauss function based approachfor unbalanced ontology matching[C]. In Proceedings of2009ACM SIGMODInternational Conference on Management of Data (SIGMOD2009),2009.
    [161] T. Wachter, A. Wobst, M. Schroeder, H. Tan, and P. Lambrix. A corpus-drivenapproach for design, evolution and alignment of ontologies[C]. In Proceedings of the38th conference on Winter simulation, Monterey, California,2006,1595-1602.
    [162] Pubmed central. http://www.pubmedcentral.nih.gov/.
    [163] QU Yu-zhong,HU Wei,CHENG Gong. Constructing Virtual Document for OntologyMatching [C]. In Proceedings of the15th International Conference on World Wide Web.New York: ACM,2006,23-31.
    [164]刘春辰,刘大有,王生生,等.改进的语义相似度计算模型及应用[J].吉林大学学报:工学版,2009,39(1):119-123.
    [165]纪祥,刘华虓,吴芬芬,等.基于特征和HMM的信息提取[J].吉林大学学报:信息科学版,2009,27(4):396-399.
    [166]吴健,吴朝晖,李莹,等.基于本体论和词汇语义相似度的Web服务发现[J].计算机学报,2005,28(4):595-602.
    [167] R. Yves, et al. Ontology matching with semantic verification[J], Web Semantics:Science, Services and Agents on the World Wide Web,2009,7(3):235-251.
    [168] D. Lin. An information-theoretic definition of similarity[C], In Proceedings of15thInternational Conference of Machine Learning (ICML),1998,296-304.
    [169] V.Schickel-Zuber,, B.Faltings. OSS: A semantic similarity function based onhierarchical ontologies[C]. In Proceedings of the20th International Joint Conference onArtificial Intelligence (IJCAI2007),2007, pp:551–556.
    [170] Food and Agriculture Oganization of the United Nations, ASFA Ontology [EB/OL].http://www4.fao.org.asfa.asfa.htm/,2009.
    [171] V.Haarslev and R.Moeller. RACER system description. In Proceedings of the Int. JointConf. on Automated Reasoning (IJCAR2001),2001.
    [172] G. Stoilos, G.B. Stamou, S.D. Kollias. A string metric for ontology alignment[C]. InProceedings of ISWC,2005.
    [173] W.E. Winkler. The state of record linkage and current research problems[R]. TechnicalReport, Statistical Research Division, US Census Bureau,1999.
    [174] V.I. Levenshtein, Binary codes capable of correcting deletions, insertions, andreversals[J], Soviet Physics Doklady,1966,10(8):707-710.
    [175] H. Hamacher, H. Leberling, H.J. Zimmermann. Sensitivity analysis in fuzzy linearprogramming[J]. Fuzzy Sets and Systems,1978,1(4):269-281.
    [176] G. Miller. WordNet an on-line lexical database[J]. International Journal ofLexicography,1990,3(4):235–312.
    [177] N. Seco, T. Veale, J. Hayes, An intrinsic information content metric for semanticsimilarity in WordNet[C]. In Proceedings of ECAI,2004,1089-1090.
    [178] V.D. Blondel, A.Gajardo, M. Heymans, P. Senellart, P. Van Dooren, A Measureof Similarity between Graph Vertices[C]. CoRR, cs.IR/0407061,2004.
    [179] R. Tous, J. Delgado. A vector space model for semantic similarity calculation andOWL ontology alignment[C]. In Proceedings of DEXA,2006,307-316.
    [180] Vincent D. Blondel et al. A measure of similarity between graph vertices: Applicationsto synonym extraction and web searching[J]. SIAM Rev.,2004,46(4):647-666.
    [181] Knouf, N. MIT bibtex ontology.2003. http://visus.mit.edu/bibtex/0.1/bibtex.owl.
    [182] eBiquity Publication Ontology. UMBC ontology http://ebiquity.umbc.edu/ontology/publication.owl.
    [183] P. Bouquet,L. Serafini. On the difference between bridge rules and lifting axiomsc[C].In Procceedings of4th International and Interdisciplinary Conference on Modeling andUsing Context, Stanford,2003,80-93.
    [184] H. Do, S. Melnik, E. Rahm. Comparison of schema matching evaluations[C]. InProceedings of workshop on Web and Databases,2002,221-237.
    [185] G. Pirró, D. Talia, LOM. A linguistic ontology matcher based on informationretrieval[J], Journal of Information Science.2008,34(6):845–860.
    [186]王红卫等.基于数据的决策方法综述[J].自动化学报.2009,35(6).
    [187]王开军,张军英等.自适应仿射传播聚类[J].自动化学报.2007,33(12).

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700