用户名: 密码: 验证码:
空间数据挖掘与GIS集成技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
地理信息系统和数据挖掘是当今信息技术中的两颗璀璨明珠。前者侧重于信息的管理,后者侧重于信息的处理与分析,有着紧密地联系性和互补性。
     GIS数据库中含有丰富的数据和信息,其中隐含着许多有价值的知识,而目前的GIS系统主要局限于实现数据的录入、查询、统计等功能,无法有效地发现数据中存在的关系和规则,而数据挖掘技术可以对GIS数据进行更高层次的分析,发现其中隐含的知识。因而从GIS的空间数据库中进行知识发现即空间数据挖掘,已成为数据挖掘领域中一个重要的研究方向。
     近年来两种技术的发展,使数据挖掘和GIS结合以挖掘GIS所管理的海量空间数据背后的知识规律成为可能。
     如何发现在大型空间数据库中所隐藏的、预先未知的信息以辅助相应的应用,这就是目前空间数据挖掘的任务。同时空间关联规则是空间数据挖掘结果的一种最主要的知识规则,它侧重于确定数据中不同领域之间的联系,找出满足给定支持度和可信度阈值的多个域之间的依赖关系。
     本文主要通过对知识发现和数据挖掘的研究入手,并结合GIS空间数据库,对空间数据挖掘与GIS的集成模式进行深入研究,并建立相应知识模型,提出了基于MapX的空间数据挖掘体系结构,进行模型数据预处理,实现了关联规则挖掘算法Apriori,由挖掘出的知识规则可以体现出空间数据挖掘与GIS集成后得广泛应用前景。本论文主要工作如下:
     (1)对知识发现和数据挖掘进行了系统深入的研究,将空间数据挖掘与GIS空间数据库进行联合,对两者的集成模式进行深入研究,并建立GIS数据库知识发现模型;
     (2)对空间数据的处理和分析过程进行了研究,提出了几种空间数据预处理的方法及GIS中的关联规则挖掘,分析了关联规则算法,并进行实现;
     (3)结合GIS组件MapX,提出了基于MapX的空间数据挖掘体系结构,实现了关联规则挖掘算法Apriori,建立挖掘算法Apriori试验分析平台,通过分析试验体现出空间数据挖掘与GIS集成后得广泛应用前景。
Nowadays, Geographic Information System and Data Mining and Knowledge Discovery technology are two bright pearls of information technology .The former emphasizes particularly on the management of information and the latter emphasizes particularly on the processing and analysis of information.
     There are abundant data and information in the database of Geographic Information System. These data and information include much implicit and valuable knowledge. Presently GIS is limited to the collection, query and statistic of data. It can not discover relations and rules in data while data mining techniques can analyze GIS data at a higher level which can discover the implicit knowledge. Discovering knowledge from spatial database of GIS with data mining techniques, namely spatial data mining, has become an important research aspect in data mining fields.
     The development of the two techniques makes it possible to mine the knowledge and rules that behind the massive spatial data which is stored in GIS by integrating the two techniques.
     The main task of spatial data mining is discovering the implicit, unknown information to assist relevant application. At the same time, spatial association rule is one of the most primary knowledge rules in the result of spatial data mining. It emphasizes particularly on confirming the relationship of data in different fields, and finding out the dependence of data in multi-fields.
     This paper mainly researches on spatial data mining and GIS Integration mode with GIS spatial database through studing knowledge discovery and data mining.In the paper a association knowledge model and a spatial data mining system architecture based on the MapX are presented. Moreover, model data are pretreated and association rules mining algorithm Apriori is implemented. The knowledge rules by data mining reflect a broad application prospects by integrating spatial data mining and GIS. The main thesis is as follows:
     (1) Through the research of the knowledge discovery and data mining, combined the spatial data mining and GIS spatial database, and built the knowledge discovery model through in-depth analysis of the spatial data mining and GIS spatial database's integration modes.
     (2) Studying the process of data processing and analysing advanced several spatial data reprocessing methods and spatial association rules mining, analyzed and realized the algorithm of spatial association rules.
     (3) Presenting a spatial data mining architecture based on GIS module MapX , implementing association rules mining algorithm Apriori and establishing mining algorithm Apriori analysis platfoim.it reflects a broad application prospects by integrating the spatial data mining and GIS .
引文
[1] 白石磊,毛雪岷,王儒敬,熊范纶.基于数据库和知识库的知识发现研究综述.广西师范大学学报(自然科学版),2003.21(1):137~140
    [2] 吕成哲,赵晓明,王起伟.浅谈数据挖掘理论.中国西部科技,2007(2):1~3
    [3] 杨会志.数据挖掘技术的主要方法及其发展方向.河北科技大学学报,2000.21(2):77~80
    [4] 乔明中.GIS在旅游行业的应用:[硕士学位论文].成都:电子科技大学计算机应用技术,2006
    [5] 郭达志,何彬彬.空间数据挖掘及其不确定性研究坝4绘与空间地理信息,2004.27(5):6~9
    [6] 张瑞菊.空间数据挖掘方法及其与GIS集成模式的应用研究:[硕士学位论文].山东:山东科技大学大地测量与测量工程,2003
    [7] 周海燕.空间数据挖掘的研究:[博士学位论文].河南:中国人民解放军信息工程大学测绘学院,2003
    [8] 魏芳,伊胜伟,李欣,徐友文.空间数据挖掘与GIS集成研究及应用.辽宁石油化工大学学报,2005.25(4):80~82
    [9] 肖寒,吴侃,孙君.基于GIS的空间数据和空间数据库浅谈.江苏地质,2005.29(2):105~107
    [10] 李德仁,程涛.从GIS数据库中发现知识.测绘学报,1995.24(1):37~40
    [11] 张洪群.基于J2EE的空间关联规则挖掘理论研究与原型系统实现:[硕士学位论文].福州:福州大学计算机应用技术,2003
    [12] 陆丽娜,陈亚萍,魏恒义,杨麦顺.挖掘关联规则中Apriori算法的研究小型微型计算机系统,2000.21(9):940~943
    [13] 刘星沙,谭利球,熊拥军.关联规则挖掘算法及其应用研究.计算机工程与科学,2007.29(1):83~86
    [14] 蔡伟杰,张晓辉,朱建秋,朱杨勇.关联规则挖掘综述.计算机工程,2001.27(5):30~33
    [15] 李芳玉.GIS数据预处理分析及若干算法的研究.计算机工程与应用,2004.23(1):54~56
    [16] 蒋良孝,蔡之华.空间数据挖掘的回顾与展望..计算机工程.2003.29(6):9~10
    [17] 王艳.城市空间数据挖掘方法与应用研究:[硕士学位论文].山东:山东师范大学人文地理学,2004
    [18] 崔阳,王华,乔淑娟.基于GIS的空间数据库构建与应用研究.微计算机信息,2006.22(2):80~83
    [19] 徐德军.基于GIS与SDM集成技术的基本地形特征识别:[硕士学位论文].湖北:武汉大学地图学与地理信息系统,2004
    [20] 石云,孙玉芳,左春.基于Rough Set的空间数据分类方法.软件学报,2000,11(5):673~678
    [21] 曹加恒,舒风笛,张凯,彭敏,叶可.基于多媒体数据库德数据挖掘系统原型.武汉大学学报,2000.46(5):567~570
    [22] 邸凯昌,李德仁,李德毅,Rough集理论及其在GIS属性分析和知识发现中的应用.武汉测绘科技大学学报,1999.24(1):6~10
    [23] 周海燕,王家耀,吴升.空间数据挖掘技术及其应用,测绘通报.2002.36(2):53~57
    [24] 张瑞菊,陶华学.GIS与空间数据挖掘技术集成问题的研究.勘察科学技术,2003.12(2):23~25
    [25] 吴为英.基于GIS的空间数据挖掘技术.山东科技大学学报(自然科学版),2004.23(3):15~18
    [26] 王志海,胡可云等.基于粗糙集合理论的知识发现综述.模式识别和人工智能,1998.11(2):346~351
    [27] 刘君强,孙晓莹.关联规则挖掘技术研究的新进展.计算机科学,2004,31(1):110~113
    [28] 李兴生,李德毅.基于云模型的决策表连续属性离散化方法.模式识别与人工智能,2003(3):33~37
    [29] 郭平,叶莲,范丽.基于粗糙集的空间信息预处理方法.计算机科学,2005.132(3):101~104
    [30] 李福堂,凌峰,张秋文.空间数据挖掘及其3S集成系统的应用研究.微机发展,2005(2)
    [31] 过仲阳,陆衍,戴晓燕.空间数据挖掘系统及其应用.上海地质,2004(4):76~78
    [32] 刘同时,刘伟.空间数据发掘技术的研究和发展趋势理论研究2004(1):45~47
    [33] Smith, M. H. Pedrycz, W. Expanding the meaning of and applications for data mining. systems, Man, and Cybernetics, 2000.
    [34] Teng, W. Pollack, N. Serafino, G. Long Chiu Sweatrnan, P. GIS and data interoperability at the NASA Goddard DAAC. Geoscience and Remote Sensing Symposium, 2001. 43~47
    [35] Xiaogang Li, Ruoming Jin, Agraywai. G. A compilation framework for distributed memory rallelization of data mining algorithms. Parallel and Distributed Processing Symposium, 2003.
    [36] Liu. Z, Guo. M 1. A proposal of integrating data mining and on-line analytical processing in data warehouse. Info-tech and Info-net, 2001.
    [37] Jiawei Han et al.数据挖掘:概念与技术.机械工业出版社[M].2001
    [38] Kanungo, T., Mount, D. M., Netanyahu, N. S. An effcient k-means clustering algorithm: analysis and implementation. Pattern Analysis and Machine Intelligence, IEEE Transactions on, Volume: 24, Issue: 7, July 2002: 33~36
    [39] Jiawei Han, Micheline Kamber, 数据挖掘概念与技术,北京:机械工业出版社,2001,8(8)
    [40] Robinson.N, Shapcott, M. information visualisation - beyond charts and graphs.Information Visualisation, 2002.
    [41] Xingdong Wu artificial intelligence in data analysis. Intelligent Agent Technology, 2004.
    [42] .Agrawal R, LimielinskiT, Swami A N.Mining Association Rules Between Sets of Item in Larger Databases[A].Proc of 1993 ACMSIGMOD Int'l Conf on Management of Data[C]. 1993.207-216
    [43] Robinson.N, Shapcott, M. information visualisation - beyond charts and graphs.Information Visualisation, 2002.94-103
    [44] Beaubouf T, Petry F. Vague regions and spatial relationships: a rough set approach. Computational Intelligence and Multimedia Applications, 2001: 313-317.
    [45] Berger A L, Delia Pietra. A maximum approach to natural language processing. Computational Linguistics, 1996, 22(1): 40-72.
    [46] David Hand, Heikki Mannila. Principles of Data Mining. China Machine Press.2003.
    [47] Jiawei Han, Micheline Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, Inc.2001.
    [48] Goodchild M.F, Models for uncertainty in area-class maps, In: Proceedings of The 2"d Internatianal Symposium on Spatial Data Quality 2003, Edited by ShiW.Z, Goodchild M.F, Fisher P.F, Hong Kong, The Hong Kong Polytechnic University, 2003.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700