用户名: 密码: 验证码:
基于SOA的数据挖掘服务在物流管理平台中的应用研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着数据挖掘理论及其应用研究的不断深入,数据挖掘技术在企业信息决策支持系统中显示出越来越重要的作用,已经成为企业信息化战略建设的重要组成部分。对于物流企业而言,大量的实物流、信息流以及资金流是其共同特点,采用数据挖掘技术对这些信息进行深入分析,对提高物流企业的运作效率、降低其运营成本具有重要意义。目前,已经出现了各种专业数据挖掘工具,但只有精通数据挖掘技术的专家才能熟练使用;而且纵向数据挖掘工具的开发成本又较高,如果直接在物流企业的信息系统中实施,必将增加其信息化建设的成本。
     物流行业中对数据挖掘技术日益增长的需求与应用数据挖掘技术的高成本之间的矛盾是本论文研究的出发点。论文全面研究了SOA(Service-OrientedArchitecture)架构、WCF(Windows Communication Foundation)开发技术及数据挖掘理论,通过分析数据挖掘技术在物流管理中的应用需求,以WCF为开发框架,在现代物流管理平台中设计基于SOA的数据挖掘服务,这些服务包括数据上传服务、数据清洗服务、数据挖掘算法服务和OLAP(On-Line AnalyticalProcessing)服务等。
     在数据挖掘服务的实现方面,论文研究了大文件上传、元数据管理、ETL(Extraction-Transformation-Loading,数据抽取、转换和加载)规则设计、数据挖掘算法封装、OLAP可视化等关键技术,并设计、开发了部分实例,讨论了如何在C/S(Client/Server,客户机/服务器)模式、B/S(Browser/Server,浏览器/服务器)模式及Windows Mobile等不同平台中调用WCF服务。
     在数据挖掘服务的应用方面,论文结合开源WebGIS(Web-Based GeographicInformation System)——MapEasy,对其二次开发后实现了动态标注、标注管理、标注查询,以及路径规划功能,并在此基础上开发了物流运输系统的实例,对配送路径、运输绩效问题进行智能分析,成功将上述部分关键技术进行了综合应用。
     对于数据挖掘服务的评价,论文分析了基于服务质量的评价策略,建立了具有通用性和专属性的完整的服务评价的本体及相应的评价因子,给出了数据挖掘服务的评价过程。同时采用了欧几里德距离法来判断数据挖掘服务之间的相似性,以对服务资源进行合理分配。
     论文的创新点主要体现在以下几个方面:
     (1)提出了一种采用WCF框架来设计基于SOA的数据挖掘服务的方法;
     (2)分析了实现数据挖掘服务需要解决的各种关键技术,结合WebGIS技术,将设计的数据挖掘服务成功应用到物流运输系统中;
     (3)基于Qos(Quality of Service)的评价方法来对数据挖掘服务进行评价和选择,取得了良好效果。
     论文的研究成果及创新内容不仅满足了物流企业对数据挖掘技术的需求,为企业节省了信息化成本,符合未来企业实现信息集成和业务敏捷性的趋势;而且,采用WCF框架开发数据挖掘服务也是为适应未来IT系统架构及其开发模式而进行的一种有效尝试。
     本论文得到国家十一·五支撑计划课题(项目编号:2006BAH02A06)的资助。
With the continuous development of theory and application research on Data Mining (DM), the technique has increasingly shown its significant place in the enterprises' information decision-making and supporting system, and has been one of the key components in the enterprises' informationization strategies. For most of logistic enterprises, one of the common characteristics is the great amount of physical flow, information flow and capital flow. Applying the technique of DM in the analysis of the information mentioned above will mean significance in improving the effectiveness and lowering the cost of the logistic enterprises' running. Nowadays, different sorts of DM Tools have been developed, but it relies on experts in DM to manipulate these tools proficiently. What's more, the develop of longitudinal DM will cost quite a lot, so if we implement it directly into the logistic enterprises information system, the cost of their informationization will run up as well.
     The contradiction between the increasing need of DM technique and its high implementing cost is the starting point of this thesis. A general introduction of SOA, WCF and the theory about DM is given. After analyzing the need of DM technique in logistics management, a kind of SOA-based DM service is developed within the framework of WCF, including data uploading service, data eliminating service, DM algorithm service and OLAP service and so on.
     In implementing of DM services, the key techniques including large file uploading, metadata management, ETL rules, DM algorithm packaging, and visualization by OLAP are analyzed in this thesis, and some examples developed here are shown as well. And the discussion about the call of WCF service in different platforms such as C/S, B/S, and Windows Mobile is carried out here.
     For the application of DM services, the thesis redevelops the service based on the open source technique WebGIS—MapEasy, and realizes the functions of dynamic noting, notes management, notes querying, and route planning. And as a basis, the distribution routes and effectiveness of transportation are analyzed intelligently in the application of a logistic transportation system, which is a successful implementation of these key techniques.
     The Qos-based evaluation method for DM services is adopted, with the service evaluation Ontology with Universality and specificity and relevant evaluation factors built, and the procedure of evaluation depicted. In order to allot the resource of services properly, the method of Euclid distance is used to judge the similarity of different services.
     The innovations of this thesis are mainly shown as follows:
     (1) Proposing a method of developing the SOA-based DM service within the framwork of WCF;
     (2) Analyzing the key techniques needed to realize the DM services, and applying these techniques to logistics transportation systems combined with WebGIS.
     (3) Evaluating the DM services based on Qos method.
     In a word, the research results and innovations obtained in this thesis meet the need of logistics enterprises on DM services, and lower the cost of informationization, which also meet the trend of information integration and agile operation for enterprises. What's more, the method of developing the DM service within the framwork of WCF is a kind of effective attempt of adapting to the future IT system architecture and developing mode.
     This thesis is supported by the funding from the 11th Five-Year Supporting Plan Issue (Project number: 2006BAH02A06).
引文
[1]杨斌.基于网格的分布式存储系统中数据分布和传输机制研究与实现[D]:[硕士学位论文].北京:北京交通大学,2007
    [2]http://www.emc.com/digital_universe
    [3]祖巧红.基于实例的OLAM技术及其多维可视化研究[D]:[博士学位论文].武汉:武汉理工大学,2007
    [4]YU Yi jun,CHEN Chun,YU Yi min,etal.Web multimedia information retrieval using improved Bayesian algorithm[J].Journal of Zhejiang University SCIENCE,2003,4(4):415-420
    [5]Khaled Alsabti,Sanjay Ranka,Vineet Singh.An Efficient K-Means Clustering Algorithm[J].2002,24(7):65-72
    [6]Ruoming Jin,Ge Yang,Gagan Agrawal,Member,Shared Memory Parallelization of Data Mining Algorithms:Techniques,Programming Interface Knowledge and Data Engineering,2005,17(1):71-89
    [7]Kim YongSeog,Street W Nick.An intelligent system for customer targeting:a data mining approach.Decision Support Systems,2004,37-59
    [8]Mourad Quzzani,Athman Bouguettaya.Efficient Access to Web Services[J].IEEE Internet Computing,2004,8(2)
    [9]Min Luo,Mark Endrei.Service-Oriented Architecure and Web Services[J].Inernational Technical Support Organization,2006
    [10]沈毅.基于面向服务架构(SOA)的港口企业信息集成系统的应用研究[D]:[硕士学位论文].厦门:厦门大学,2007
    [11]Eric Newcomer,Dreg Lomow.Understanding SOA with Web Services[M].Addison Wesley Professional,2004,142-144
    [12]侯健.基于SOA架构的通用数据交换平合的设计与实现[D]:[硕士学位论文].北京:华北电力大学,2006
    [13]朱磊,周明辉,刘天成等.一种面向服务的权限管理模型[J].计算机学报,2005,28(4):677-685
    [14]丁兆青,董传良.基于SOA的分布式应用集成研究[J].计算机工程,2007,5:246-248
    [15]邵欢庆,康建初.企业服务总线的研究与应用[J].计算机工程,2007,33(2):220-222
    [161 Nicolai M.Josuttis.SOA in Practice:The Art of Distributed System Design[M],O'Reilly Media Inc,2007,7-196
    [17]Juval Lowy.Programming WCF Services[M](张逸,徐宁).北京:机械工业出版社,2008,13-33
    [18]Ying Chen,Brad CohenBooz Allen Hamilton8251 Greensboro Drive McLean.Data Mining and Service Rating in Service-Oriented Architectures to Improve Information Sharing[J].IEEE Conference 2005,1-11
    [19]Scott Klein.Professional WCF Programming:.NET Development with the Windows Communication Foundation[J].Wiley publishing,2007,29-45
    [20]Michael Bell.Service-Oriented Modeling(SOA):Service Analysis,Design,and Architecture[M],Wiley publishing,2008,95-135
    [21]张波,陈定方,祖巧红.基于SQLSERVER2005的数据挖掘系统设计[J].湖北工业大学学报,2007,22(3):29-31
    [22]吴婕.浅析数据挖掘软件的发展[J].情报理论与实践,2004,27(2):212-214
    [23]杨永刚.数据挖掘在物流领域中的应用[D]:[硕士学位论文].武汉:武汉理工大学,2006
    [24]ZhaoHui Tang,Jamie MacLennan.Data Mining with SQL Server 2005[M],Wiley publishing,2007,365-398
    [25]何建安.基于物流信息平台的OLAP系统设计与实现[D]:[硕士学位论文].武汉:武汉理工大学,2005
    [26]马瑞新,许力.基于SOA的实时ETL的研究与实现[J].计算机工程与科学,2007,29(8):115-122
    [27]王媛媛.基于SQL Server 2005的Web日志挖掘系统构建[J].网络资源与建设,2006,(5):58-61
    [28]方志斌.基于数据挖掘技术的物流决策支持系统的研究与实现[D]:[硕士学位论文].长沙:国防科学技术大学,2006
    [29]朱子昊.基于数据挖掘技术的物流信息系统研究[D]:[硕士学位论文].上海:上海交通大学,2007
    [30]李宗璞.数据挖掘技术在物流系统中的应用[J].商场现代化,2006,(458):134-135
    [31]舒帆.港口物流信息平台共享架构及其可视化挖掘[J].上海海事大学学报,2006,(27):79-84
    [32]丁一涛,王国利,赵宏伟.基于Java技术的多媒体信息远程上传[J].计算机工程,2003,29(6):126-127
    [33]戚艳军,马光思.多文件上传在Web应用中的实现方法研究[J].计算机技术与发展,2006,16(4):158-159
    [34]林天华,马素霞,赵霞.基于Web的大文件上传技术研究[J].科技导报,2007,25(16):30-33
    [35]陈静.基于Web Services的数据上传系统的设计与实现[J].信息技术,2005,(6):17-23
    [36]平静,平林瑞.元数据管理及其在数据仓库中的应用研究[J].平原大学学报,2006,23(4):130-132
    [37]John Poole,Dan Chang,Douglas Tolbert,etal.Common Warehouse Metamodel:An Introduction to the Standard for Data Warehouse Integration[M].Chichester:John Wiley &Sons Inc,2001
    [38]OMG.MOF 2.0/XMI Mapping Specification version2.1[S/OL].(2005-09-01)[2007-05-30].http://www.omg.org/docs/formal/05-09-01.pdf.
    [39]管丽娟.数据ETL研究与展望[J].数据库及信息管理,2007,3:1512-1514
    [40]任蓉,王伦津.SQL Server 2005数据挖掘API技术分析与实例应用[J].宁夏工程技术,2007,6(3):221-225
    [41]陈健飞.地理信息系统导论[M].北京:科学出版社,2004
    [42]周戈,王蔚韬,何光辉.基于数据挖掘的GIS在车辆自动导航系统中的应用[J].计算机科学,2005,32(6):145-146
    [43]赵信洋.基于数据仓库的物流配送系统研究[D]:[硕士学位论文].武汉:武汉理工大学,2006
    [44]彭利.物流信息系统和车辆装载问题研究[D]:[硕士学位论文].长春:吉林大学,2004
    [45]黄晓滨,邹书蓉,张洪伟.改进的遗传算法及在物流配送路径优化中的应用[J].西南民族大学学报(自然科学版),2008,34(4):854-859
    [46]刘北林,高爽.配送车辆路径优化问题算法研究[J].商业经济,2008,309(9):31-33
    [47]王会云,肖建禄,刘登泰等.基于遗传算法的配送路线优化[J].后勤工程学院学报,2008,24(3):91-94
    [48]吴建斌,王晓虎.Qos驱动的Web services动态计价机制研究[J].计算机应用,2008,28(5):1307-1309
    [49]李玉华,陈云开,卢正鼎.基于质量的数据挖掘服务选择[J].计算机科学,2007,34(8):159-164
    [50]Silverstein A,Brin S,Motwani R.Beyond market baskets:Generalizing association rules to dependence rules[J].Data Mining and Knowledge Discovery,1998,2(1):39-68
    [51]Xu J,Chen H.Fighting Organized Crime:Using Shortest-Path Algorithms to Identify Associations in Criminal Networks[J].Deci-sion Support Systems,2004,38(3):473-487
    [52]刘振鹏,韩磊,刘志田.基于Qos相似性的服务选择[J].江西师范大学学报(自然科学版),2008,32(2):190-191
    [53]陈崚.完全欧几里德距离变换的最优算法[J].计算机学报,1995,18(8):100-120

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700