用户名: 密码: 验证码:
面向海量信息处理领域的数据网格及其关键技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着信息技术的不断发展,越来越多的信息在丰富人们的思想、扩大人们视野的同时,也为计算机进行海量信息处理带来了众多难题,其主要表现在以下几个方面:存在着大量的、异构的数据源,而且格式各异;同时这些信息又缺乏一个统一的规范化的描述方法;再者由于信息的更新速度非常快,因此还需要解决数据修改和同步的问题;此外还需要解决信息的易用性问题。
     本文针对以上海量信息处理过程中出现的困难和问题,提出了面向海量信息处理的数据网格MIPDG(Data Grid for Mass Information Process)。作为一种新型的数据管理和利用的体系架构,MIPDG提出了一种新的海量信息处理中心的数据网格建设模式,并通过为多种数据格式提供标准化的描述方式,以实现相关数据的自动关联、自动映射;通过对各类数据源提供副本创建策略、副本一致性算法和数据传输算法等,有效地解决信息资源的一致性共享问题,实现高速可靠的数据访问;通过对海量信息处理业务提供平台级和应用级的访问支持,从而极大地降低海量信息处理应用开发和使用的复杂性,为最终实现信息的全面共享和综合利用提供了一个高性能、大容量、广域覆盖的数据共享平台。
     本文以高性能、易用性和可扩展性为依据,对面向海量信息处理的数据网格的体系结构及若干关键技术做了详细研究和探讨,主要工作和贡献如下:
     1)结合海量信息处理需求的特点,设计了若干个由高速网络互联形成的分布式海量信息处理中心节点,建立了数据中心的数据网格建设模式,实现了对多数据源的稳定可靠访问,克服了由于数据源本身在存储容量、网络带宽以及可用性等方面差异所导致的访问瓶颈问题,为海量信息处理应用可扩展、可维护、易用性等目标提供了保障。
     2)基于面向对象的设计方法,给出了一种层次型的海量信息处理元数据结构定义,实现了灵活的数据映射机制。并根据这种元数据目录管理的方式,设计了一种基于服务的数据映射五层模型,实现了对海量数据透明、可扩展和开放的映射管理,为不同存储方式、不同格式类型的多种数据源提供了统一便捷的数据访问模式。
     3)根据海量信息处理业务流程的特点,给出了DRFT数据传输策略,建立了一种无监督的数据传输调度模型,提出了三种数据传输作业调度策略,并进一步对最优适合策略进行优化,优化算法不仅能够充分利用有效的传输带宽,而且还具有比较稳定的传输速率。
     4)针对副本管理机制的研究,提出了基于聚类的动态副本创建策略、基于活跃度的多阶段副本一致性算法。这两种算法克服了由于网络带宽有限、地理位置分散所带来的数据访问效率低等困难,有效地减少了平均作业执行时间,提高了网格资源的利用率以及网格环境的性能和可扩展性,并保证网格系统的正确运行。
     研究成果已在实验环境中得到测试,不仅验证了整个网格系统的可行性,还验证了对于一个具备论文提出的各项策略的网格系统,其数据访问性能能够得到明显的改善,为论文提出的各种技术的推广应用提供了有益的参考。
With the development of information technology, more and more information enriches the idea of people and enlarges their field of vision. At the same time, it brings many difficult problems, which is listed as follows: There exits a large amount of heterogenous data resource, whose size and network environment are very different. At the same time, the information is lack of a uniform format. How to transform them into operable and standard data with fixed format according to the users’different requirement is also a formidable task. Because of the rapid speed of information updating, the change and synchronization of data also need to be resolved. How to guarantee the consistency of information with the limited bandwidth will make the mass information process application has the ability of access to the newest and the exactst data.Moreover, the ability of information use will be improved.
     This thesis presents the Data Grid for Mass Information Process (MIPDG) based on those problems and difficulties mentioned above. MIPDG, as the new data mamagement architecture, proposed a new mode of mass information process center and by providing a uniform standard description mode for different data formats. So the automatic mapping and automatic association will be implemented. The replica creation strategy, the replica coherence algrithom and data transfer algrithom can resolve the problem of information resource coherence sharement effectively. MIPDG provides a level of application of access support for mass information process, which can reduce the complexity of mass information process application. It will provide a high performance, giant capacity, high speed transfer, wide spread data share platform for share and utilization of information.
     The thesis analyses the architecture of MIPDG and its’key technologies in detail based on high performance, ease to use, and expansibility. The contribution of this thesis is composed of eight points listed below:
     1) MIPDG designs many distributed information center nodes combined with the charastics of mass information process. Also it sets up the uniform model of data access based on data center, which can overcome the access bottleneck problem caused by the difference of capacity, network bandwidth and utilization of data resources themselves.
     2) Based on the design of OO, it gives a definition of mass information process metadata and it carries out the flexible mechanism. Otherwise, it designes a five-layer model of data mapping based on service by utilizing the mode of metadata catalogue management, which implements the transparent and expanding mapping management of massive data. As a result, a uniform convenient data access mode is provided for the data access of different storage manner and different data formats.
     3) The model put forward a data transfer strategy called DRFT(Distribute Reliable File Transfer), which implements the automatic partition of data transfer process and management by the mode of job scheduler. Therefore, the aim at the automatic scheduler of data transfer without manaual work is achieved. The model also discusses the assignment of bandwidth and proposes three data transfer scheduler strategies. Morver, the best fit strategy is optimized, which can not only utlize the transfer bandwidth effectively, but also has more steady transfer speed.
     4) A Strategy of Dynamic Replica Creation Based on Clustering (DRCC) and a strategy of activity based multi-phase consistency maintenance algorithm are proposed. These two algorithms overcome the difficulties of the low data access efficiency caused by the limited network bandwidth. As a result, it reduces the average completion time and improves the usage of grid resource and the informance of grid environment.
     The feasibility of the whole grid system is validated by setting up the system of MIPDG in the experimental enviorment. The experiment results show that our replica creation strategy, coherence algorithm and data transfer algorithm is effective and correct. The average job response time and replica updation times are both improved. The results will be a valuable reference to the application of MIPDG and its strategy or algorithm in future.
引文
[1] [Andrea 2004] Andrea Domenici, Flavia Donno, Gianni Pucciani, Heinz Stockinger, Kurt Stockinger. Replica consistency in a data grid[J]. Nuclear Instrument and Methods in Physics Research, 2004, 534: 24-28.
    [2] [Arie 2001] Arie Shoshani, Alex Sim, Junmin Gu. Storage Resource Managers: Middleware Components for Grid Storage. 2001.
    [3] [Atsuko 2001] Atsuko Takefusa. Bricks. A Performance Evaluation System for Scheduling Algorithms on the Grids [A]. In JWAITS: JSPSWorkshop on Applied Information Technology for Science [C]. Tokyo Japan, 2001.
    [4] [Avaki 2003] Avaki Data Grid[EB/OL]:http://www.sybase.com/products/allproductsa-z/avakieii/sybaseavakifordatagrid.
    [5] [Barbara 1996] Barbara Bicking, RussellEast. Towards Dynamically Integrating Spatial Data and its Metadata[A]. First IEEE Metadata Conference[C]. NOAA Auditorium, Silver Spring, Maryland, April 16-18,1996.
    [6] [Baldonado 1997] M Baldonado, C Chang, L Gravano et al. The Stanford Digital Library Metadata Architecture[J]. International Journal Digital Libraries, 1997, 1 (2) 108-121.
    [7] [Beck 2000] M. Beck, T. Moore, J. Plank, and M. Swany. Logistical Networking: Sharing More Than the Wires[A]. In Proccedings of 2nd Annual Workshop on Active Middleware Services[C]. August 2000.
    [8] [Bell 2003] Bell W. H., Cameron D. G., Carvajal-Schiaffino R., Millar A. P., Stockinger K., Zini F. (2003).Evaluation of an Economy-Based File Replication Strategy for a Data Grid. Proccedings of 3rd IEEE International Symposium on Cluster Computing and the Grid (CCGrid), Tokyo Japan,May 2003, IEEE Computer Society Press.
    [9] [Bell2 2003] Bell W. H., Cameron D. G, etc. Simulation of Dynamic Grid Replication Strategies in OptorSim[J]. University of Glasgow,2003:2-7.
    [10] [Berman 2003] Fran Berman, Geoffrey Fox, Tony Hey. Grid Computing Making the Global Infrastructure a Reality[M]. England: John Wiley & Sons Ltd., 2003.
    [11] [Bird 2001] I. Bird, B. Hess, and A. Kowalski. Building the mass storage system at Jeerson Lab[A]. In Proceedings of 18th IEEE Symposium on Mass Storage Systems[C]. San Diego,California: April 2001.
    [12] [Bldonado 1997] Michelle Bldonado, Chen-Chuan K. Chang, Luis Gravano, Andreas Paepcke. The Stanford Digital Library Metadata Arechitehture[J]. International Journal on Digital Libraries, 1997(1):108-121
    [13] [Buyya 2002] Buyya R, Murshed M. GridSim: A Toolkit for the Modeling and Simulation of Distributed Resource Management and Scheduling for Grid Computing [J]. The Journal of Concurrency and Computation: Practice and Experience. 2002,5: 1-32.
    [14] [Cameron 2003] Cameron D G, Carvajal-Schiaffino R, Millar A P, et al. Evaluating Scheduling and Replica Optimization Strategies in optorSim[C/OL]. In 4th International workshop on Grid Computing (Grid 2003). Arizona: IEEE Computes Society Press, 2003.
    [15] [Cancio 2001] German Cancio, CERN, et.al. The DataGrid Architecture Version 2[EB/OL]. http://acs.lbl.gov/~hoschek/publications/edg-architecture.pdf. July 2, 2001.
    [16] [Carman 2002] Carman M., Zini F., Serafini L. and Stockinger K. Towards an Economy-Based Optimisation of File Access and Replication on a Data Grid[A]. In Proceedings of the 1st IEEE/ACM International Conference on Cluster Computing and the Grid (CCGrid)[C]. Berlin,Germany:IEEE Computer Society Press,May 2002: p340-345.
    [17] [CERN 2003] CERN in 2 mintutes[EB/OL]. http://public.web.cern.ch/Public/whatiscern.html.
    [18] [Chaitanya 2003] Chaitanya Baru, Reagan Moore, Arcot Rajasekar, Michael Wan. The SDSC Storage Resource Broker[EB/OL]. http://www.sdsc.edu/srb/index.php/Main_Page. 2003
    [19] [Chang 2006] Ruay_Shiung Chang, Jih_Sheng Chang. Adaptable Replica Consistency Service for Data Grids[C]. In Proceedings of the Third International Conference on Information Technology: New Generations (ITNG06), 2006.
    [20] [Chen 2005] Chen X, Ren SS, Wang HN, Zhang XD. SCOPE. Scalable consistency maintenance in structured P2P systems[A]. In: Proc. of the IEEE Infocom 2005[C]. Washington: IEEE Computer Society, 2005:p1502-1513.
    [21] [Chervenak 1999] A. Chervenak, I. Foster, C. Kesselman, C. Salisbury, S. Tuecke. The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets[J]. Journal of Network and Computer Applications, 23:187-200, 2001 (based on conference publication from Proceedings of NetStore Conference 1999)
    [22] [Chervenak 2002] A. Chervenak, E. Deelman, I. Foster, et al. Giggle: A Framework for Constructing Sclable Replica Location Services[C]. In Proceedings of Supercomputing 2002 (SC2002), Baltimore , USA: IEEE Computer Society Press , November 2002.
    [23] [Crosby 2003] Crosby P. EDGSim: Simulating the European Data Grid [EB/OL] .http: //www.hep.ucl.ac.uk/~pac/EDGSim/edgsim.html, 2003.
    [24] [Christopher 1996] Christopher Miller, Thomas Karl, et al. Documenting Climatological Data Sets for GCOS: a Conceptual Model[C]. First IEEE Metadata Conference, NOAAAuditorium, Silver Spring, Maryland: April 16-18, 1996.
    [25] [Czajkowski 2004] K. Czajkowski, D. Ferguson, I. Foster, J. Frey, S. Graham, T. Maguire, D. Snelling, S. Tuecke. From Open Grid Services Infrastructure to WS-Resource Framework[C]. Refactoring & Evolution. March 5, 2004.
    [26] [Datta 2003] Datta A, Hauswirth M, Aberer K. Updates in highly unreliable, replicated peer-to-peer systems[C]. In Proceedings of the 23rd Int’l Conference on Distributed Computing Systems. Washington: IEEE Computer Society, 2003:76-85.
    [27] [Deelman ,2002] E. Deelman, Carl Kesselman, Gaurang Mehta, Leila Meshkat, Laura Pearlman, Kent Blackburn, Phil Ehrens, Albert Lazzarini, Roy Williams, Scott Koranda. GriPhyN and LIGO. Building a Virtual Data Grid for Gravitational Wave Scientists[C]. In 11th IEEE International Symposium on High Performance Distributed Computing HPDC-11. Edinburgh, Scotland 2002 (HPDC02):July 24-26, 2002.
    [28] [Deelman ,2003] E Deelman, James Blythe, Yolanda Gil, Carl Kesselman, Gaurang Mehta, Karan Vahi. From Metadata to Execution on the Grid Pegasus and the Pulsar Search[C]. In 12th IEEE International Symposium on High Performance Distributed Computing, Seattle: Washington:June, 2003: 22-24.
    [29] [Deelman ,2004] E. Deelman, G. Singh, M.P. Atkinson. Grid-Based Metadata Services[C]. In 16th International Conference on Scientific and Statistical Database Management (SSDBM04), June 2004.
    [30] [DeFago 1998] X. DeFago, A. Schiper, N. Sergent. Semi-passive replication[C]. In Proceedings of the 17th IEEE Symposium on Reliable Distributed Systems (SRDS). West Lafayette,USA: Oct. 1998:43-50.
    [31] [Dou 2004] Dou W, Wang HM, Jia Y, Zou P. A Rumor-spreading Analog on Unstructured P2P Broadcast Mechanism(in Chinese with English abstract) [J]. Joural of Computer Research and Development, 2004,41(9):1460-1465. http://crad.ict.ac.cn/papers/2004-9-1460.htm.
    [32] [EMC 2004] EMC Corporation. VMware Workstation4 [OL]. http://www.vmware.com/download/.2004,(3).
    [33] [Enstore 2007] Enstore Mass Storage System:Enstore and dCache User Documentation[EB/OL]. http://www.fnal.gov/docs/products/enstore, 2007
    [34] [EU 2003] The EU Data Grid Project[EB/OL], http://www.eu-datagrid.org/. 2003
    [35] [Eurogrid 2003] http://www.eurogrid.org. 2003
    [36] [First 2002] First results and Future Perspectives of the European DataGrid project[EB/OL]. http://www.hoise.com/primeur/02/articles/weekly/AE-PR-04-02-22.html
    [37] [Foster 1999] I. Foster and C. Kesselmann. Globus: A Toolkit-Based Grid Architecture[R]. In The Grid: Blueprints for a New Computing Infrastructure. Morgan Kaufmann:1999:p259-278.
    [38] [Foster 2000] Ian Foster. Internet Computing and the Emerging Grid[EB/OL]. http://www.nature.com/nature/webmatters/grid/grid.html. Nature Web Matters, 7th December 2000.
    [39] [Foster1 2002] Ian Foster. The GriPhyN Virtual Data System[R]. Technical Report GriPhyN-2002-02.
    [40] [Foster2 2002] Foster I., et.al. Grid Services for Distributed System Integration, Computer, 35,2002
    [41] [Foster3 2002] K. Ranganathan and I. Foster. Identifying Dynamic Replication Strategies for High Performance Data Grids[C]. In Proceedings of International Workshop on Grid Computing. Denver, CO: November 2002.
    [42] [Foster 2004] Ian Foster,Carl Kesselman. The Grid 2──Blueprint for a New Computing Infrastructure(Second Edition). 2004.
    [43] [Foster 2008] Ian Foster, Yong Zhao, I Raicu, S Lu. Cloud Computing and Grid Computing 360-Degree Compared[C]. Grid Computing Environments Workshop(GCE08) 2008:p1-10.
    [44] [Gianni 2003] Gianni Pucciani, et al. Replica Consistency Service in a Data Grid[C]. In Proceedings of the 9th International Workshop on Advanced Computing and Analysis Techniques in Physics Research, 2003.
    [45] [GIG 2007] Global Information Grid. http://www.globalsecurity.org/intell/systems/gig.htm, 2007
    [46] [Globus1 2005] GT4 Admin Grid[EB/OL], http://www.globus.org/globus/doc/doc/www.globus.org/toolkit/docs/4.0/admin/docbook/admin.pdf 2005
    [47] [Globus2 2006] GT 4.0 Reliable File Transfer (RFT) Service[EB/OL]. http://www.globus.org/toolkit/docs/4.0/data/rft/index.pdf
    [48] [Globus3 2006] GT 4.0 GridFTP[EB/OL]. http://www.globus.org/toolkit/docs/4.0/data/gridftp/index.pdf
    [49] [Globus4 2006] GT 4.0 RLS. http://www.globus.org/toolkit/docs/4.0/data/rls/index.pdf
    [50] [Globus5 2006] GT 4.0 Tech Preview: OGSA-DAI[EB/OL]. http://www.globus.org/toolkit/docs/4.0/techpreview/ogsadai/index.pdf
    [51] [Globus6 2006] GT 4.0 Tech Preview: Data Replication Service (DRS)[EB/OL].http://www.globus.org/toolkit/docs/4.0/techpreview/datarep/index.pdf
    [52] [Globus7 2006]GT 4.0: Security: Community Authorization Service[EB/OL]. http://www.globus.org/toolkit/docs/4.0/security/cas/index.pdf
    [53] [Globus8 2006] GT 4.0: Information Services[EB/OL]. http://www.globus.org/toolkit/docs/4.0/info/index/index.pdf
    [54] [Gucrraoui 1997] R. Gucrraoui, A. Schiper. Software-based Replication for Fault Tolerance[J]. IEEE Computer, 30(4):68-74, Apr. 1997.
    [55] [Gurmeet 2003] Gurmeet Singh, Shishir Bharathi, Ann Chervenak, Ewa Deelman, Carl Kesselman, Mary Manohar, Sonal Patil, Laura Pearlman. A Metadata Catalog Service for Data Intensive Applications[C]. Proceedings of Supercomputing 2003 (SC2003), November 2003.
    [56] [Hadzilacos 1993] V. Hadzilacos, S. Toueg. Fault-tolerant broadcasts and related problems[C]. Acm Press Frontier Series archive Distributed systems (2nd Ed.). New York, NY, USA: ACM Press/Addison-Wesley Publishing Co:1993.
    [57] [Holland 1975] Holland J H. Adaptation in Natural and Artificial System [M]. Ann Arbor :Michigan University Press ,1975.
    [58] [Hoschek 2001] Wolfgang Hoschek, et.al, Grid Enabled Relational Database Middleware[R]. Informational Document Global Grid Forum, Frascati, Italy: 7-10 October, 2001.
    [59] [Hu 2005] J. Hu, Xiao N, Zhao Y, et al. An Asynchronous Replica Consistency Model in Data Grid[C]. Parallel and Distributed Processing and Applications (ISPA 2005 Workshops). Nanjing: Springer-Verlag, November, 2005: p475-484.
    [60] [Hyo 2000] Hyo J. Song, Xin Liu, Dennis Jakobsen, Ranjita Bhagwan, Xianan Zhang. The MicroGrid: a Scientific Tool for Modeling Computational Grids[C]. In Proceedings of Super Computing 2000, 2000.
    [61] [Ion 2003] Ion S, Robert M, David L, David K, Frans KM, Frank D, Hari B. Chord: A scalable peer-to-peer lookup protocol for internet applications[C]. IEEE/ACM Trans. on Networking, 2003,11(1):17-32.
    [62] [Hector 2001] Hector Garcia-Molina, Jeffrey D.Ullman, Jennifer Widom.数据库系统实现[M].北京:机械工业出版社,2001.
    [63] [Jeremy 2001] Jeremy Sugerman ,Ganesh Venkitachalam, et al1Virtualizing I/ O Devices on VMware Workstationps Hosted Virtual Machine Monitor [C] Proc1Usenix Annual Technical Conference ,2001
    [64] [Jiang 2002] Jiang L, Xiaotao L, Prashant S, Krithi R. Consistency maintenance inpeer-to-peer file sharing networks[C]. In: Proc. of the 3rd IEEE Workshop on Internet Applications. Washington: IEEE Computer Society, 2002: p90-94.
    [65] [Jurk 2003] S. Jurk, M. Neiling. Client-side Dynamic Preprocessing of Transactions. ADBIS Conf. Dresden, Germany, 2003. p103-117.
    [66] [Kamath 1993] Kamath Y. H., Smilan R. E. & Smith J. G. Reaping Benefits With Object-Oriented Technology[J]. AT&T Technical Journal 72, 5 (September/October 1993): p14-24.
    [67] [Kavitha 2002] Kavitha Ranganathan, Ian Foster. Decoupling Computation and Data Scheduling in Distributed Data-Intensive Applications [A]. In HPDC-11: Proceedings of 11th IEEE International Symposium of High PerformanceDistributed Computing [C]. Edinburgh, Scotland, 2002.
    [68] [Kevin 2003] Kevin O'Neill, Ray Cramer, Marta Gutierrez, Kerstin Kleese van Dam,Siva Kondapalli, Susan Latham, Bryan Lawrence, Roy Lowry, Andrew Woolf. The Metadata Model of the Nerc Data Grid[EB/OL]. http://ndg.nerc.ac.uk/public_docs/AHM-2003-KON.pdf, 2003.
    [69] [Koranda 2003] S. Koranda and B. Moe. Lightweight Data Replicator[EB/OL]. http://www.lscgroup.phys.uwm.edu/lscdatagrid/LDR/ overview.html, 2003.
    [70] [Kubiatowicz 2000] J. Kubiatowicz, D. Bindel, Y. Chen, S. Czerwinski, P. Eaton, D. Geels, R. Gummadi, S. Rhea, H. Weatherspoon, W. Weimer, C. Wells, and B. Zhao. Oceanstore. An Architecture for Global-scale Persistent Storage[C]. In Proceedings of the Ninth international Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2000), November 2000.
    [71] [Lamehamedi 2003] Lamehamedi H, Shentu Z, Szymanski B, et al.. Simulation of Dynamic Data Replication Strategies in Data Grids [A]. In HCW2003: Proceedings of 12th Heterogeneous Computing Workshop. Nice, France, 2003.
    [72] [Lan 2002] Lan J, Liu X, Shenoy P, Ramamrit ham K. Consistency maintenance in Peer-to-Peer file sharing networks[C]. In Proceedings of the 3rd IEEE Workshop on Internet Applications. Washington , USA , 2002: p90-94.
    [73] [LIGO 2003] Laser Interferometer Gravitational Wave Observatory[EB/OL]. http://www.ligo.caltech.edu/.
    [74] [Liu 2003] Xin Liu and Andrew Chien. Traffic-based Load Balance for Scalable Network Emulation[C]. In Proceedings of the ACM Conference on High Performance Computing and Networking, 2003.
    [75] [Liu 2007] Gui Liu, Hongyu Zhu, Xianghui Xie, Xiaoliang Lu, Linsheng Lu. A DataAccess Scheme of Heterogeneous Data Resource in Grid[C]. The 6th International Conference on Grid and Cooperative Computing (GCC2007), 2007.08: p160-167.
    [76] [Liu 2009] Gui Liu, Hongyu Zhu, Hailiang Wei, Xin Wang, Wei Peng. Research on Data Interoperability Based on Clustering Analysis in Data Grid. The 5th International Conference on Interoperability for Enterprise Software and Applications ( I-ESA China 2009 )[C].BeiJing:2009.04.
    [77] [Lin 1989] K. J. Lin. Consistency Issues in Real-Time Database Systems. In Proceedings of the 22nd Hawaii International Conference on Systems Sciences[C], 1989: p654-661.
    [78] [MACT 2008] Metadata Catalog[EB/OL]. http://www.sdsc.edu/srb/index.php/FAQ,2008.
    [79] [Marc 2008] Marc-Elian Bégin. An EGEE Comparative Study: Grids and Clouds Evolution or Revolution. Session: Exploring Cloud Computing, OGF23 Barcelona, Spain, June 2, 2008
    [80] [Marius 2002] Marius P, Aruna S. The cost of application-level broadcast in a fully decentralized peer-to-peer network[C]. In Proceedings of the 7th Int’l Symp. on Computers and Communications. Washington: IEEE Computer Society, 2002: 941-946.
    [81] [Michael 2009] Michael Armbrust, Armando Fox, Rean Griffith, Anthony D. Joseph, Randy H. Katz, Andrew Konwinski, Gunho Lee, David A. Patterson, Ariel Rabkin, Ion Stoica and Matei Zaharia .Above the Clouds: A Berkeley View of Cloud Computing[R]. EECS Department University of California, Berkeley Technical Report No. UCB/EECS-2009-28 February 10, 2009
    [82] [Morita 2003] Y. Morita, H. Sato, Y. Watase, O. Tatebe, S. Sekiguchi, S. Matsuoka, N. Soda, and A. Dell'Acqua. Building a high performance parallelle system using Grid Datafarm and ROOT I/O[C]. In Proceedings of the 2003 Computing in High Energy and Nuclear Physics (CHEP03), La Jolla, CA: March 2003.
    [83] [OGSA-DAI 2005] OGSA-DAI WSRF 2.1 User Guide[EB/OL], http://www.ogsadai.org/releases/ogsadai.html. 2005
    [84] [Park 2003] S.M. Park, J.H. Kim and Y.B. Ko. Dynamic Grid Replication Strategy based on Internet Hierarchy[C]. In Proceedings of Grid and Cooperative Computing (GCC03). 2003:p838-846,.
    [85] [Paton 2002] Norman W Paton, Malcolm P Atkinson, Vijay Dialani, Dave Pearson et.al. Database Access and Integration Services on theGrid[C]. UK e-Science Program Technical Report Number UkeS-2 2-01,2002
    [86] [Powell 1991] D. Powell, M. Chereque, D. Drackley. Fault-tolerance in Delta-4[J]. ACM Operating Systems Review, SIGOPS, 25(2):122-125, Apr. 1991
    [87] [Rajasekar 2003] Arcot Rajasekar. Introduction to SRB, August 2003[EB/OL].http://www.sdsc.edu/srb/Pappres/srbIntro.ppt
    [88] [Rajive 1998] Rajive Bagrodia, Richard Meyer, Mineo Takai and Yu-an Chen. Parsec: A Parallel Simulation Environment for Complex Systems[J]. IEEE Computer, Vol. 31(10), p77-85, 1998.
    [89] [Ranganathan 2001] K. Ranganathan and I. Foster. Design and Evaluation of Dynamic Replication Strategies for High Performance Data Grids[C]. In Proceedings of International Conference on Computing in High Energy and Nuclear Physics, Beijing, China: September 2001.
    [90] [Ranganathan 2002] K. Ranganathan and I. Foster. Identifying Dynamic Replication Strategies for High Performance in Data Grids[C]. In Proceedings of International Workshop on Grid Computing, Denver, CO: November 2002
    [91] [Ranjita 2002] Ranjita B, Stefan S, Geoffrey V. Replication Strategies for Highly Available Peer-to-Peer Storage Systems[C]. Technical Report, CS2002-0726, UCSD, 2002.
    [92] [Ripeanu 2001] M Ripeanu. Peer-to-peer Architecture Case Study: Gnutella network[C]. In Proceedings of Int’l Conf on Peer-to-Peer Computing Sweden: IEEE Computer Press, 2001:. p99-101.
    [93] [Robert 2001] Robert Jones. Status of EU DataGrid project and test-bed 1 [EB/OL]. First Presentation At 2nd NorduGrid, Oslo, Norway, 12.01,2001. http://web.datagrid.cnr.it/pls/portal30/docs/2048.ppt, 2001.
    [94] [Sam 2003] Installation Guide for Setting up a SAM Station in Clusterlike Environments [EB/OL]. http://d0db.fnal.gov/sam/documents.html, 2003
    [95] [Schiper 1993] A. Schiper, A. Sandoz. Uniform Reliable Multicast in a Virtually Synchronous Environment[C]. In Proceedings of the 13th International Conference on Distributed Computing Systems (ICDCS-13). Pittsburgh, Pennsylvania, USA: IEEE Computer Society Press, May 1993:p561-568.
    [96] [Schneider 1990] F. B. Schneider. Implementing fault-tolerant services using the state machine approach[J]. A tutorial. ACM Computing Surveys,22(4), Dec. 1990:299-319.
    [97] [Shoshani 2002] A. Shoshani, A. Sim, and J. Gu. Storage Resource Managers: Middleware components for Grid storage[C]. In Nineteenth IEEE Symposium on Mass Storage Systems, 2002.
    [98] [Smith 2002] Jim Smith, Anastasios Gounaris, Paul Watson, Norman W Paton, Alvaro A.A. Fernandes, Rizos Sakellariou. Distributed Query Processing on the Grid[C]. In Proceedings of Grid Computing 2002, Springer, LNCS 2536, 2002:p279-290.
    [99] [SRB 2006] SRB User Manual[EB/OL].http://www.sdsc.edu/srb/index.php/SRB_User_Manual, 2006.
    [100] [Sun 2004] Yu-zhong Sun, Zhi-wei Xu. Grid Replication Coherence Protocol[C]. The 18th International Parallel and Distributed Processing Symposium, Santa Fe, USA: April 2004: p232-239.
    [101] [Qin 2002] Qin L, Pei C, Edith C, Kai L, Scott S. Search and replication in unstructured peer-to-peer networks[C]. In: Proceedings. of the 16th ACM Int’l Conf. on Supercomputing (ICS 2002). New York: ACM Press, 2002: p84-95.
    [102] [UK 2006] UK National Grid[EB/OL]. http://www.nationalgrid.com/uk/, 2006
    [103] [Watson 2001] Paul Watson. Databases and The Grid[R]. Technical Report CS-TR-755, University of Newcastle, 2001
    [104] [Wang 2003] Wang QB, Dai YF, Tian J, Zhao T, Li XM. An Infrastructure for Attribute Addressable P2P Network(in Chinese with English abstract) [J]. Journal of Software, 2003,14(8):1481-1488.
    [105] [Wiesmann 2000] M.Wiesmann, F.Pedone, A.Schiper, B.Kemme,G.. Alonso. Understanding Replication in Databases and Distrihuted Systems[C]. In Proceedings of the 20th International Conference on Distributed Computing Systems (ICDCS 2000),April 10-13, 2000:p464.
    [106] [William 2003] William H B, David G C, Luigi C, et al. OptorSim-a grid simulator for studying dynamic data replication strategies[J]. International Journal of High Performance Computing Applications, 2003, 10(3): 256-268.
    [107] [Woolf 2003] Andrew Woolf, et.al, Data Virtualisation in the NERC DataGrid[EB/OL]. http://ndg.badc.rl.ac.uk/public_docs/AHM-2003-AW.pdf.2003.
    [108] [Yang 2008] Chao-Tung Yang, Chun-Pin Fu, Chien-Jung Huang, Ching-Hsien Hsu. FRCS: A File Replication and Consistency Service in Data Grids[A]. In Proceedings of the International Conference on Multimedia and Ubiquitous Engineering in 2008[C]:2008: p444-447.
    [109] [Yoshio 2004] Yoshio Tanaka. Asia Pacific Grid: Towards a production Grid [EB/OL]. http://www.apgrid.org/documents/ApGrid-Tanaka.ppt:May 2004.
    [110] [Zhijun 2004] Zhijun W, Das SK, Kumar M, Huaping S. Update propagation through replica chain in decentralized and unstructured P2P systems[C].In Proceedings of the 4th Int’l Conf. on Peer-to-Peer Computing. Washington: IEEE Computer Society, 2004: 64-71.
    [111] [巩敦卫2002]巩敦卫孙晓燕郭西进.一种新的优胜劣汰遗传算法[J].控制与决策,2002年06期.
    [112] [刘瑰2006]刘瑰等.利用OGSA-DAI实现异构数据的透明访问[J].高性能计算技术. 2006年03期.
    [113] [刘瑰2007]刘瑰等.数据网格中访问代理中间件的设计与实现[J].计算机工程. 2007年18期,p42-44.
    [114] [石柯2004]石柯,王庆春,吴松.数据网格中一种基于副本和缓存的元数据管理系统[J].计算机研究与发展. 2004年第12期,p2206-2210.
    [115] [孙霞2004]孙霞,郑庆华.教育资源元数据语义扩展查找方法的研究[J].计算机研究与发展. 2004年第12期,p2170-2174.
    [116] [肖侬2003]肖侬,付伟,黄斌,卢锡城.数据网格系统的设计与关键技术实现[C].国防科学计算大学计算机学院, http://www.chinagrid.net/grid/paperppt/Griddaen.doc,中国计算机世界大会,2003.
    [117] [阎保平2004]阎保平.科学数据网格与e-Science. http://www.chinagrid.net,2004
    [118] [张晓林2002]张晓林.元数据研究与应用[M].北京:北京图书馆出版社,2002.
    [119] [周翔鹰2006]周翔鹰.基于VMware构建虚拟计算机网络实验[J].实验室研究与探索,2006年第7期.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700