用户名: 密码: 验证码:
面向城域网的网络流量建模研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
网络流量模型是网络性能分析与网络规划设计的基础,精确的网络流量模型对设计高性能网络协议、高性能的网络设备、高效网络拓扑结构、流量预测与网络规划、拥塞控制与负载均衡等都具有重要意义。随着近年来互联网的日益普及和应用的急剧增加,骨干网流量的特征也发生了很大改变,传统的网络流量模型已经不能适应当前的骨干网流量。因而,研究当前骨干网流量的规律,设计能反映其特征的模型,是今后一段时间网络流量领域研究的一个重要方向。
     论文以陕西省自然科学基金项目(基金编号:2005F43)及2003年、2004年西安宽带多媒体城域网优化项目为背景,对城域网网络流量进行了深入研究,取得了预期的效果,在自相似网络流量理论和实际应用之间架设了一个桥梁。论文的主要内容和创新点包括:
     1、通过对城域网大量真实网络流量数据的分析和严格检验,发现城域网网络流量的概率密度分布服从gamma分布,探索出一条从概率密度分布特征出发来研究网络流量的新途径。基于上述结论,对美国Rice大学RudolgH Riedi所提出的、在网络流量研究领域具有重要影响的MWM模型进行改造,建立了一个基于gamma分布和小波的网络流量仿真模型(GWM)。仿真实验显示,该模型在仿真流量的特征、算法的合理性等方面,均较MWM模型有了显著的提高。
     2、发现了城域网流量聚合过程的规律,揭示了城域网流量之所以服从gamma分布的成因,把对网络流量特征的认识提升到一个新的高度。利用上述规律,预测了城域网网络流量的演化趋势,为提高城域网网络规划的前瞻性奠定了基础。
     3、发现了网络流量的概率密度分布与观测尺度之间的量化关系,刻画了网络流量复杂而细致的自相似结构,从而打破了10多年来仅能依靠方差和数学期望等这样简单的统计特征来定义网络流量自相似性的现状,深化了对自相似现象的理解。作为应用实例,这些研究成果在网络设备的缓存设计工作中,对于网络流量排队分析起了重要作用。
     4、建立了一个基于gamma分布和多重随机二分法的网络流量仿真模型(GIRDM)。该模型的仿真实验显示,其仿真流量能够精确地反映实际网络流量的特征,并可以通过调整输入参数的值,方便地改变仿真流量自相似的程度,从而性能较GWM模型又有了进一步提高。该模型发表在2005年8月的《中国通信学报(英)》上,详细的算法设计将发表在2007年的《系统仿真学报》。华为公司在2006年基金项目中,也提出了基于层叠模型的网络流量异常检测等课题,旁证了该模型的实际价值。
     5、设计的基于SNMP的网络流量采样工具,最小尺度达到10毫秒,远高于目前同样采用SNMP方法的流量采样所能达到的精度,有能力捕捉到流量在微小尺度上的精细特征。流量样本直接采自一个实际城域网,样本代表性强。
Network traffic model acts as the fundamental issue of network performance analysis and network planning. Especially, precise network traffic model is essential to high performance network protocol constructing, network device designing, effective network topology deployment, traffic forecasting, congestion control, load balance, and so on. Due to the surging popularity and increasing utility of the Internet, backbone network is rapidly growing in size, speed and scope, accompanied with the evolving of their traffic properties. As a result, the traditional network traffic model does no longer accordant with traffic of current backbone network. Consequentially, it is a matter of great urgency to research and model the characteristics of backbone network traffic in the near future.
     In this dissertation, the network traffic of metropolitan area network (MAN) are researched thoroughly on a background of the natural science fund of shannxi province (2005F43) and the project of Xi'An broadband multimedia MAN optimization in 2003 and 2004. The result achieves the expectation and sets up a bridge between the self-similar theory and the practice of network traffic. Main research works and creative contributions in the dissertation include:
     1. The network traffic is discovered and verified to be accordant with gamma distribution based on analysis of a great deal of traces collected from a real MAN. This result provides a new approach using probability density distribution property for the study of network traffic. Based on it, the influential MWM set up by Rudolf.H Riedi of Rice University (US) is rebuilded to be a gamma distribution and wavelet based model (GWM). Performance of the improved model is showed to be more effective than Riedi's MWM by the experiment, espeically in aspects of statistcal properties of the traffic generated by the model and rationality of the algorithm used in the model.
     2. Laws appeared in the process of traffic aggregating are exposed and the cause why the MAN network traffic appears gamma distribution is also discovered. These achievements enriche the knowledge of network traffic properties greatly and provide a necessary basis for network deployment with foresight.
     3. Quantificational relationship between the probability density distribution of the network traffic and the related time scale is discovered, thus, the complex but delicate structure of the MAN traffic is well illustrated. The improvement greatly deepens the understanding on self-similar phenomena and notablely ends the status lasted for more then ten years that the self-similarity of the network traffic can only be described with the simple statistical variables such as the variance and the expectation. As an application instance, it plays important role in the queue anaysis, in the buffer design of network device.
     4. A gamma distribution and iterative random dichotomy based model (GIRDM) is set up. The experiment shows that the model not only accurately maps the probability density distribution property of the real network traffic accurately, but also has the ability to alter the self-similar degree of the generated network traffic through adjusting the values of the input parameters. Consequentially, performance of the GIRDM is improved greatly than GWM. The model is published at china communication in Aug 2005 and its detailed algorithm will appeared in Journal of System Simulation. Furthermore, its worthiness is emphasized by the fund of Huawei Corp. in 2006, in which, practical projects such as abnormal network traffic detection based on cascade are supported.
     5. A network traffic sampling tool based on SNMP is elaborately developed to ensure the time scale as precise as ten milliseconds, which is more accurate than before. This advantage makes it possible to catch the details of the traffic porperty in small scale. Traces in the study are collected directly from a real MAN network.
引文
[CNNIC06] 中国互联网络发展状况统计报告,http://www.cnnic.net.cn/,2006年7月。
    [Kle76] L.Kleinrock. Queueing Systems. Volume Ⅱ: Computer Applications, John Wiley & Sons, 1976.
    [Le194] W.E.Leland, M.S.Taqqu, W. Willinger, On the Self-Similar Nature of Ethemet Traffic, IEEE/ACM Transactions on Networking, 1994, 2(1):1-15.
    [SPRINT] http://ipmon.sprintlabs.com/
    [IMC05] http://www.usenix.org/events/imc05/
    [NetF] http://netforum.net9.org/.
    [Bstap] Bstap,网络流量研究的意义与方向探讨,http://netforum.net9.org/,Nov 2005。
    [NSFC05] 国家自然科学基金委员会《网络与信息安全》项目指南,http://www.nsfc.gov.cn/nsfc/cen/02/htmlcreated/2004jh/2005_05_30.htm,2005年
    [Sta98] William Stallings, High-speed networks TCP/IP and ATM design principles, Prentice-Hall.Inc, 1998.
    [Kli94] Steven M.Klivansky, A.Mukherjee, C.Song, "On Long Range Dependence in NSFNET Traffic", Technical Report GIT-CC-94/61, Geogia Institute of Technology, Altlanta, GA 30332, USA, December 1994.
    [Pax95] Vern Paxson, Sally Floyd, Wide-Area Traffic: The Failure of Poisson Modeling, IEEE/ACM Transactions on Networking, 1995, 3(3).
    [Norr94] I.Norros, A Storage Model with Self-similar Input. Queueing System, 1994, 16:387-396.
    [CROV96] Mark E. Crovella and Azer Bestavros, Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes. IEEE/ACM Transactions on Networking, 1997, 5: 835-846.
    [Will97] W. Willinger, M.S.Taqqu, R.Sherman, Self-similar Through High-variability: Statistical Analysis of Ethemet LAN Traffic at Source Level. April 15, 1997.
    [Nag98] Krishnamurthy Nagarajan, Fractional ARIMA Processes and Its Applications in Network Traffic Modeling: A Qualifying Examination Report of Philisophy in Electrical Engineering, August 1998.
    [Jiak99] Jiakun Liu, Yantai Shu, Lianfang Zhang, Traffic Modeling Based on FARIMA Models, Proceedings of the 1999 IEEE canadian Conference on Electrical and Computer Engineering, May 9-12 1999.
    [HHitc05] F.C. Harmantzis, D. Hatzinakos, Heavy Network Traffic Modeling and Simulation Using Stable FARIMA Processes. http://personal.stevens.edu/, 2005.
    [ZLC00] 张鹏,廖建新,程时端,自相似业务量的多重分形分析[J],电子学报,2000,28(1):96-98。
    [zLi06] 赵其刚,李群湛,基于混沌预测的IP网动态QoS实现,铁道学报,2006,28(3):47-53。
    [Rie99] Rudolf H.Riedi, Mattew S.Crouse, Vinay J.Ribeiro, A Multifractal Wavelet Model with Application to Network Traffic. IEEE Transactions on Information Theory, April, 1999.
    [Rib00] Vinay J. Ribeiro, Rudolf H. Riedi, Matthew S. Crouse, Multiscale Queuing Analysis of Long-Range-Dependent Network Traffic. Proceedings IEEE INFOCOM'00, March 2000.
    [Long04] 龙图景,孙政顺,李春文等,一种新的网络业务流的多重分形小波模型,计算机学报,2004,27(8):1074-1082。
    [Zqg06] 赵其刚,李群湛,基于混沌预测的IP QoS模型与机制,计算机工程,2006,32(13):23-25。
    [Kim03] Sunggon Kim, Ju Yong Lee, A Shifted Gamma Distribution Model for Long-Range Dependent Intemet Traffic. IEEE COMMUNICATIONS LETTERS, 2003, 7(3).
    [Nor94F] I.Norros, On the Use of Fractional Brownian Motion in the Theory of Connectionless Networks, September24, 1994.
    [Kin97] Kihong Park, On the Effect and Control of Self-similar Network Traffic: A Simulation Perspective, Proceedings of the 1997 Winter Simulation Conference.
    [Chen98] 陈惠民,蔡弘,李衍达,自相似网络:基于多分辨率采样和小波分析的Hurst系数估计方法.电子学报,1998,26(7):88-93。
    [Abry98] Patrice Abry, Darryl Veitch,Wavelet Analysis of Long Range Dependent Traffic, IEEE Transactions on Information Theory, 1998, 44(1).
    [YJB02] S.H. Yook, H. Jeong, A.L.Barab, Modeling the Internet's Large-scale Topology. Proc. Nat. Acad. Sci. 99, pp.13382-13386, 2002.
    [WiL04] Walter Willinger, David Alderson, John C, More "Normal" Than Normal: Scaling Distributions and Complex Systems. Proceedings of the 2004 Winter Simulation Conference.
    [KMF04] Thomas Karagiannis, Mart Molle, and Michalis Faloutsos, Long-Range Dependence-Ten Years of Internet Traffic Modeling, 1089-7801, IEEE Computer Society, 2004.
    [KFMB04] T. Karagiannis, M. Faloutsos, M. Molle, A Nonstationary Poisson View of Internet Traffic. Proc. IEEE INFOCOM '04, 2004..
    [Liu05] Yong Liu, Don Towsley, Tao Ye, An Information-theoretic Approach to Network Monitoring and Measurement, In: IMC. New Orleans, LA, USA. October 2005.
    [NST05] Antonio Nucci, Ashwin Sridharan and Nina Taft, The Problem of Synthetically Generating IP Traffic Matrices: Initial Recommendations. In: Computer Communications Review. July 2005.
    [XLB05] Kuai Xu, Zhi-Li Zhang, Supratik Bhattacharrya, Profiling Internet Backbone Traffic: Behavior Models and Applications. In: ACM Sigcomm 2005. Philadelphia, PA. August 2005.
    [MTS02] A.Medina, N.Taft, K.Salamatian, "Traffic Matrix Estimation: Existing Techniques Compared and New Directions", ACM Sigcomm, Pitsburgh, PA, August 2002.
    [ZRD03] Y. Zhang, M. Roughan, N. Duffield, Fast Accurate Computation of Large-Scale IP Traffic Matrices from Link Loads, Proceedings of ACM Sigmetrics San Diego, CA, June 2003.
    [ZRL03] Y. Zhang, M. Roughan, C. Lund, An Information Theoretic Approach to Traffic Matrix Estimation, Proceedings of ACM Sigcomm, Karlsruhe, 2003.
    [NCT03] A. Nucci, R. Cruz, N. Taft, "Design of IGP Link Weight Changes for Estimation of Traffic Matrices", IEEE mfocom, Hong Kong, China, March 2004. Miami FL, May 2003.
    [SNL04] A. Soule, A. Nucci, E. Leonardi, How to Identify and Estimate the Top Largest Traffic Matrix Elements in a Changing Environment, Proceedings of ACM Sigmetrics, New York, June 2004.
    [Tuk86] Tukey, J.W, Data Analysis and Behaviorial Science or Learning to. Bear the Quantitative's Man Burden by Shunning Badmandments. In The Collected Works of John W. Tukey, ed. L.W. Jones, Vol. Ⅲ. Wadsworth. Monterey, CA. 1986,
    [Box98] G.P.E.Box, G.M.Jenkis, Time Series Analysis: Forecasting and Control. San Francisco: San Francisco Press, 1978.
    [MK01] T.Mori and R. Kawahara, "Is hurst parameter sufficient for evaluating the performance of bursty network traffic?," in Proc. 2001 Society Conf. of IEICE, 2001, pp. 39-40.
    [Lin01] 林闯,《计算机网络和计算机系统的性能评价》,清华大学出版社,2001年3月出版。
    [JLW02] 江勇,林闯,吴建平,网络传输控制的综合性能评价标准,计算机学报,2002,25(8):869-877。
    [Ye02] 叶守泽等,水文科学研究的世纪回眸与展望,水科学进展,2002,13(1),93.104。
    [Zh02] 张学文,《组成论(第十八章概率分布的统一)》,http://xjqxsc.idm.cn/zhangxw%20web/ZCL/index.htm, 2006。
    [ZVSC03] Z. Zhang, V. Ribeiro, S. Moon, Small-time scaling behaviors of Internet backbone traffic: An empirical study, IEEE 1NFOCOM, April 2003.
    [CFC03] Chuck Fraleigh, Fouad Tobagi and Christophe Dio, Provisioning IP Backbone Networks to Support Latency Sensitive Traffic. http://ipmon.sprintlabs.com/. 2003.
    [Cao04] J. Cao, W.S. Cleveland, Y. Gao, Stochastic Models for Generating Synthetic HTTP Source Traffic", Proceedings of IEEE INFOCOM, Hong Kong, March 2004.
    [NS_doc] NS manual, http://www.isi.edu/. P218-223.
    [David031] David Moore, Vern Paxson, Stefan Savage, Colleen Shannon, Stuart Staniford, and Nicholas Weaver. Inside the Slammer Worm. IEEE Security & Privacy, pages 33n39, July/August 2003.
    [David032] David Moore, Vern Paxson, Stefan Savage, Colleen Shannon, Stuart Staniford, and Nicholas Weaver. The Spread of the Sapphire/Slammer Worm, 2003.
    [MJ88] Mark Eichin and Jon Rochlis. With microscope and tweezers: An Analysis of the Internet Virus of November 1988. In Proc. IEEE Symposium on Research in Security and Privacy, 1989.
    [RKH96] B.V. Rao, K. R. Krishnan, and D. P. Heyman, "Performance of Finite-Buffer Queues under Traffic with Long-Range Dependence," Proc. IEEE GLOBECOM, vol. 1, pp. 607-611, November 1996.
    [xu02] 徐明伟,仝爱军,基于自相似模型的网络性能测试,计算机工程与应用,2002,38(5):56-59。
    [Bi02] 毕经平,李忠诚,吴起,大规模互联网端到端行为评价指标研究,计算机工程与应用,2002,38(12):144-150。
    [Ou02] 欧灿辉,李晓明,Web服务器性能评测,计算机研究与发展,2002,39(5):540-547。
    [Yang02] 杨雅辉,李小东,IP网络性能指标体系的研究,通信学报,2002,23(11)。
    [Gu02] 顾勇,寿国础,IP网络性能度量和测试方法的研究,电信科学,2002第9期。
    [wang02] 王治,罗琨,徐宁等,IP高速信息网络性能的实时监测技术研究,计算机工程于应用,2002,38(2):153-156。
    [Lian02] 连一峰,戴英侠,王航,基于模式挖掘的用户行为异常检测,计算机学报,2002,25(3):325-330。
    [Lng04] [粱洁等,网络仿真技术及其在163网广州POP节点建模中的应用,多媒体论坛2000。
    [Ma098] 茆诗松、王静龙,濮晓龙,《高等数理统计》,高等教育出版社,施普林格出版社,1998年7月第一版,2003年4月印刷。
    [zhu90] 朱燕堂,王朝杰,赵选民等,《数理统计》,西北工业大学出版社,1990年6月。(英)》,2005,2(5):22-26。
    [GaoX02] 高翔,苏广文等,入侵检测系统中的网络监测,《微电子学与计算机》,2002,19(2):37-39。
    [xaoQG03] 夏清国 高德远 苏广文等,基于宽带IP网络的电话虚拟专用网.P_VPN的研究,《微电子学与计算机》,2003,20(5):95-97。
    [Chen99] 陈惠民,蔡弘,李衍达。突发业务的多重分形建模及其参数估计,电子学报,1999,27(4)。
    [Xu99] 许都,李乐民。网络中业务流的自相似性与线性AR1模型,电子学报,1999,27(4)。
    [ZhXS99] 张连芳,薛飞,舒炎泰。自相似业务模型下的队列分析—大偏差技 术,通信学报,1999,20(4)。
    [Chu99] 褚立文,廖建新,陈俊亮。自相似业务合成流的建模及排队性能分析,通信学报,1999,20(8)。
    [ZhXS00] 张连芳,薛飞,舒炎泰,CERNET网络业务的自相似性及性能分析,天津大学学报(自然科学与工程技术版),2000,33(3):367-370。

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700