用户名: 密码: 验证码:
软计算方法和数据挖掘理论在电力系统负荷预测中的应用
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
数据挖掘技术能够从大量数据中提取人们感兴趣的潜在知识和信息,软计算是一种创建计算智能系统的有效方法。本文将两者相结合,完成负荷预测过程中的两个主要任务:电力负荷坏数据处理和多因素负荷预测建模。计算过程完全基于数据自动进行,提高了模型的智能化和科学性。
     本文概述了数据挖掘技术的有关内容,着重阐述其中两种重要的思想:分类方法与聚类分析,并详述它们的具体实现方法。本文关于负荷预测问题的研究始终贯穿了这两种思想。
     神经网络和模糊系统是软计算的重要基础,它们是设计智能系统的精髓。本文详细讨论了BP网、Kohonen网两种神经网络和TSK型模糊推理系统的原理、结构和算法等基本问题,其中前两者用于对负荷坏数据的处理,后者用在多因素负荷预测建模。
     负荷坏数据辨识是由负荷曲线抗差聚类和坏数据曲线模式分类两个顺序的过程组成的;本文通过对Kohonen网的抗差聚类和BP网模式分类的效果分析,设计由这两种网络组合而成的神经网络模型,较好地完成了坏数据辨识的任务。
     本文以模糊推理系统为基础,构建自适应神经—模糊系统建立预测模型。建模过程中解决了两个主要问题:模糊建模中的结构辨识问题和ANFIS系统参数辨识的收敛性问题。本文采用决策树分类方法完成结构辨识的任务,初步找出负荷变化的模式,有效减少了系统需要优化调整的参数数量;采用拟牛顿优化方法,较好地解决了大规模参数优化问题。
     模糊建模的难点是结构辨识中输入变量的选择和输入空间的划分。本文采用CART算法来解决结构辨识问题,它能够剔除无关变量并将输入空间划分成树状结构;在此基础上设计适当的隶属度函数,提高了参数辨识过程的精度和速度。
Data mining distills connotative knowledge and information from abundant data, while soft computing is an effective mothed to establish intelligent computing systems. This dissertation combines the two motheds to accomplish two main tasks: outliers processing in load data and multi-factor STLF (short term load forecasting) system modeling. The algorithms operate based on data completely, and are more intelligent and scientific.
    The dissertation layes stress on two main thoughts of data mining: classification and cluster analysis, some methods of which are discussed.
    Neural networks and fuzzy systems lay the two main foundations of soft computing. The principles, constructures and algorithms of BP and Kohonen network and the TSK fuzzy model are discussed. The two neural networks are combined to processing the outliers in load data, while the fuzzy model is the kernel of a multi-factor load forecasting system.
    The outlier identification is divided into two sequential parts: the robust day-load-curves cluster and the bad curve pattern classifcation. By analysising the effects of Kohonen network clustering and BP network classifcation, the dissertation designs an outlier identification model comprising these two kinds of neural network and implements the tasks of bad data identifications and adjustments.
    A STLF system is modeled based on ANFIS (Adaptive Neural-Fuzzy Inference System). Two main problems are sovled: fuzzy system structure identifications and ANFIS parameters identifications. A decision tree classifier is used to deal with the first task, which can find the load pattern preliminarily, and reduces the number of parameters to be adjusted. The Quasi-Newton method is applied to solving the second problem, which is suitable to optimizing large number of parameters.
    Difficulties of fuzzy modeling are the variables selection and the input space division in the fuzzy structure identification. CART can eliminate the unnecessary variables, and then divides the input space as a tree-shape. Appropriate membership functions improve the convergency of the parameters identification.
引文
[1] J.R.Quinlan, simplifying decision trees. Int. J.Man-Machine Studies. Vol.27, pp.221-234, 1987.
    [2] Floriana Esposito, Donato Malerba, Giovanni Semeraro, A Comparative Analysis of Methods for Pruning Decision Trees, IEEE Trans. Pattern Analysis and Machine Intelligence, v19, No.5, May 1997.
    [3] L.Breiman, J.H.Friedman, R.A.Olshen, and C.J.Stone. Classification and regression trees. Wadsworth, Inc., Belmont, Californiz, 1984.
    [4] J.R.Quinlan, Induction of decision trees, Machine Learning, Vol. 1, No.1, pp.81-106, 1986.
    [5] J.R.Quinlan, Decision trees and decisionmaking, IEEE Trans. Systems, Man, and Cybernetics, Vol.20, No.2, March/April, 1990.
    [6] Tjen-sien Lim, Wei-yin Loh, Yu-shan Shih, A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms, Machine Learning, 40, 203-228, 2000.
    [7] Olcay Taner Yildiz and Ethem Alpaydm, Omnivariate Decision Trees, IEEE Trans. On Neural Networks, vol. 12, No.6, November 2001.
    [8] S.Rasoul Safavian and David Landgrebe, A Survey of Decision Tree Classifier Methodology, IEEE Trans. On Systems, Man, and Cybernetics, vol.21, No.3, May/June 1991.
    [9] 朱绍文,胡宏银,王泉德,张大斌,黄浩,陆玉昌。决策树采掘技术及发展趋势,计算机工程,26卷,第10期,2000年10月。
    [10] 边肇祺,张学工 等。模式识别。清华大学出版社。2001年5月。
    [11] C.Wu, D.Landgrebe, and P. Swain, The decision tree approach to classification, School Elec. Eng., Purdue Univ., W. Lafayette, IN, Rep.RE-EE 75-17, 1975.
    [12] Allan P. White, Wei Zhong Liu, Bias in information-based measures in decision tree induction, Machine Learning, 15, 321-329, 1999.
    [13] P. Swain and H.Hauska, The Decision Tree Classifier Design and potential, IEEE Trans. Geosci. Electron, vol. GE- 15, pp. 142-147, 1977.
    [14] Philips A. Chou, Optimal Partitioning for Classification and Regression Trees, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 13, No.4, April 1991.
    
    
    [15] Philips A. Chou, Tom Lookabauch, and Robert M.Gray, Optimal Pruning with Applications to Tree-Structured Source Coding and Modeling, IEEE Trans. Information Theory, Vol.35, No.2, March 1989.
    [16] Spanger, Fayyad and Uthurusamy, Induction of decision trees from inconclusive data, in proc. Sixth Int. Workshop on Machine Learning. A.M.Segre, Ed., (Los Altos, CA: Morgen Kaufmann), 1989.
    [17] J.Pearl, on the connection between the complexity and credibility of inferred models, Int. J.Gen. Systems in the Micro Electronic Age, D.Michie, Ed.Edinburgh Univ. Press, 1979.
    [18] W.J.Leech, A rule based process control method with feedback, Proc. Instrument Soc. Of Amer. Conf., Houston, TX, pp.169-175, 1986.
    [19] W. Buntine and D.Stirling, Interactive induction, Proc. IEEE Conf. AI Appl., San Diego, CA, 1988.
    [20] 吕安民 等,数据挖掘和知识发现的技术方法,测绘科学 2000 v25 No4.
    [21] Jiawei Han, Micheline Kambr, 数据挖掘:概念与技术,北京:高等教育出版社,2001年5月。
    [22] J.N.Morgan and J.A.Sonquist. Problems in the analysis of survey data, and a proposal. Journal of American Statistics Association, 58: 415-434, 1963.
    [23] Sonquist, J.A., E.L.Baker, and J.N.Morgan, Searching for structure. Institute for Social Research, Univ. of Michigan, Ann Arbor, MI, 1971.
    [24] S.B.Gelfand, C.S.Ravishankar, and E.J.Delp, An Iterative Growing and Pruning Algorithm for Classification Tree Design. IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 13, no.2, pp.138-150, 1991.
    [25] J.Kittler and P.A.Devijver, Statistical Properties of Error Estimators in Performance Assessment of Recognition Systems. IEEE Trans. Pattern Analysis and Machine Intelligence, vol.4, no.2, pp.215-220, 1982.
    [26] D.Malerba, G. Semeraro, and G. Esposito, Choosing the Best Pruned Decision Tree: A Matter of Bias, Proc. Fifth Italian Workshop on Machine Learning, Parma, Italy, pp.33-37, 1994.
    [27] Snedecor, G.W.& Cochran, W.G. (1980), Continuity correction for the binomial distribution, Statistical Methods(7th Edition). Iowa State University press, pp. 117.
    
    
    [28] Thomas G. Dietterich, An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization, Machine Learning, 40, 139-157, 2000.
    [29] Eric Bauer, Ron Kohavi, An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants, Machine Learning, 36, 105-139, 1999.
    [30] Sreerama K.Murthy, Automatic construction of decision trees from data: a multi-disciplinary survey, Data mining and knowledge discovery,2,345-389,1998.
    [31] J.Mui and K.S.Fu, Automated classification of nucleated blood cells using a binary tree classifier, IEEE Trans. Patt. Anal. Mach. Intell., vol. PAMI-2,pp.429-443, 1980.
    [32] K.C.You and K.S.Fu, An approach to the design of a linear binary tree classifier, in Proc. 3rd Symp. Machine Processing of Remotely Sensed Data, Purdue Univ., W. Lafaytte, IN, 1976.
    [33] J.R.Quinlan and R.L.Rivest, Inferring decision trees using the minimum description length principle. Information and Computation, vol.80, no.3, pp.227-248, 1989.
    [34] 钟晓 等,数据挖掘综述,模式识别与人工智能,V14,NO.1,2001.
    [35] P. Swain and H.Hauska, The decision tree classifier design and potential, IEEE, Trans. Geosci. Electron, vol. GE-15, pp.211-219, 1984.
    [36] 俞金寿,数据挖掘技术,石油化工自动化,2000,6:38。
    [37] A.K.Jam, M.N. Murty, P.J.Flyn, Data Clustering: A Review, ACM Computing Surveys. Vol.31, No.3, September 1999.
    [38] M.Halkidi, Y. Batistakis, M. Vazirgiannis, Clustering algorithms and validity measures, Scientific and Statistical Database Management, 2001. SSDBM 2001. Proceedings. Thirteenth International Conference on, 2001.
    [39] Chedsada Chinrungrueng. Evaluation of Heterogeneous Architectures for Artificial Neural Networks. PhD thesis, University of California at Berkeley, May 1993.
    [40] 张智星,孙春在,(日)水谷英二 著,张平安,高春华 等 译,神经、模糊和软计算,西安:西安交通大学出版社,2000年2月。
    [41] S.P. Lolyd. Least squares quantization in PCM. IEEE Trans. On Information Theory, (2):129-137, 1982.
    [42] J.Moody and C.Darken. Fast learning in networks of locally--tuned processing units.
    
    Neural Computation, 1:281—294, 1989.
    [43] 方开泰,潘恩沛 著,聚类分析,北京:地质出版社,1982年。
    [44] M.Ester, H.-P. Kriegel, J.Sander, and X.Xu, Adensity-based algorithm for discovering clusters in large spatial databaes. In Proc. 1996 Int. Conf. Knowledge Discovery and Data Mining(KDD'96), pages 226-231, Porland, OR, Aug. 1996.
    [45] M.Ankerst, M.Breunig, H.-P. Kriegel, and J.Sander, OPTICS: Ordering points to identify the clustering structure. In Proc. 1999 ACM-SIGMOD Int. Conf. Management of Data(SIGMOD'99), pages 49-60, Philadelphia, PA, June 1999.
    [46] Alexander Hinneburg, Daniel Keim, An Efficient Approach to Clustering in Large Multimedia Databases with Noise. In Proc. 1998 Int. Conf. Knowledge Discovery and Data Mining(KDD'98), pages 58-65, New York, Aug. 1998.
    [47] 罗耀光 盛立东,模式识别,人民邮电出版社,1989年6月。
    [48] G. Karypis, E.-H.Han, and V. Kumar. CHAMELEON: A hierarchical clustering algorithm using dynamic modeling. Computer, 32:68-75, 1999.
    [49] Tian Zhang, Raghu Ramakrishnman, Miron Linvy. BIRCH: An Efficient Method for Very Large Databases, ACM SIGMOD, Montreal, Canada, 1996.
    [50] S.Guha, R.Rastogi, and K. Shim. Cure: An efficient clustering algorithm for large databases. In Proc. 1998 ACM-SIGMOD Int. Conf. Management of Data(SIGMOD'98), pages 73-84, Seattle, WA, June 1998.
    [51] 杨曾武 统计预测原理,中国财政经济出版社。
    [52] S.Guha, R.Rastogi, and K.Shim. Rock: A robust clustering algorithm for categorical attributes. In Proc. 1999 Int. Conf. Data Engineering(ICDE'99), pages 512-521, Sydney, Australia, Mar. 1999.
    [53] W. Wang, J.Yang, and R.Muntz. STING: A ststistical information grid approach to spatial data mining. In Proc. 1997 Int. Conf. Very Large Data Bases(VLDB'97), pages 186-195, Athens, Greece, Aug. 1997.
    [54] Ronald R.Yager and Dimitar P. Filev, Approximate clustering via the mountain method. IEEE Trans. On Systems, Man, and Cybernetics, 24:1279-1284, 1994.
    [55] S.L.Chin, Fuzzy model identification based on cluster estimation. Journal of intelligent and Fuzzy Systems, 2(3), 1994.
    
    
    [56] 吉根林 等,数据挖掘技术及其应用,南京师大学报(自然科学版),2000,v23,No2。
    [57] 沈清,汤霖,模式识别导论,国防科技大学出版社,1991年5月。
    [58] Zadeh L., Fuzzy logic, neural network and soft computing. Communications of the ACM, 1994, 37(3): 77-84.
    [59] 江毅,软计算中的协作和融合技术综述。模糊系统与数学,vol.12,no.4,1998年。
    [60] Shi Qing-yun, K.S.Fu, A Method for the Design of Binary Tree Classifiers, Pattern Recognition, v16, No.6, pp 593-603, 1983.
    [61] 徐秉铮,张百灵,韦岗 编著,神经网络理论与应用,华南理工大学出版社,1994年12月第一版。
    [62] 徐涛,李强,数据挖掘技术及其应用。广西工学院学报,2000年,no.3。
    [63] 陈富赞 等,数据挖掘方法的研究,系统工程与电子技术,2000年,v22,No.3。
    [64] A.Kandel, editor. Fuzzy expert systems. CRC Press, Inc., Boca Raton, FL, 1992.
    [65] M. Sugeno and G.T. Kang. Structure identification of fuzzy model. Fuzzy sets and systems, v28: 15-33,1988.
    [66] T. Takagi and M.Sugeno. Fuzzy identification of systems and its applications to modeling and control. IEEE Transactions on Systems, Man, and Cysbernetics, 15:116-132, 1985.
    [67] B.Kosko. Neural networks and fuzzy systems: a dynamical systems approach. Prentice Hall, Upper Saddle River, NJ, 1991.
    [68] 杨会志,数据挖掘技术的主要方法及其主要发展方向,河北科技大学学报,2000:v21,No.3.
    [69] C. -C. Lee. Fuzzy logic in control systems: fuzzy logic controller- part 1. IEEE Trans. On Syetems, Man, and Cybernetics, 20(2): 404-418, 1990.
    [70] C. -C. Lee. Fuzzy logic in control systems: fuzzy logic controller- part 2. IEEE Trans. On Syetems, Man, and Cybernetics, 20(2): 404-418, 1990.
    [71] E. H. Mamdani and S. Assilian. An experiment in linguistic synthesis with a fuzzy logic controller. International Journal of Man-Machine Studies, 7(1): 1-13, 1975.
    [72] H. R. Kassaei, A.Keyhani, T. Woung, M Rahman, A hybrid fuzzy neural network bus load modeling and predication, IEEE Trans. On power systems, Vol. 14, No.2, May 1999.
    [73] Aldrich C., D.W. Moolman, F.S.Gouws, and G.P.J.Schmitz. Machine learning strategies for control of flotation plants. Control Eng. Practice, 5(2): 263-269, February 1997.
    
    
    [74] Perner P., T.B.Belikova, N.I.Yashunskaya. Knowledge acquisition by symbolic decision tree induction for interpretation of digital images in radiology. Lecture Notes in Computer Science, 1121:208, 1996.
    [75] 刘普寅,李洪兴,软计算及其哲学内涵,自然辩证法研究,vol.16,no.5,2000年。
    [76] L.A.Zadeh, Outline of a new approach to the analysis of complex systems and decision processes. IEEE Trans. On Systems, Man, and Cybernetics, 1973, 3(1): 28-44.
    [77] L.A.Zadeh, Quantitative fuzzy semantics. Information sciences, 1971, 3:177-206.
    [78] Lehnert, Wendy, Stephen Soderland, David Aronow, Fangfang Feng, and Avinoam Shmueli. Inductive text classification for medical applications. Journal of Experimental and Theoretical Artificial Intelligence, 7(1): 49-80, January-March 1995.
    [79] Stephen L. Chiu, Fuzzy model identification based on cluster estimation, Journal of intelligent and fuzzy systems, Vol.2, 267-278, 1994.
    [80] Guo, Y. and K.J.Dooley, Distinguishing between mean, variance and autocorrelation changes in statistical quanlity control, Int. J. of Production Research, 33(2): 497-510, Frebruary 1995.
    [81] Lagacherie, P. and S.Holmes, Addressing geographical data errors in classification tree for soil unit prediction. Int. J. of Geographical Information Science, 11(2): 183-198, March 1997.
    [82] Hannan, J.M.; Kiernan, L.A.; Warwick, K.; Majithia, S. , Intelligent Methods For Load Forecasting, Control '96, UKACC International Conference on (Conf. Publ. No. 427) , Volume: 2,2-5 September 1996, pp: 1418 -1423.
    [83] Masahird Miyakawa, Criteria for selecting a variable in the construction of efficient decision trees, IEEE Trans. On Computers, Vol.38, No. 1, January 1989.
    [84] Kandil, et al. Use of ANNs for short-term load forecasting. Canadian Conference on Electrical and Computer Engineering, 1999, 2: 1057~1061.
    [85] 张国江,邱家驹,李继红,基于人工神经网络的电力负荷坏数据辨识与调整,中国电机工程学报,Vol.21,No.8,Aug 2001。
    [86] P.D.Brierley, W.J.Batty, Neural data mining and modeling for electric load pridiction. Engineering applications of neural networks, 10-12th June 1998.
    [87] Islam, S.M.; Al-Alawi, S.M., Principles of electricity demand forecasting. Ⅰ. Methodologies,
    
    Power Engineering Journal, Volume: 11 Issue: 2, April 1997, Page(s): 91 -95.
    [88] 周江文,等.抗差最小二乘法[M].武汉:华中理工大学出版社,1997.
    [89] Nicolaos B.Karayiannis. An axiomatic approach to soft learning vector quantization and clustering[J]. IEEE Trans on Neural Networks, 1999, 10(5): 1015~1019.
    [90] 杨行峻,郑君里,人工神经网络,北京:高等教育出版社,1992.9。
    [91] Ronald R. Yager, Dimitar P. Filev, Generation of fuzzy rules by mountain clustering, Journal of intelligent and fuzzy systems, Vol.2, 209-219, 1994.
    [92] K. Liu, S.Subbarayan, R.R.Shoults, M.T. Manry, et al, Comparison of very short-term load forecasting techniques, IEEE Trans. On power systems, Vol. 11, No.2, May 1996.
    [93] M.Sugeno and G.T. Kang, Structure identification of fuzzy model. Fuzzy sets and systems, 28: 15-33,1988.
    [94] 易大义,陈道琦,数值分析引论,杭州:浙江大学出版社,2000年4月.
    [95] Teuvo Kohonen. Adaptive, associative, and self-organizing functions in neural computing[J]. Applied Optics, 1987, 26(23): 4910~4918.
    [96] 袁曾任.人工神经网络原理及其应用[M].北京:清华大学出版社,1999.
    [97] Hyeonjoong Yoo, Russell L. Primmel, Short term load forecasting using a self-supervised adaptive neural network, IEEE Trans. On power systems, Vol. 14, No.2, May 1999.
    [98] Neuro-fuzzy approaches to short-term electrical load forecasting, Bartkiewicz, W., Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on, Volume: 6,2000, Page(s): 229-234 vol.6.
    [99] P.A. Mastorocostas, J.B. Theocharis, A.G. Bakirtzis, Fuzzy modeling for short term load forecasting using the orthogonal least squares method, IEEE Trans. On power systems, Vol. 14, No.1, February 1999.
    [100] T. Takagi and M. Sugeno, Fuzzy identification of systems and its applications to modeling and control. IEEE Trans. On Systems, Man, and Cysbernetics, 15:116-132, 1985.
    [101] B.Cestnik, I.Kononenko, and I.Bratko, ASSISTANT 86: A knowledge-elicitation tool for sophisticated users. In Progress in Machine Learning, Ⅰ.Bratko and N.Lavrac. Eds. Wilmslow, UK: Sigma Press, 1987.
    [102] P. Clark, and T. Ninlett, The CN2 induction algorithm, Machine Learing, vol.3, no.4, pp.261-284, 1989.
    
    
    [103] Jyh-Shing Roger Jang. ANFIS: Adaptive-Network-Based Fuzzy Inference System. IEEE Transactions on Systems, Man, and Cybernetics. 1996, 23(3): 665~684.
    [104] Juberias, et al. New arima model for hourly load forecasting. Proceedings of the IEEE Power Engineering Society Transmission and Distribution Conference, 1999, 1:314~319.
    [105] ZhengTongxin, et al. Hybrid wavelet-Kalman filter method for load forecasting. Electric Power Systems Research, 2000, 54(1): 11-17.
    [106] S.E.Papadkis, et al. A novel approach to short-term load forecasting using fuzzy neural networks. IEEE Transactions on Power Systems, 1998, 13(2): 480~489.
    [107] H. Mori, H.kobayashi. Optimal Fuzzy Inference for Short-Term Load Forecasting. IEEE Transactions on Power Systems, 1996,11(1): 390~396.
    [108] 孙增圻,智能控制理论与技术。北京:清华大学出版社,1997。
    [109] 袁亚湘,孙文瑜,最优化理论与方法。北京:科学出版社,1999。
    [110] A.S.Hu, T.T. Lie and H.B.Gooi, Load Forecasting for customers under real time pricing. Electric Utility Deregulation and Restructuring and Power Technologies, 2000. Proceedings. DRPT 2000. International Conference on, 2000, Page(s): 538-543.
    [111] Hong Chen; Canizares, C.A.; Singh, A., ANN-based short-term load forecasting in electricity markets, Power Engineering Society Winter Meeting, 2001 IEEE, Vol 2,2001, Page(s): 411-415.
    [112] Henrique Steinherz Hippert, Carlos Eduardo Pedreira, and Reinaldo Castro Souza, Neural networks for short-term load forecasting: A review and evaluation, IEEE Trans. on Power Systems, Vol. 16, No. 1, February 2001.
    [113] Derek W. Bunn, Forecasting loads and prices in competitive power markets, Proceedings of The IEEE, Vol.88, No.2, February 2001.
    [114] L.Hyafil, R.L.Rivest, Constructing optimal binary decision trees is NP-complete. Information processing Letters, vol.5, no. 1, 1976.
    [115] 张国江,邱家驹,李继红,基于模糊推理系统的多因素电力负荷预测,电力系统自动化,Vol.26,No.5,March 2002。

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700