用户名: 密码: 验证码:
测量数据的建模与半参数估计
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
在测绘科学中,过去人们研究与讨论系统误差或粗差(称之为函数模型误差),主要局限于单个或几个观测量上,其原因在于采用常规测量仪器和测量方法进行各种静态测量时,常规测量受到外界环境变化的影响不大。通过重复性实验,干扰因素(习惯上称为系统误差或粗差)对测量结果的影响规律有比较明确的了解,观测值中的系统误差在平差前就能够得到较好的补偿。而且常规测量观测量较小,在观测过程中又有比较完善的观测程序和检核条件(观测量之间通常满足一定几何条件),粗差也比较容易发现和剔除,残余的系统误差与偶然误差相比达到了可以忽略不计的程度。即使有个别的系统误差或粗差存在,在参数回归数据处理中,有许多方法(如数据探测、稳健估计等)能够解决系统误差或粗差问题。随着现代测量仪器和测绘技术的发展,特别是空间技术的广泛应用,在较短时间内可以获得大量的观测数据,而且观测数据受外部环境的影响较大。由于影响测量结果的因素较多,其函数关系复杂而且对其认识较少,如果不考虑这种系统性的影响,近似的按参数回归建模,将会导致了参数模型与客观实际存在不可忽视的偏差,严重的影响估计量的结果。另外,从某种意义来说,经典参数数据处理方法没有从根本上解决模型误差问题,也没有从根本上解决好系统误差与粗差的区分问题,或者把两者混为一体,当平差函数模型包含有系统误差与粗差时,参数回归中处理模型误差的方法可能会不尽如意。因此,有必要提出一种新的数据处理理论与方法,来完善现有的测量数据处理理论。
     综上所述,研究与解决模型误差,以及如何区分系统误差与粗差问题,是现代测量数据处理所研究重要内容之一。20世纪80年代发展起来的一种重要的统计模型—半参数回归模型(Semiparametric regression model),为我们研究上述问题提供了一种新的方法。
     半参数回归模型可表达为:
     L=BX+s(t)+△式中s(t)是描述未知函数关系的模型误差(这里指函数模型),它是某个量t的函数,也就是所谓的非参数。由于半参数模型引入了非参数,克服传统平差函数模型的局限性,使得数学模型与客观实际更为接近,如果采用某种估计方法,在数值上能够分别求出参数、非参数(系统性的模型误差)和偶然误差,该估计方法将是一种较为理想的数据处理方法,而且必将有着广阔的应用前景。基于上述原因,作者主要基于半参数模型估计的基本理论与方法,研究测量数据建模中如何处理模型误差,以及模型的诊断与粗差检验等问题,同时将半参数模型与测绘的实际结合起来,来解决测绘数据处理的实际问题。
     本论文分为七章,第一章绪论主要阐述了在模型误差与半参数模型估计方面的研究现状,以及本论文研究的内容与意义。其它章节研究的主要内容如下:
     在第二章里,论述了线性参数模型数据处理的基本理论与分析方法。包括参数估计、参数显著性检验、模型诊断与模型误差假设检验等问题。介绍了残差分析与检验的基本方法,提出了需要解决的问题:1、当平差模型存在明显的系统误差时应如何处理?2、当系统误差与粗差同时存在时应如何探测粗差、估计系统误差?
In the science of surveying and mapping, people researched and discussed the systematic errors or gross errors on one or a few observations in the past, the reasons are that static ground observations are less impacted by outside surrounding environment factors as the static surveying to be accomplished with conventional instruments and methods. Through duplicate experiments impacting factors (usually called systematic errors or outliers) that influence result of observation are much known, and systematic errors can be fairly compensated or be expressed in parameter model before adjustment. Furthermore observation quantity is less, and there are fairly perfect observation procedures and checking conditions (such as definite geometric conditions), so outliers are easily detected or cleared up. It can reach up to be ignored comparing remainder system errors with occasional errors. Though there is individual error, many methods (data snooping, robust estimate etc.) can deal with it. along with development of surveying instruments and technologies, especially extensive application of space technologies, a great deal of surveying data can be gained in the short time, moreover, data is seriously influenced by outside environment, and factors impacting observation results is too much , and its function relation is complicated and unknown. If not considering this the parametric model is approximately built, and it can bring forward not ignored deviation between parametric model and real model, and seriously influenced estimates. In a sense, the classical parametric regression model does not solve essentially model error, and also can not fundamentally distinguish system errors with gross errors, or mix up both. When the adjustment model include systematic errors and outliers, the method dealing with model errors is not quite satisfactory , so it is necessary that the new data processing methods to improve present theories of surveying data processing.From what has been discussed above, researching and resolving model errors, or differentiating systematic error with gross is one of the main contents to be researched nowadays.The eighth decade of the 20~(th) century, a important statistical model ——the semiparametricregression model has been developed, it is a fire-new method for us to research the above problems.The semiparametric model is as follows:L=BX+s(t)+△Where s(t) is the model errors (here called function model errors) describing unknown function relation, and it is the function of variablet, and is called the non-parameter. It can overcome the demerits of conventional adjustment model and get mathematics model close to reality since the non-parametric weight be brought in the semiparametric model. The parameter, non-parameter and accidental error can be got if an estimation method is used, and it will be an ideal method, moreover has very good foreground of application. Duo to these reasons above the semiparametric model is given abroad attentions, and the writer deal with model errors mainly in surveying data modeling with its theories and methods, and research model diagnose and gross test etc. At the same time, surveying data processing be combined with surveying practice to solve practical questions.The paper is divided into seven chapters. In chapter one, introduction expatiate the present state
    of research on the model errors and the semiparametric model estimation, and clarify content and signification. The main contents are as follows on the other chapters:In chapter 2, it clarifies the basic theories and methods on the data processing of linear parameter model, Including parameter estimation, test of parameter significance, model diagnose and outlier test. It give methods of residual analysis and test, and bring forward questions to be solved: 1 .. how do you deal with it when there is systematical errors in adjustment model ? 2, how do you snoop outliers and estimate systematical errors when both is at one time?In chapter 3, especially researching estimation methods of the semi-parametric mode based on regular matrix; detailedly deriving the penalized least square formulas of the nonparametric model with natural spline, and giving the penalized least square formulas of the semiparametric model with cubic natural spline; in addition, introducing partial kernel smoothing and partial residual estimate. Giving penalized least square formulas of the semi-parametric model with vector measurements. The choices of regular matrix and smoothing factor are systematically discussed, finally giving some statistical properties for the estimates of the semiparametric model.In chapter 4, based on the diagnose means of parametric model, testifying the estimates are identical using forecasted and deleted models to estimate parameters in the semi-parametric model. Some statistical diagnose are derived and testing methods on gross errors are given. Discussing method on the test of non-parameter significance and giving its testing formulas, and finally it give brief processes and steps in the modeling of generalized linear models.In chapter 5, by analysis and comparison of several adjustment models (such as rank-deficient free network, least square collocation, ridge estimate, semi-parametric model). Putting forward the unified principle of their solutions to be derived and the unified formula of solutions on ill-posed problems. The characters of solution on ridge estimate of ill-posed problems are systematically expounded. Along with basic principle of ridge estimate of parameter, the rule of universal penalized least squares is represented based on penalized least squares of the semi-parametric model, and its formula of estimate are derived. In addition, summing up the basic theories and methods of M-estimation, and the principle of parametric robust estimate is applied on estimate of the semi-parametric model, and robust estimate of the semi-parametric model is put forward, some expressions of estimations are got for robust penalized least squares and robust universal penalized least squares. Finally simulation examples are used to illustrate the effect of related estimation methods.In chapter 6, it is combined with some examples in survey, and researched on the applications of the semi-parametric model on surveying data processing. It verifies the practicality and effect of semiparametric model with computing and analyzing of some examples.In the end, the main research work and innovation in the paper are summarized, and some problems to be further studied in the future are bring forward.
引文
[1] 柴根象,洪圣岩.半参数回归模型.安徽教育出版社,1995
    [2] 柴根象,孙平.半参数回归二阶段估计.应用数学学报,Vol.18,No.3,333-336,1995
    [3] 柴根象,徐克军.半参数回归的线性小波光滑.应用概率统计,Vol.15,No.1,97-105,1999
    [4] 陈长生等,半参数回归模型及模拟实例分析.中国卫生统计,Vol.18,No.6,2001
    [5] 陈兰祥,李生文.半参数回归模型参数的Bayes估计.同济大学学报,Vol.26,No.1,1998
    [6] 钱伟民.半参数回归模型误差方差估计的Bootstrap逼近,数理统计与应用概率,Vol.13,No.4,1998
    [7] 钱伟民,翟健茹.污染数据线性回归模型的参数估计,同济大学学报,Vol.31,No.2,246-249,2003
    [8] 钱伟民,柴根象.半参数回归模型的估计的渐进性质,高校应用数学学报,Vol.14,No.2,1997
    [9] 钱伟民.半参数回归模型的误差方差的估计注记,同济大学学报,Vol.26,No.1,83-86,1998
    [10] 王仁宏.数值逼近.高等教育出版社,1999
    [11] 刘大杰,陶本藻.实用测量数据处理方法.测绘出版社,2000
    [12] 李庆海,陶本藻.。概率统计原理在测量中的应用。测绘出版社,1984
    [13] 陈希孺,王松桂.近代回归分析.安徽教育出版社,1987
    [14] 张金愧,线性模型参数估计及其改进,国防科技大学出版社,1999
    [15] 陈希孺等.非参数统计,上海科技出版社,1989
    [16] 周忠模,易杰军等,GPS卫星测量原理与应用.测绘出版社,1997
    [17] 张守信,GPS卫星测量定位理论与应用.国防科技大学出版社,1996
    [18] 张智星.MATLAB程序设计与应用,清华大学出版社,2002
    [19] 王沐然.MATLAB6.0与科学计算,电子工业出版社,2002
    [20] 崔希璋.广义测量平差.测绘出版社,1992
    [21] 崔希璋,於宗俦,陶本藻等.广义测量平差(新版),武汉测绘科技大学学报,2001
    [22] 潘建敏.污染数据半参数回归模型的估计方法.工程数学学报,Vol.14,No.3,1997
    [23] 高集体.半参数回归模型中的大样本理论.中国科技大学博士论文,1992
    [24] 朱仲义,韦博成,半参数非线性模型的统计诊断与影响分析.应角数学学报,Vol.24,No.4.2001.
    [25] 李德仁,误差处理和可靠性理论,测绘出版社,1988
    [26] 陶本藻,刘大杰等.平差模型误差理论及其应用论文集,励绘出版社,1993
    [27] 陶本藻,施闯.姚宜斌.顾及系统误差的平差模型的研究,测绘学院学报,Vol.19,No.2,2002
    [28] 陶本藻等,基于多面核函数配置模型的参数估计,武汉大学学报信息科学版 Vol.28,No.5,547-550,2003
    [29] 陶本藻.系统误差的后验检验和平差补偿,大地测量与地球动力学,Vol.22,NO.1,2002.
    [30] 陶本藻.自由网平差与变形分析,武汉测绘科技大学出版社,2001
    [31] 陶本藻等,多面函数推估与协方差推估,测绘通报,NO.9,4-6,2002
    [32] 周江文等.抗差估计论文集,测绘出版社,1992
    [33] 周江文.系统误差的数学处理,测绘工程,Vol.8,No.2,1999
    [34] 黄幼才.数据探测与抗差估计,测绘出版社,1990
    [35] 欧吉坤.论真误差拟稳解的基本特性,测绘学报,Vol.28,No.2,1999,144-147
    [36] 欧吉坤.一种检验粗差的新方法,科学通报,Vol.44,No.16,1777-1781,1999
    [37] 欧吉坤.GPS精密测量中系统误差的分离方法,数据采集与处理,Vol.18,No.4,365-368,2003
    [38] 欧吉坤.测量平差中不适定问题解的统一表达与选权拟合法,测绘学报,Vol.33,No.4,282-288,2004
    [39] 孙海燕.测量平差函数模型的若干讨论,武汉大学学报(信息科学版),Vol.28,Special Issue,62-64.2003
    [40] 孙海燕,吴云.半参数回归与模型的精化,武汉大学学报(信息科学报),Vol.27,No.2,,172-174,2002
    [41] 王振杰,欧吉坤.用L-曲线法确定岭估计中的岭参数,武汉大学学(信息科学版),Vol.29,No.3,235-238,2004
    [42] 王金岭,陈永奇.粗差数学期望平移模型的理论研究,武汉测绘科技大学学报,No.02,1995
    [43] 王新洲.在无偏估计类中改进最小二乘估计的方法,武汉测绘科技大学学报,Vol.20,No.03,46-50,1995
    [44] 王新洲.广义平差的概括模型,武汉测绘科技大学学报,Vol.25,No.03,257-270,2000
    [45] 王新洲.稳健二次估计理论及其在GPS随机模型中的应用,博士论文,武汉测绘科技大学,1995
    [46] 王振杰,欧吉坤等.用L曲线法确定半参数模型中的平滑因子,武汉大学学报(信息科学版),Vol.29,No.7,651-653,2004
    [47] 隋立新,陶大欣.多种顾及先验信息的平差及其比较,测绘工程,Vol.10,No.4,9-12,2001
    [48] 尊林,朱仲义等.半参数广义线性模型的影响分析与异常点的检验,高校应用数学学报 A辑,Vol.19,No.3,323-332,2004
    [49] 张正禄,张松林等.多维粗差定位与定值的算法研究及实现,武汉大学学报(信息科学版),Vol.28,No.4,400-404,2003
    [50] 秦荣华.系统误差模型的建立方法,上海航天,No.5,5-9,2000
    [51] 姚宜冰,陶本藻.基于最小二乘配置的GPS网平差模型及其应用,测绘学学报,Vol.20,No.1,4-6,2003
    [52] 孙孝前.纵向数据半参数建模中的迭代加权偏样条最小二乘估计,中国科学(a辑),Vol.33,No.5,470-480,2003
    [53] 施云驰,柴根象.半参数回归模型的局部多项式光滑,同济大学学报,Vol.28,No.1,80-83,2000
    [54] 於宗俦,李明锋.多维粗差的同时定位与定值,武汉测绘科技大学学报,Vol.21,No.4,323-329,1996
    [55] 刘永明.GPS水准的粗差检验,工程勘察,No.6,51-54,1994
    [56] 鲁铁定,周世健.GPS高程转换中高程异常粗差的检验与分析,工程勘察,No.4,51-54,2004
    [57] 吴云.半参数回归模型在测量中的应用,硕士论文,武汉大学,2002
    [58] 张松林.非线性半参数模型最小二乘估计理论及应用研究,武汉大学博士论文,2003
    [59] 胡宏昌,孙海燕等.正规矩阵R及平滑因子α的选取,测绘工程,Vol.12,No.4,5-8,2003
    [60] 胡宏昌.半参数模型的估计方法及其应用,武汉大学博士论文,2003
    [61] 彭军还.粗差验后方差的无偏估计与最优稳健估计,测绘学院学报,Vol.17,No.2,82-85,2000
    [62] 彭军还.非线性M估计研究及其应用,博土论文,武汉大学,2003
    [63] 徐进军.稳健时间序列方法及其在变形时频分析中的应用,武汉大学博士论文,2003
    [64] 徐绍铨,李振洪等.GPS高程拟合系统的研究,武汉测绘科技大学学报,Vol.24,No.4,326-329,1999
    [65] 徐天河,杨元喜.均方误差意义下正则化解优于最小二乘解的条件,武汉大学学(信息科学版),Vol.29,No.3,223-226,2004
    [66] 杨元喜,欧吉坤等.抗差最小二乘法,华中理工大学出版社,1995
    [67] 童恒庆.经济回归模型及计算,湖北科技技术出版社,1997
    [68] 刘权威,郭跃宏.同时考虑处理系统误差和粗差剔除的控制网平差,太原工业大学学报,Vol.26,No.2,59-63,1995
    [69] 柴艳菊,欧吉坤.用拟准检定法和岭估计解算病态污染方程,地壳形变与地震,Vol.20,No.4,36-40,1995
    [70] 归庆明,李国重,欧吉坤.有偏估计与LS估计的比较与选择,测绘学报,Vol.32,No.1,26-30,2003
    [71] 归庆明,郭建锋.部分岭估计在整周模糊度解算中的应用,信息工程大学学报,Vol.5,No.2,137-139,2004
    [72] 朱建军.一种稳健性准则及相应的稳健估计,测绘工程,Vol.5,No.4,22-28,1996
    [73] 朱建军.污染模型下的最小二乘估计,中南工业大学学报,Vol.27,No.3,273-277,1996
    [74] 朱建军.污染误差模型下的测量数据处理理论,测绘学报,Vol.28,No.3,215-220,1999
    [75] 张勤,王利.GPS高程拟合中的稳健估计,城市勘察,No.4,9-12,2000
    [76] 刘经南,姚宜斌,施闯.基于等价方差-协方差阵的稳健最小二乘估计理论研究,测绘科学,Vol.25,No.3,1-6,2000
    [77] 丁士俊,陶本藻.半参数回归与平差模型,大地测量与地球动力学,Vol.23,No.4,2003
    [78] 丁士俊,陶本藻.自然样条非参数回归模型及模拟分析,测绘通报,Vol.322,No.1,17-19,2004
    [79] 丁士俊,陶本藻.半参数模型核光滑估计与模拟分析,大地测量与地球动力学,Vol.24,No.4,24-28,2004
    [80] 丁士俊,陶本藻.半参数模型及其在形变分析中的应用,测绘科学,Vol.29,No.5,38-40,2005
    [81] 丁士俊,陶本藻.自然样条半参数模型与系统误差估计,武汉大学学报(信息科学版),Vol.29,No.11,964-967,2004
    [82] Bujia, A., Hastie,T.J. and Tibshirani, R.J. Linear Smoothers and Additive Models, Annual of Statistics, Vol. 17, No.2,PP. 453-555,1989
    [83] Beat Fischer and Markus Hegland, Collocation filtering and nonparametric regression, ZFVl, 1999
    [84] Buckley, M. J., Eagleson, G. K. and Siverman, B.W. The estimation of residual variance in nonparametric regression. Biometrika, 75, pp.189-199, 1988.
    [85] Boyeed, R. A. A study of smoothing spline regression. Ph..D, Thesis, University of Bath, 1986.
    [86] Craven,P.and Wahba, G. Smoothing noisy data with spline funtion. Numer.Math.31,377-403,1979.
    [87] Cox, D., Koh, E., Wahba, G. and Yandell, B.S. Testing the null model hypothesis in(semiparametric) partial and generalized spline models.Ann.statist., 16,113-119, 1988.
    [88] David Ruppert, M. P. Wand, R. J. Carroll. Semiparametric regression. Cambridge university press, 2003.
    [89] Engle, R. F., Granger, C. W. J., Rice, J.A. and Weiss, A Semiparametric estimates of the relation between weather and electricity sales, J. Amer. Statist. Assoc., 81,310-320,1986.
    [90] Eubank, R. L. The hat matric for smoothing splines, statistics and probability letters, 2, 9-14,1984.
    [91] Eubank, R. L. diagnostics for smoothing splines. J. R. Statist. soc.B, 47, NO.2, pp.332-341,1985.
    [92] Eubank, R. L., Hart, J.D. and Speckman, P. Trigonometric Series Regression Etimators with an Aplication to Partially linear Model. J. Mutivariate Anal.,1990,32: 70-83.
    [93] Fessler, J.A. Nonparametrie fixed-interval Smoothing with Vector Splines. IEEE Tran. Vol.39, NO.4,1991.
    [94] Fessler, J.A. Nonparametric fixed-interval smoothing of nonlinear vector-valued measurements. IEEE Tran. Vol.39.NO.4,1991.
    [95] Green, P. J., Jennison, C., and Seheult, A., Analysis of field experiments by least squares smoothing. J. Roy. Statist. soc.B, 47,299-315. 1985
    [96] Green, P. J. Linear models for field trials, smoothing and cross-validation. Biometrika, 72,523-537, 1985.
    [97] Green, P. J. Penalized likelihood for generalized semi-parametric regression models. Int.Statist. Rev. 55, 245-259,1987.
    [98] Green, P. J. and Silverman, B.W. Nonparametric regression and generalized linear models, CHAPMAN and HALL, London, 182, pp. 1994.
    [99] Huber, P.J. Robust statistics, Ann. Statist, 1973(1)
    [100] Hansen P C. Analysis of Discrete Ill-posed Problems by Means of the L-curve. SIAM Review, 34(4), 561-580, 1992
    [101] Hansen P C, O'Leary D P. The use of the L-curve in the regularization of discrete ill-posed problem. SIAM J.Sci.Comput., 14(6), 1487-1503, 1993
    [102] Hutchinson, M.F. Optimising the degree of data smoothing for locally adaptive finite element bivariate smoothing splines. ANZIAM Journal, 42(E): pp.C774-C796, 2000.
    [103] Hutchinson, M.F. and Hancock, P.A. An iterative pocedure for calculating minimum generalized cross validation smoothing splines. ANZIAM Journal, 42(E): pp. C774-796, 2000.
    [104] Hutchinson, M.F. and de Hoog, F.R. Smoothing noisey data with spline function. Numer.Math., 47,99-106,1985.
    [105] Jia, M. Mitigation of systematic errors of GPS positioning using vector semiparametric models, 13th INT.Tech..Meeting of the Satellite Division of the U.S.Inst.of Navigation, Salt Lake City, Utah,11-44 September, 1938-1947, 2000.
    [106] Jia, M. Tsakiti, M and Stewart, M.P. Vector penalized likelihood estimation and its application in GPS positioning. Paper presented to 50th Annual Research Seminar, shool of Geomatic engineering, The university of New South Wales, Sydney, NSW, 1999.
    [107] Meleod, A.I. and Li, W.K. Diagnostic checking ARMA time series models using squared-residual autocorrelations. Journal of Time Series Analysis, No.4, pp.269-273., 1983.
    [108] Rice, J. Bandwidth choice for nonparamettic regression.Ann. Statist., 12, 1215-1230, 1984.
    [109] Robinson, T and Moyeed, R. Making robust the cross-validatory choice of smoothing parameter in spline smoothing regression.Commun.Statist.-Theory Meth.,18, 523-530, 1989.
    [110] Stone, J.C. Consistent nonparametric regression, Ann. Statist. 1977.
    [111] Silverman, B.W. A fast and efficient cross-validation method for smoothing parameter choice in spline regression. J.Amer..Statist.Assoc., 79,584-589, 1984.
    [112] Silverman, B.W. Some aspect of the spline smoothing approach to non-parametric regression curve fitting. J.Roy.Statist.Soc. B, 47, No.1, 1-52, 1985.
    [113] Silverman, B.W. Should we use kernel methods at all? Statistical Science, 6,430-433. 1992.
    [114] Speckman, P. Kernel smoothing in partial linear models. J.Roy.Statist.soc.B 50, No.3, PP. 413-436,1988.
    [115] Yee, T.W. and Wild, C.J. Vector Generalised Additive Modets, J.R.Statist.Soc.B, 58, No.3, PP. 481-493,1996.
    [116] Wecker, W. E. and Ansley, C. F. The signal extraction approach to nonlinear regression and spline smoothing. J. Amer. Statist. Assoc. 78,81 -89,1983.
    [117] Yandell,B.S. Algorithms for multidimensional semiparametric GLM's. Commun. Statist.B, 17-295-312,1988
    [118] Bates, D.M., Lindstrom, M.J., Wahba, G. and Yandell,B.S. GCVPACK—Routines for generalized cross validation. Commun.Statist.Simul.Comput., 16,263-297,1987

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700