请输入您要查询的字词:

 

单词 regression diagnostics
释义
regression diagnostics

Statistics
  • Various statistics that give information about the reliability of the estimates of the multiple regression modelregression diagnostics

    where Y is an n×1 vector of independent and identically distributed response variables, β is a p×1 vector of unknown parameters, and X is an n×p matrix. If β is replaced by its least squares estimate, β̂, the estimated column vector of fitted values, ŷ, is given byregression diagnostics

    where the n×n matrix H, the hat matrix, is given byregression diagnostics

    X′ is the transpose of X, (XX)−1 is the inverse of the matrix XX, and y is the column vector of observed values. Denote the element in the jth row and kth column of H by hjk. The fitted value, ŷj, for the jth observation, yj, is given by regression diagnosticsThus there is a direct link between the fitted and observed values in the form of hjj. This is the leverage: a large value (e.g.>2p/n) indicates an observation having a large influence on the form of the fitted model.

    The most obvious guide to the fit of a model are the residuals, e1, e2,…, where ej is given byregression diagnostics

    If the random variables have common variance σ2 and if s2 is an unbiased estimate of σ2, then the standardized residual is sometimes defined as ej/s. However, an unbiased estimate of the variance of ej is not s2 but s2(1−hjj) and a more appropriate residual (having unit variance if the model is correct) is given by rj, where regression diagnosticsThis is sometimes called the standardized residual and sometimes the Studentized residual.

    The deletion residual is given byregression diagnostics

    where ŷj,−j is the fitted value for observation j based on the fit of the model to all the observations except the observation yj. Dividing the deletion residual by its estimated standard error, we get the Studentized deletion residual which can be written as regression diagnosticswhere s2j is the unbiased estimate of σ2 obtained when observation j is omitted. Confusingly, this may also be called the Studentized residual. See also Anscombe residual; deviance residual.

    A related influence statistic is DFFITS, which is an abbreviation for difference in fits. For observation j, DFFITSj is regression diagnosticsThe influence statistic DFBETA (difference in beta values) applies the idea embodied in DFFITS to the parameter estimates rather than the fitted values. For βk, DFBETAk,−j is regression diagnosticswhere β̂k is the estimate of βk from the complete data, β̂k,−j is the estimate when observation j is omitted, and mkk is the corresponding diagonal element of the p×p matrix (XX)−1.

    A statistic that usefully combines information about leverage and influence is Cook's statistic, Dj, given by regression diagnosticsThis statistic (introduced by Cook in 1977) can also be interpreted as measuring the effect on the parameter estimates of omitting the jth observation. Large values point to possible outliers.


随便看

 

科学参考收录了60776条科技类词条,基本涵盖了常见科技类参考文献及英语词汇的翻译,是科学学习和研究的有利工具。

 

Copyright © 2000-2023 Sciref.net All Rights Reserved
京ICP备2021023879号 更新时间:2024/12/25 13:04:30