期刊
JOURNAL OF CHEMOMETRICS
卷 22, 期 11-12, 页码 601-609出版社
WILEY
DOI: 10.1002/cem.1147
关键词
PCA; SIMCA; leverage distribution; residual variance distribution; type I error; acceptance area; classification; influence plot; outlier
In the projection methods (PCA, PLS) two distance measures are of importance. They are the score distance (SD, a.k.a. leverage) and the orthogonal distance (OD, a.k.a. the residual variance). This paper shows that both distance measures can be modeled by the chi(2)-distribution. Each model includes a scaling factor that can be described by an explicit equation. Moreover, the models depend on an unknown number of degrees of freedom, which have to be estimated using a training dataset. Such modeling is further applied to classification within the SIMCA framework, and various acceptance areas are built for a given significance level. A triangular area, constructed using the sum of the normalized SD and OD, is deemed to be the most practical. This theoretical notion is supported by three examples. The first is based on a simulated dataset, while the other two employ real world data. Copyright (C) 2008 John Wiley & Sons, Ltd.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据