Recent Advances in Data Mining of Enterprise Data: Algorithms and Applications, p. 413-462
DOI: 10.1142/9789812779861_0009
Full text: Download
This chapter aims to present our data mining vision on Statistical Process Control (SPC) analysis, specifically on the design of multivariate control charts for individual observations in the case of independent data and continuous variables. In order to address new SPC issues such as the presence of multiple outliers and incorrect model assumptions in the context of large data sets, we suggest exploitation of some multivariate nonparametric statistical methods. In a model-free environment, we present the way we handle large data sets: a multivariate control scheme based on the data depth approach. We first present the general framework, and then our specific idea on how to design a proper control chart. There follows an example, a simulation study, and some remarks on the choice of the depth function from a data mining perspective. A brief discussion of some open issues in data mining SPC closes the chapter.