Links

Tools

Export citation

Search in Google Scholar

A Graphical Tool for the Detection of Modes in Continuous Data

Journal article published in 2009 by Thomas Burger, Thierry Dhorne
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Question mark in circle
Preprint: policy unknown
Question mark in circle
Postprint: policy unknown
Question mark in circle
Published version: policy unknown

Abstract

In (Bickel, 2003) is presented a robust parametric estimator for the mode of a monomodal continuous distribution. Therefore, it is necessary that the distribution is monomodal. On the other hand, there have been some non-parametric methods for the estimation of the local modes of multimodal distributions. Here, we present a graphical tool that conveniently helps deciding on visual bases, the number of modes of a distribution. To do so, the distribution is convoluted by a kernel of various scales to let local maxima of the density appear. Conceptually, the approach is similar to time-frequency analysis or wavelet analysis, but in order to best describe the shape of the distribution, Gaussian kernels are used. They are known to be more effi- cient in computer vision and pattern classification, and the corresponding representation fits the theoretical expectations (Mokhtarian, 1992). Some other works have explored this connection between pattern classification and descriptive statistics. Hence, a work with ideas similar to ours has already been proposed to publication (Griffin, unpublished), but to our knowledge, in spite of its quality, it remains unpublished. It is based on a multi-scale mean shift algorithm, and the approach is once again rather formal: the point is more to find the various modes, than to provide a convenient way to represent them. Hence, in spite of a common theoretical framework (the similarity with time-frequency analysis in computer vision), the objective is somewhat different. In addition to this work, we propose a dendrogram-like representation that helps the expert to describe the datasets and/or to propose an adapted mixture model. From an experimental point of view, the method is validated on real and simulated datasets. Finally, an efficient implementation is given.