Published in

American Meteorological Society, Journal of Atmospheric and Oceanic Technology, 5(37), p. 789-806, 2020

DOI: 10.1175/jtech-d-18-0244.1

Links

Tools

Export citation

Search in Google Scholar

Improved Statistical Method for Quality Control of Hydrographic Observations

This paper was not found in any repository, but could be made available legally by the author.
This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Orange circle
Published version: archiving restricted
Data provided by SHERPA/RoMEO

Abstract

AbstractRealistic ocean state prediction and its validation rely on the availability of high quality in situ observations. To detect data errors, adequate quality check procedures must be designed. This paper presents procedures that take advantage of the ever-growing observation databases that provide climatological knowledge of the ocean variability in the neighborhood of an observation location. Local validity intervals are used to estimate binarily whether the observed values are considered as good or erroneous. Whereas a classical approach estimates validity bounds from first- and second-order moments of the climatological parameter distribution, that is, mean and variance, this work proposes to infer them directly from minimum and maximum observed values. Such an approach avoids any assumption of the parameter distribution such as unimodality, symmetry around the mean, peakedness, or homogeneous distribution tail height relative to distribution peak. To reach adequate statistical robustness, an extensive manual quality control of the reference dataset is critical. Once the data have been quality checked, the local minima and maxima reference fields are derived and the method is compared with the classical mean/variance-based approach. Performance is assessed in terms of statistics of good and bad detections. It is shown that the present size of the reference datasets allows the parameter estimates to reach a satisfactory robustness level to always make the method more efficient than the classical one. As expected, insufficient robustness persists in areas with an especially low number of samples and high variability.