The T index: Measuring the reliability of accuracy estimates obtained from non-probability samples

François Waldner

ArticleOPEN ACCESS

The T index: Measuring the reliability of accuracy estimates obtained from non-probability samples

Waldner F

Remote Sensing

DOI: 10.3390/RS12152483

5Citations

22Readers

Abstract

In remote sensing, the term accuracy typically expresses the degree of correctness of a map. Best practices in accuracy assessment have been widely researched and include guidelines on how to select validation data using probability sampling designs. In practice, however, probability samples may be lacking and, instead, cross-validation using non-probability samples is common. This practice is risky because the resulting accuracy estimates can easily be mistaken for map accuracy. The following question arises: to what extent are accuracy estimates obtained from non-probability samples representative of map accuracy? This letter introduces the T index to answer this question. Certain cross-validation designs (such as the common single-split or hold-out validation) provide representative accuracy estimates when hold-out sets are simple random samples of the map population. The T index essentially measures the probability of a hold-out set of unknown sampling design to be a simple random sample. To that aim, we compare its spread in the feature space against the spread of random unlabelled samples of the same size. Data spread is measured by a variant of Moran's I autocorrelation index. Consistent interpretation of the T index is proposed through the prism of significance testing, with T values < 0.05 indicating unreliable accuracy estimates. Its relevance and interpretation guidelines are also illustrated in a case study on crop-type mapping. Uptake of the T index by the remote-sensing community will help inform about-and sometimes caution against-the representativeness of accuracy estimates obtained by cross-validation, so that users can better decide whether a map is fit for their purpose or how its accuracy impacts their application. Subsequently, the T index will build trust and improve the transparency of accuracy assessment in conditions which deviate from best practices.

Author supplied keywords

Cite

CITATION STYLE

APA

Waldner, F. (2020, August 1). The T index: Measuring the reliability of accuracy estimates obtained from non-probability samples. Remote Sensing. MDPI AG. https://doi.org/10.3390/RS12152483

The T index: Measuring the reliability of accuracy estimates obtained from non-probability samples

Abstract

Author supplied keywords

Cite

Register to see more suggestions