TY - JOUR
T1 - Spatially simplified scatterplots for large raster datasets
AU - Li, Bin
AU - Griffith, Daniel A.
AU - Becker, Brian
N1 - Publisher Copyright:
© 2016 Wuhan University. Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2016/4/2
Y1 - 2016/4/2
N2 - Scatterplots are essential tools for data exploration. However, this tool poorly scales with data-size, with overplotting and excessive delay being the main problems. Generalization methods in the attribute domain focus on visual manipulations, but do not take into account the inherent nature of information redundancy in most geographic data. These methods may also result in alterations of statistical properties of data. Recent developments in spatial statistics, particularly the formulation of effective sample size and the fast approximation of the eigenvalues of a spatial weights matrix, make it possible to assess the information content of a georeferenced data-set, which can serve as the basis for resampling such data. Experiments with both simulated data and actual remotely sensed data show that an equivalent scatterplot consisting of point clouds and fitted lines can be produced from a small subset extracted from a parent georeferenced data-set through spatial resampling. The spatially simplified data subset also maintains key statistical properties as well as the geographic coverage of the original data.
AB - Scatterplots are essential tools for data exploration. However, this tool poorly scales with data-size, with overplotting and excessive delay being the main problems. Generalization methods in the attribute domain focus on visual manipulations, but do not take into account the inherent nature of information redundancy in most geographic data. These methods may also result in alterations of statistical properties of data. Recent developments in spatial statistics, particularly the formulation of effective sample size and the fast approximation of the eigenvalues of a spatial weights matrix, make it possible to assess the information content of a georeferenced data-set, which can serve as the basis for resampling such data. Experiments with both simulated data and actual remotely sensed data show that an equivalent scatterplot consisting of point clouds and fitted lines can be produced from a small subset extracted from a parent georeferenced data-set through spatial resampling. The spatially simplified data subset also maintains key statistical properties as well as the geographic coverage of the original data.
KW - Scatterplot
KW - effective sample size
KW - spatial autocorrelation
UR - http://www.scopus.com/inward/record.url?scp=84969786259&partnerID=8YFLogxK
U2 - 10.1080/10095020.2016.1179441
DO - 10.1080/10095020.2016.1179441
M3 - Article
AN - SCOPUS:84969786259
VL - 19
SP - 81
EP - 93
JO - Geo-Spatial Information Science
JF - Geo-Spatial Information Science
SN - 1009-5020
IS - 2
ER -