Spatially simplified scatterplots for large raster datasets

Bin Li, Daniel A. Griffith, Brian Becker

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Scatterplots are essential tools for data exploration. However, this tool poorly scales with data-size, with overplotting and excessive delay being the main problems. Generalization methods in the attribute domain focus on visual manipulations, but do not take into account the inherent nature of information redundancy in most geographic data. These methods may also result in alterations of statistical properties of data. Recent developments in spatial statistics, particularly the formulation of effective sample size and the fast approximation of the eigenvalues of a spatial weights matrix, make it possible to assess the information content of a georeferenced data-set, which can serve as the basis for resampling such data. Experiments with both simulated data and actual remotely sensed data show that an equivalent scatterplot consisting of point clouds and fitted lines can be produced from a small subset extracted from a parent georeferenced data-set through spatial resampling. The spatially simplified data subset also maintains key statistical properties as well as the geographic coverage of the original data.

Original languageEnglish
Pages (from-to)81-93
Number of pages13
JournalGeo-Spatial Information Science
Volume19
Issue number2
DOIs
StatePublished - Apr 2 2016

Keywords

  • Scatterplot
  • effective sample size
  • spatial autocorrelation

Fingerprint

Dive into the research topics of 'Spatially simplified scatterplots for large raster datasets'. Together they form a unique fingerprint.

Cite this