Search Page

AM-94 - Machine Learning Approaches

Machine learning approaches are increasingly used across numerous applications to learn from data, generate new knowledge discoveries, advance scientific studies, and support automated decision making. In this knowledge entry, the fundamentals of Machine Learning (ML) are introduced, focusing on how feature spaces, models, and algorithms are being developed and applied in geospatial studies. An example of an ML workflow for supervised/unsupervised learning is also introduced. The main challenges in ML approaches and our vision for future work are discussed at the end.

AM-21 - The Evolution of Geospatial Reasoning, Analytics, and Modeling

The field of geospatial analytics and modeling has a long history coinciding with the physical and cultural evolution of humans. This history is analyzed relative to the four scientific paradigms: (1) empirical analysis through description, (2) theoretical explorations using models and generalizations, (3) simulating complex phenomena and (4) data exploration. Correlations among developments in general science and those of the geospatial sciences are explored. Trends identify areas ripe for growth and improvement in the fourth and current paradigm that has been spawned by the big data explosion, such as exposing the ‘black box’ of GeoAI training and generating big geospatial training datasets. Future research should focus on integrating both theory- and data-driven knowledge discovery.

AM-97 - An Introduction to Spatial Data Mining

The goal of spatial data mining is to discover potentially useful, interesting, and non-trivial patterns from spatial datasets (e.g., GPS trajectories of smartphones). Spatial data mining is societally important, with applications in public health, public safety, climate science, etc. For example, in epidemiology, spatial data mining helps to find areas with a high concentration of disease incidents to manage disease outbreaks. Computational methods are needed to discover spatial patterns since the volume and velocity of spatial data exceed the ability of human experts to analyze it. Spatial data has unique characteristics, such as spatial autocorrelation and spatial heterogeneity, which violate the i.i.d. (independent and identically distributed) assumption of traditional statistical and data mining methods. Therefore, using traditional methods may miss patterns or may yield spurious patterns, which are costly in societal applications. Further, there are additional challenges such as the MAUP (Modifiable Areal Unit Problem), as illustrated by a recent court case debating gerrymandering in elections. In this article, we discuss tools and computational methods of spatial data mining, focusing on the primary spatial pattern families: hotspot detection, collocation detection, spatial prediction, and spatial outlier detection. Hotspot detection methods use domain information to accurately model more active and high-density areas. Collocation detection methods find objects whose instances are in proximity to each other in a location. Spatial prediction approaches explicitly model the neighborhood relationship of locations to predict target variables from input features. Spatial outlier detection methods find data that differ from their neighbors. Lastly, we describe future research and trends in spatial data mining.
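
As a rough illustration of the last pattern family, the sketch below flags points whose value differs strongly from the mean of their nearest neighbors. It is not taken from the entry itself: the function name `spatial_outliers`, the `k` nearest-neighbor rule, and the z-score `threshold` are all assumptions chosen for the example.

```python
import math
from statistics import mean, pstdev


def spatial_outliers(points, k=3, threshold=2.0):
    """Return indices of points whose value departs from their neighbors.

    points: list of (x, y, value) tuples (hypothetical input layout).
    k and threshold are illustrative defaults, not recommended settings.
    """
    diffs = []
    for i, (xi, yi, vi) in enumerate(points):
        # Rank every other point by Euclidean distance to point i.
        others = sorted(
            (math.hypot(xi - xj, yi - yj), vj)
            for j, (xj, yj, vj) in enumerate(points)
            if j != i
        )
        # Compare the point's value with the mean of its k nearest neighbors.
        neighbor_mean = mean(v for _, v in others[:k])
        diffs.append(vi - neighbor_mean)

    # Standardize the differences and flag the extreme ones.
    mu, sigma = mean(diffs), pstdev(diffs)
    if sigma == 0:
        return []
    return [i for i, d in enumerate(diffs) if abs((d - mu) / sigma) > threshold]


# A 3 x 3 grid of identical values with one anomalous center cell.
grid = [(x, y, 10.0) for x in range(3) for y in range(3)]
grid[4] = (1, 1, 100.0)
print(spatial_outliers(grid))
```

The comparison is made against neighbors rather than against the global mean, which is what distinguishes a spatial outlier from an ordinary one.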

AM-107 - Spatial Data Uncertainty

Although spatial data users may not be aware of the inherent uncertainty in all the datasets they use, it is critical to evaluate data quality in order to understand the validity and limitations of any conclusions based on spatial data. Spatial data uncertainty is inevitable as all representations of the real world are imperfect. This topic presents the importance of understanding spatial data uncertainty and discusses major methods and models to communicate, represent, and quantify positional and attribute uncertainty in spatial data, including both analytical and simulation approaches. Geo-semantic uncertainty that involves vague geographic concepts and classes is also addressed from the perspectives of fuzzy-set approaches and cognitive experiments. Potential methods that can be implemented to assess the quality of large volumes of crowd-sourced geographic data are also discussed. Finally, this topic ends with future directions to further research on spatial data quality and uncertainty.
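
To make the simulation approach concrete, here is a minimal Monte Carlo sketch that propagates positional uncertainty into a distance measurement. The independent Gaussian error model, the function name `simulate_distance`, and the parameters `sigma`, `n`, and `seed` are assumptions made for this example, not something the entry prescribes.

```python
import math
import random
from statistics import mean, stdev


def simulate_distance(p, q, sigma, n=5000, seed=0):
    """Estimate the mean and spread of the distance between two uncertain points.

    Assumes each coordinate has independent Gaussian error with standard
    deviation sigma (an illustrative error model only).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    samples = []
    for _ in range(n):
        # Perturb both endpoints, then measure the distance between them.
        px, py = rng.gauss(p[0], sigma), rng.gauss(p[1], sigma)
        qx, qy = rng.gauss(q[0], sigma), rng.gauss(q[1], sigma)
        samples.append(math.hypot(px - qx, py - qy))
    return mean(samples), stdev(samples)


mean_d, sd_d = simulate_distance((0.0, 0.0), (3.0, 4.0), sigma=0.1)
print(f"distance ~ {mean_d:.3f} +/- {sd_d:.3f}")
```

The spread of the simulated distances gives an empirical measure of how positional uncertainty in the inputs carries through to the derived quantity.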

AM-10 - Spatial Interaction

Spatial interaction (SI) is a fundamental concept in the GIScience literature, and may be defined in numerous ways. SI often describes the "flow" of individuals, commodities, capital, and information over (geographic) space resulting from a decision process. Alternatively, SI is sometimes used to refer to the influence of spatial proximity of places on the intensity of relations between those places. SI modeling as a separate research endeavor developed out of a need to mathematically model and understand the underlying determinants of these flows/influences. Proponents of SI modeling include economic geographers, regional scientists, and regional planners, as well as climate scientists, physicists, animal ecologists, and even some biophysical/environmental researchers. Originally developed from theories of interacting particles and gravitational forces in physics, SI modeling has developed through a series of refinements in terms of functional form, conceptual representations of distances, as well as a range of analytically rigorous technical improvements.
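
The gravitational analogy mentioned above can be sketched in its simplest unconstrained form, where the flow between two places grows with the product of their sizes and decays with distance. The function name `gravity_flow`, the scaling constant `k`, and the distance-decay exponent `beta` are illustrative assumptions; in practice such parameters are calibrated from observed flows.

```python
def gravity_flow(mass_i, mass_j, distance, k=1.0, beta=2.0):
    """Unconstrained gravity model: T_ij = k * m_i * m_j / d_ij ** beta.

    mass_i, mass_j: sizes of origin and destination (e.g. population).
    k, beta: hypothetical scaling constant and distance-decay exponent.
    """
    if distance <= 0:
        raise ValueError("distance must be positive")
    return k * mass_i * mass_j / distance ** beta


# Doubling the distance with beta = 2 cuts the predicted flow to a quarter.
print(gravity_flow(100, 200, 10))
print(gravity_flow(100, 200, 20))
```

Later refinements of SI models change the functional form of the decay term and the way distance is represented, which this sketch deliberately leaves out.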

AM-106 - Error-based Uncertainty

The largest contributing factor to spatial data uncertainty is error. Error is defined as the departure of a measure from its true value. Uncertainty results from (1) a lack of knowledge of the extent and expression of errors and (2) their propagation through analyses. Understanding error and its sources is key to addressing error-based uncertainty in geospatial practice. This entry presents a sample of issues related to error and error-based uncertainty in spatial data. These consist of (1) types of error in spatial data, (2) the special case of scale and its relationship to error, and (3) approaches to quantifying error in spatial data.
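
One common way to quantify error as a departure from true values is the root mean square error (RMSE) of measured positions against reference positions. The sketch below is an assumption-laden example rather than the entry's own method: the function name `positional_rmse` and the paired-coordinate input layout are hypothetical.

```python
import math


def positional_rmse(measured, reference):
    """Root mean square positional error between paired (x, y) coordinates.

    measured, reference: equal-length lists of (x, y) tuples, where the
    reference values stand in for the "true" positions.
    """
    if len(measured) != len(reference) or not measured:
        raise ValueError("inputs must be non-empty and the same length")
    # Squared planar distance between each measured point and its reference.
    squared_errors = [
        (mx - rx) ** 2 + (my - ry) ** 2
        for (mx, my), (rx, ry) in zip(measured, reference)
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


measured = [(3.0, 4.0), (6.0, 8.0)]
reference = [(0.0, 0.0), (3.0, 4.0)]
print(positional_rmse(measured, reference))
```

Because the reference positions are themselves measured, a figure like this describes error relative to a higher-accuracy source rather than to an unknowable true value.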