Analytics and Modeling

This knowledge area embodies a variety of data driven analytics, geocomputational methods, simulation and model driven approaches designed to study complex spatial-temporal problems, develop insights into characteristics of geospatial data sets, create and test geospatial process models, and construct knowledge of the behavior of geographically-explicit and dynamic processes and their patterns.

Topics in this Knowledge Area are listed thematically below. Existing topics are in regular font and linked directly to their original entries (published in 2006; these contain only Learning Objectives). Entries that have been updated and expanded are in bold. Forthcoming, future topics are italicized

 

Methodological Context Surface & Field Analyses Space-Time Analysis & Modeling
Geospatial Analysis & Model Building Modeling Surfaces Time Geography
Changing Context of GIScience Gridding, Interpolation, and Contouring Capturing Spatio-Temporal Dynamics in Computational Modeling 
Building Blocks Inverse Distance Weighting GIS-Based Computational Modeling
Overlay & Combination Operations Radial Basis & Spline Functions Computational Movement Analysis
Areal Interpolation Polynomial Functions Volumes and Space-Time Volumes
Aggregation of Spatial Entities Kriging Interpolation  
Classification & Clustering LiDAR Point Cloud Analysis Geocomputational Methods & Models
Boundaries & Zone Membership Intervisibility, Line-of-Sight, and Viewsheds Cellular Automata
Spatial Queries Digital Elevation Models & Terrain Metrics Agent-based Modeling
Buffers TIN-based Models and Terrain Metrics Simulation Modeling
Grid Operations & Map Algebra Watersheds & Drainage Networks Artificial Neural Networks
Data Exploration & Spatial Statistics 3D Parametric Surfaces Genetic Algorithms & Evolutionary Computing 
Spatial Statistics Network & Location Analysis Big Data & Geospatial Analysis
Spatial Sampling for Spatial Analysis Intro to Network & Location Analysis Problems of Large Spatial Databases
Exploratory Spatial Data Analysis (ESDA) Location & Service Area Problems Pattern Recognition & Matching
Point Pattern Analysis Network Route & Tour Problems Artificial Intelligence Approaches
Kernels & Density Estimation Modelling Accessibility Intro to Spatial Data Mining
Spatial Interaction Location-allocation Modeling Rule Learning for Spatial Data Mining
Cartographic Modeling The Classic Transportation Problem Machine Learning Approaches
Multi-criteria Evaluation   CyberGIS and Cyberinfrastructure
Grid-based Statistics and Metrics   Analysis of Errors & Uncertainty
Landscape Metrics   Error-based Uncertainty
Hot-spot and Cluster Analysis   Conceptual Models of Error & Uncertainty
Global Measures of Spatial Association   Spatial Data Uncertainty
Local Indicators of Spatial Autocorrelation   Problems of Scale & Zoning
Simple Regression & Trend Surface Analysis   Thematic Accuracy & Assessment
Geographically Weighted Regression   Stochastic Simulation & Monte Carlo Methods
Spatial Autoregressive Models   Mathematical Models of Uncertainty
Spatial Filtering Models   Fuzzy Aggregation Operators

 

AM-84 - Simulation Modeling

Advances in computational capacity have enabled dynamic simulation modeling to become increasingly widespread in scientific research. As opposed to conceptual or physical models, simulation models enable numerical experimentation with alternative parametric assumptions for a given model design. Numerous design choices are made in model development that involve continuous or discrete representations of time and space. Simulation modeling approaches include system dynamics, discrete event simulation, agent-based modeling, and multi-method modeling. The model development process involves a shift from qualitative design to quantitative analysis upon implementation of a model in a computer program or software platform. Upon implementation, model analysis is performed through rigorous experimentation to test how model structure produces simulated patterns of behavior over time and space. Validation of a model through correspondence of simulated results with observed behavior facilitates its use as an analytical tool for evaluating strategies and policies that would alter system behavior.

AM-32 - Spatial Autoregressive Models

Regression analysis is a statistical technique commonly used in the social and physical sciences to model relationships between variables. To make unbiased, consistent, and efficient inferences about real-world relationships a researcher using regression analysis relies on a set of assumptions about the process generating the data used in the analysis and the errors produced by the model. Several of these assumptions are frequently violated when the real-world process generating the data used in the regression analysis is spatially structured, which creates dependence among the observations and spatial structure in the model errors. To avoid the confounding effects of spatial dependence, spatial autoregression models include spatial structures that specify the relationships between observations and their neighbors. These structures are most commonly specified using a weights matrix that can take many forms and be applied to different components of the spatial autoregressive model. Properly specified, including these structures in the regression analysis can account for the effects of spatial dependence on the estimates of the model and allow researchers to make reliable inferences. While spatial autoregressive models are commonly used in spatial econometric applications, they have wide applicability for modeling spatially dependent data.

AM-107 - Spatial Data Uncertainty

Although spatial data users may not be aware of the inherent uncertainty in all the datasets they use, it is critical to evaluate data quality in order to understand the validity and limitations of any conclusions based on spatial data. Spatial data uncertainty is inevitable as all representations of the real world are imperfect. This topic presents the importance of understanding spatial data uncertainty and discusses major methods and models to communicate, represent, and quantify positional and attribute uncertainty in spatial data, including both analytical and simulation approaches. Geo-semantic uncertainty that involves vague geographic concepts and classes is also addressed from the perspectives of fuzzy-set approaches and cognitive experiments. Potential methods that can be implemented to assess the quality of large volumes of crowd-sourced geographic data are also discussed. Finally, this topic ends with future directions to further research on spatial data quality and uncertainty.

AM-33 - Spatial filtering
  • Identify modeling situations where spatial filtering might not be appropriate
  • Demonstrate how spatial autocorrelation can be “removed” by resampling
  • Explain how dissolving clusters of blocks with similar values may resolve the spatial correlation problem
  • Explain how the Getis and Tiefelsdorf-Griffith spatial filtering techniques incorporate spatial component variables into OLS regression analysis in order to remedy misspecification and the problem of spatially auto-correlated residuals
  • Explain how spatial correlation can result as a side effect of the spatial aggregation in a given dataset
  • Describe the relationship between factorial kriging and spatial filtering
AM-10 - Spatial Interaction

Spatial interaction (SI) is a fundamental concept in the GIScience literature, and may be defined in numerous ways. SI often describes the "flow" of individuals, commodities, capital, and information over (geographic) space resulting from a decision process. Alternatively, SI is sometimes used to refer to the influence of spatial proximity of places on the intensity of relations between those places. SI modeling as a separate research endeavor developed out of a need to mathematically model and understand the underlying determinants of these flows/influences. Proponents of SI modeling include economic geographers, regional scientists, and regional planners, as well as climate scientists, physicists, animal ecologists, and even some biophysical/environmental researchers. Originally developed from theories of interacting particles and gravitational forces in physics, SI modeling has developed through a series of refinements in terms of functional form, conceptual representations of distances, as well as a range of analytically rigorous technical improvements.
 

AM-26 - Spatial sampling for statistical analysis
  • List and describe several spatial sampling schemes and evaluate each one for specific applications
  • Differentiate between model-based and design-based sampling schemes
  • Design a sampling scheme that will help detect when space-time clusters of events occur
  • Create spatial samples under a variety of requirements, such as coverage, randomness, and transects
  • Describe sampling schemes for accurately estimating the mean of a spatial data set
AM-42 - The Classic Transportation Problem

The classic transportation problem concerns minimizing the cost of transporting a product from sources/supplies to destinations/demands. It is a network-flow problem that arises in industrial logistics and is often solved by linear programming (LP). The three inputs of the model are total units produced at each source, total units needed at each destination, and the cost to transport one unit from each source to each destination. And the objective is to minimize the total cost of transporting all units produced at sources to meet the demands at destinations. The problem solution includes three basic steps: 1) finding an initial basic feasible solution, 2) checking if the current solution is optimal (with the lowest costs), and improving the current solution through iteration. Solving such a problem relies strongly on the network data models, least-cost path algorithms, other functionalities in GIS. And an integrated framework is often adopted to utilize both GIS and non-GIS linear programming solvers to search for the optimal solution.

AM-21 - The Evolution of Geospatial Reasoning, Analytics, and Modeling

The field of geospatial analytics and modeling has a long history coinciding with the physical and cultural evolution of humans. This history is analyzed relative to the four scientific paradigms: (1) empirical analysis through description, (2) theoretical explorations using models and generalizations, (3) simulating complex phenomena and (4) data exploration. Correlations among developments in general science and those of the geospatial sciences are explored. Trends identify areas ripe for growth and improvement in the fourth and current paradigm that has been spawned by the big data explosion, such as exposing the ‘black box’ of GeoAI training and generating big geospatial training datasets. Future research should focus on integrating both theory- and data-driven knowledge discovery.

AM-34 - The Geographically Weighted Regression Framework

Local multivariate statistical models are increasingly encountered in geographical research to estimate spatially varying relationships between a response variable and its associated predictor variables. In geography and many other disciplines, such models have been largely embedded within the framework of regression and can reveal significantly more information about the determinants of observed spatial distribution of the dependent variable than their global regression model counterparts. This section introduces one type of local statistical modeling framework: Geographically Weighted Regression (GWR). Models within this framework estimate location-specific parameter estimates for each covariate, local diagnostic statistics, and bandwidth parameters as indicators of the spatial scales at which the modeled processes operate. These models provide an effective means to estimate how the same factors may evoke different responses across locations and by so doing, bring to the fore the role of geographical context on human preferences and behavior.

AM-86 - Theory of error propagation
  • Describe stochastic error models
  • Exemplify stochastic error models used in GIScience

Pages