Multivariable geostatistics in S: the gstat package

This paper discusses advantages and shortcomings of the S environment for multivariable geostatistics, in particular when extended with the gstat package, an extension package for the S environments (R, S-Plus). The gstat S package provides multivariable geostatistical modelling, prediction and simulation, as well as several visualisation functions. In particular, it makes the calculation, simultaneous fitting, and visualisation of a large number of direct and cross (residual) variograms very easy. Gstat was started 10 years ago and was released under the GPL in 1996; gstat.org was started in 1998. Gstat was not initially written for teaching purposes, but for research purposes, emphasising flexibility, scalability and portability. It can deal with a large number of practical issues in geostatistics, including change of support (block kriging), simple/ordinary/universal (co)kriging, fast local neighbourhood selection, flexible trend modelling, variables with different sampling configurations, and efficient simulation of large spatially correlated random fields, indicator kriging and simulation, and (directional) variogram and cross variogram modelling. The formula/models interface of the S language is used to define multivariable geostatistical models. This paper introduces the gstat S package, and discusses a number of design and implementation issues. It also draws attention to a number of papers on integration of spatial statistics software, GIS and the S environment that were presented on the spatial statistics workshop and sessions during the conference Distributed Statistical Computing 2003.

[1]  Hanan Samet,et al.  Ranking in Spatial Databases , 1995, SSD.

[2]  Hans Wackernagel,et al.  Multivariate Geostatistics: An Introduction with Applications , 1996 .

[3]  Edzer Pebesma,et al.  GSTAT: a program for geostatistical modelling, prediction and simulation , 1998 .

[4]  Brian D. Ripley Spatial Statistics in R , 2001 .

[5]  Michael Edward Hohn,et al.  An Introduction to Applied Geostatistics: by Edward H. Isaaks and R. Mohan Srivastava, 1989, Oxford University Press, New York, 561 p., ISBN 0-19-505012-6, ISBN 0-19-505013-4 (paperback), $55.00 cloth, $35.00 paper (US) , 1991 .

[6]  David W. Scott The New S Language , 1990 .

[7]  Trevor Hastie,et al.  Statistical Models in S , 1991 .

[8]  A. Papritz,et al.  An Empirical Comparison of Kriging Methods for Nonlinear Spatial Point Prediction , 2002 .

[9]  David E. Stewart,et al.  Meschach : matrix computations in C , 1994 .

[10]  N. Cressie,et al.  Multivariable spatial prediction , 1994 .

[11]  Jacques Rivoirard On the Structural Link Between Variables in Kriging with External Drift , 2002 .

[12]  Noel A Cressie,et al.  Statistics for Spatial Data, Revised Edition. , 1994 .

[13]  John M. Chambers,et al.  Programming With Data , 1998 .

[14]  Edzer J. Pebesma,et al.  Latin Hypercube Sampling of Gaussian Random Fields , 1999, Technometrics.

[15]  Roger Bivand Approaches to Classes for Spatial Data in R , 2003 .

[16]  Timothy C. Coburn,et al.  Geostatistics for Natural Resources Evaluation , 2000, Technometrics.

[17]  Roger Bivand,et al.  Using the R statistical data analysis language on GRASS 5 , 2000 .

[18]  T. C. Haas,et al.  Model-based geostatistics. Discussion. Authors' reply , 1998 .

[19]  Fred Espen Benth,et al.  Kriging with Inequality Constraints , 2001 .

[20]  W. W. Stroup,et al.  A Generalized Linear Model Approach to Spatial Data Analysis and Prediction , 1997 .

[21]  Roger Woodard,et al.  Interpolation of Spatial Data: Some Theory for Kriging , 1999, Technometrics.

[22]  P. Burrough,et al.  Principles of geographical information systems , 1998 .

[23]  J. Gómez-Hernández,et al.  Joint Sequential Simulation of MultiGaussian Fields , 1993 .

[24]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .