Help for package dbmss

Type:

Package

Title:

Distance-Based Measures of Spatial Structures

Version:

2.11-0

Description:

Simple computation of spatial statistic functions of distance to characterize the spatial structures of mapped objects, following Marcon, Traissac, Puech, and Lang (2015) <doi:10.18637/jss.v067.c03>. Includes classical functions (Ripley's K and others) and more recent ones used by spatial economists (Duranton and Overman's Kd, Marcon and Puech's M). Relies on 'spatstat' for some core calculation.

URL:

https://ericmarcon.github.io/dbmss/, https://github.com/EricMarcon/dbmss/

BugReports:

https://github.com/EricMarcon/dbmss/issues/

License:

GPL-2 | GPL-3 [expanded from: GNU General Public License]

Depends:

R (≥ 3.5.0), Rcpp (≥ 0.12.14), spatstat.explore

Imports:

cubature, doFuture, foreach, future, ggplot2, methods, progressr, RcppParallel, reshape2, rlang, spatstat.geom, spatstat.utils, spatstat.random, stats, tibble

Suggests:

knitr, pkgdown, rmarkdown, testthat

LinkingTo:

Rcpp, RcppParallel

VignetteBuilder:

knitr

SystemRequirements:

pandoc, GNU make

Encoding:

UTF-8

LazyData:

true

NeedsCompilation:

yes

Packaged:

2025-06-17 13:14:06 UTC; emarc

Author:

Eric Marcon

[aut, cre], Gabriel Lang

[aut], Stephane Traissac

[aut], Florence Puech

[aut], Pierrot Froment [aut]

Maintainer:

Eric Marcon <eric.marcon@agroparistech.fr>

Repository:

CRAN

Date/Publication:

2025-06-17 23:40:02 UTC

Distance Based Measures of Spatial Structures

Description

Simple computation of spatial statistic functions of distance to characterize the spatial structures of mapped objects, including classical ones (Ripley's K and others) and more recent ones used by spatial economists (Duranton and Overman's Kd, Marcon and Puech's M). Relies on spatstat for some core calculation.

Author(s)

Eric Marcon, Gabriel Lang, Stephane Traissac, Florence Puech

Maintainer: Eric Marcon <Eric.Marcon@agroparistech.fr>

References

Marcon, E., and Puech, F. (2003). Evaluating the Geographic Concentration of Industries Using Distance-Based Methods. Journal of Economic Geography, 3(4), 409-428.

Marcon, E. and Puech, F. (2010). Measures of the Geographic Concentration of Industries: Improving Distance-Based Methods. Journal of Economic Geography 10(5): 745-762.

Marcon, E., Puech F. and Traissac, S. (2012). Characterizing the relative spatial structure of point patterns. International Journal of Ecology 2012(Article ID 619281): 11.

Lang G., Marcon E. and Puech F. (2014) Distance-Based Measures of Spatial Concentration: Introducing a Relative Density Function. HAL 01082178, 1-18.

Marcon, E., Traissac, S., Puech, F. and Lang, G. (2015). Tools to Characterize Point Patterns: dbmss for R. Journal of Statistical Software. 67(3): 1-15.

Marcon, E. and Puech, F. (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Checks the arguments of a function of the package dbmss

Description

This function is used internally to verify that arguments passed to dbmss functions such as Mhat are correct.

Usage

CheckdbmssArguments()

Details

The function compares the arguments passed to its parent function to the type they should be and performs some extra tests (e.g. risk threshold must be between 0 and 1). It stops if an argument is not correct.

Value

Returns TRUE or stops if a problem is detected.

Estimation of the confidence envelope of the D function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of D according to the confidence level.

Usage

DEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
          Cases, Controls, Intertype = FALSE, Global = FALSE,
          verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

Cases

One of the point types

Controls

One of the point types.

Intertype

Logical; if TRUE, D is computed as Di in Marcon and Puech (2012).

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

The only null hypothesis is random labeling: marks are distributed randomly across points.

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

The global envelope is calculated by iteration: the simulations reaching one of the upper or lower values at any distance are eliminated at each step. The process is repeated until Alpha / Number of simulations simulations are dropped. The remaining upper and lower bounds at all distances constitute the global envelope. Interpolation is used if the exact ratio cannot be reached.

Parallel simulations rely on the future and doFuture packages. Before calling the function with argument parallel = TRUE, you must choose a strategy and set it with plan. Their progress bar relies on the progressr package. They must be activated by the user by handlers.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Examples

data(paracou16)
# Keep only 20% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.2))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Calculate confidence envelope (should be 1000 simulations, reduced to 20 to save time)
r <- 0:30
NumberOfSimulations <- 20
Alpha <- .05
# Plot the envelope (after normalization by pi.r^2)
autoplot(DEnvelope(X, r, NumberOfSimulations, Alpha,
    "V. Americana", "Q. Rosea", Intertype = TRUE), ./(pi*r^2) ~ r)

Estimation of the D function

Description

Estimates the D function

Usage

Dhat(X, r = NULL, Cases, Controls = NULL, Intertype = FALSE, CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

Cases

One of the point types.

Controls

One of the point types. If NULL, controls are all types except for cases.

Intertype

Logical; if TRUE, D is computed as Di in Marcon and Puech (2012).

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

The Di function allows comparing the structure of the cases to that of the controls around cases, that is to say the comparison is made around the same points. This has been advocated by Arbia et al. (2008) and formalized by Marcon and Puech (2012).

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

Note

The computation of Dhat relies on spatstat functions Kest and Kcross.

References

Arbia, G., Espa, G. and Quah, D. (2008). A class of spatial econometric methods in the empirical analysis of clusters of firms in the space. Empirical Economics 34(1): 81-103.

Diggle, P. J. and Chetwynd, A. G. (1991). Second-Order Analysis of Spatial Clustering for Inhomogeneous Populations. Biometrics 47(3): 1155-1163.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Examples

data(paracou16)
autoplot(paracou16)

# Calculate D
r <- 0:30
(Paracou <- Dhat(paracou16, r, "V. Americana", "Q. Rosea", Intertype = TRUE))

# Plot (after normalization by pi.r^2)
autoplot(Paracou, ./(pi*r^2) ~ r)

Create a Distance table object.

Description

Creates an object of class "Dtable" representing a set of points with weights and labels and the distances between them.

Usage

Dtable(Dmatrix, PointType = NULL, PointWeight = NULL)

Arguments

Dmatrix

A square matrix containing distances.

PointType

A vector describing the point types. Its length must correspond to the number of points.

PointWeight

A vector describing the point weights. Its length must correspond to the number of points.

Details

The distance matrix is not necessarily symmetric, so distances are understood in the common sense, not in the mathematical sense. Asymmetric distances are appropriate when paths between points are one-way only.

The points of origin are in lines, the targets in columns. The diagonal of the matrix must contain zeros (the distance between a point and itself is 0), and all other distances must be positive (they can be 0).

Value

An object of class "Dtable". It is a list:

Dmatrix

The distance matrix.

n

The number of points.

marks

A list of two items: PointType, a vector of factors containing the point types and PointWeight, the numeric vector of weights.

Examples

# A Dtable containing two points
Dmatrix <- matrix(c(0,1,1,0), nrow=2)
PointType <- c("Type1", "Type2")
PointWeight <- c(2,3)
Dtable(Dmatrix, PointType, PointWeight)

Transform simulation values to an fv

Description

This function is used internally to calculate envelope values and store them into an fv.object.

Usage

FillEnvelope(Envelope, Alpha, Global)

Arguments

Envelope

An envelope object (envelope) containing all the simulated function values.

Alpha

The risk level.

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

Value

Returns the envelope object (envelope) with hi and lo values calculated from the simlations.

Estimation of the global confidence interval of simulations

Description

Calculates the global confidence interval envelope sensu Duranton and Overman (2005) according to simulations of the null hypothesis of a function.

Usage

GlobalEnvelope(Simulations, Alpha)

Arguments

Simulations

An object of class fv, see fv.object, containing the simulated values.

Alpha

The risk level.

Details

Value

A matrix with two lines:

Glo

the lower bound of the envelope

Ghi

the upper bound of the envelope

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Goodness of Fit test between a distance based measure of spatial structure and simulations of its null hypothesis

Description

Calculates the risk to reject the null hypothesis erroneously, based on the distribution of the simulations.

Usage

GoFtest(Envelope,
        Test = "DCLF",
        Scaling = "Asymmetric",
        Range = NULL,
        Alpha = 0.05,
        CheckArguments = TRUE)

Arguments

Envelope

An envelope object (envelope) containing simulations in its simfuns attribute. It may be the result of any estimation function of the dbmss package or obtained by the envelope function with argument savefuns=TRUE.

Test

A string specifying the method to summarize the deviation from the null hypothesis. The deviation may be summarized different ways:

"DCLF": the integrated squared deviation is utilized, a Diggle-Cressie-Loosmore-Ford test is performed (default);
"Integral": the integrated absolute deviation is utilized;
"MAD": the Maximum Absolute Deviation is utilized, a MAD test is performed.

Scaling

A string specifying the method to scale the residuals of the test. Scaling may be:

"Asymmetric": the differences between the 2.5% lower quantiles of simulations and the expected value is utilized to scale negative residuals, and the differences between the 2.5% upper quantiles and the expected value is utilized to scale positive residuals (default);
"Quantile": ranges between the 2.5% upper and 2.5% lower quantiles is utilized to scale residuals, disregarding whether residuals are negative or positive;
"Studentized": the standard deviation of simulations is utilized.
"None": does not scale residuals.

Range

A vector of length two containing the minimum and the maximum distance over which to compute the test. If NULL, or if the selected range is outside the simulated distances, all distances in the Envelope argument are used.

Alpha

The risk level, 5% by default.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

This function gathers multiple Goodness of Fit tests: the DCLF test (Diggle 1986, Cressie 1993, Loosmore & Ford 2006, Marcon et al. 2012, Myllymäki et al. 2015), the integrated deviation test (Diggle 1979), and the MAD test (Diggle 1979, Myllymäki et al. 2015).

Monte Carlo simulations assess how well observed distance-based measures of spatial structure align with expected measures under the null hypothesis, estimated here by the mean value of simulations. For both observed and simulated measures, residuals — calculated as the differences between observed and expected values — are computed at each distance r. These residuals are then transformed into test-specific statistics u:

Test = "MAD" uses the maximum absolute residual;
Test = "Integral" and Test = "DCLF" use residuals to approximate the integrated deviation, and the integrated squared deviation.

A rank test on u evaluates the null hypothesis that the observed point pattern originates from the same point process as the simulations.

The unequal variance of the residuals at different intervals of r influences u statistics, and the power of Goodness of Fit tests. Myllymäki et al. (2015) proposed to scale residuals using pointwise quantiles (Scaling = "Asymmetric", and Scaling = "Quantile"), and pointwise standard deviations (Scaling = "Studentized").

Goodness of Fit tests are sensitive to the distance interval over which they are performed. It is recommended to choose the distance interval to test based on a priori hypotheses (Wiegand & Moloney 2013, Baddeley et al. 2015).

Value

A p-value.

Note

Ktest is a much better test (it does not rely on simulations) but it is limited to the K function against complete spatial randomness (CSR) in a rectangle window.

This test is inspired from Myllymäki et al. (2015), and a similar function deviation_test() exists in the R package GET (Myllymäki & Mrkvička, 2024)

References

Baddeley, A., Rubak, E., & Turner, R. (2015). Spatial Point Patterns: Methodology and Applications with R (0 ed.). Chapman and Hall/CRC. https://doi.org/10.1201/b19708

Cressie, N. A. C. (1993). Statistics for Spatial Data (1st ed.). Wiley. https://doi.org/10.1002/9781119115151

Diggle, P. J. (1979). On Parameter Estimation and Goodness-of-Fit Testing for Spatial Point Patterns. Biometrics, 35(1), 87. https://doi.org/10.2307/2529938

Diggle, P. J. (1986). Displaced amacrine cells in the retina of a rabbit: Analysis of a bivariate spatial point pattern. Journal of Neuroscience Methods, 18(1–2), 115–125. https://doi.org/10.1016/0165-0270(86)90115-9

Loosmore, N. B., & Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology, 87(8), 1925–1931. https://doi.org/10.1890/0012-9658(2006)87[1925:SIUTGO]2.0.CO;2

Marcon, E., Puech, F., & Traissac, S. (2012). Characterizing the Relative Spatial Structure of Point Patterns. International Journal of Ecology, 2012, 1–11. https://doi.org/10.1155/2012/619281

Myllymäki, M., & Mrkvička, T. (2024). GET: Global Envelopes in R. Journal of Statistical Software, 111(3), 1-40. https://doi.org/10.18637/jss.v111.i03

Myllymäki, M., Grabarnik, P., Seijo, H., & Stoyan, D. (2015). Deviation test construction and power comparison for marked spatial point patterns. Spatial Statistics, 11, 19–34. https://doi.org/10.1016/j.spasta.2014.11.004

Wiegand, T., & Moloney, K. A. (2013). Handbook of Spatial Point-Pattern Analysis in Ecology. Chapman and Hall/CRC. https://doi.org/10.1201/b16195

Examples


# Simulate a Matern (Neyman Scott) point pattern
nclust <- function(x0, y0, radius, n) {
  return(runifdisc(n, radius, centre=c(x0, y0)))
}
X <- rNeymanScott(20, 0.2, nclust, radius=0.3, n=10)
autoplot(as.wmppp(X))

# Calculate confidence envelope (should be 1000 simulations, reduced to 50 to save time)
r <- seq(0, 0.3, 0.01)
NumberOfSimulations <- 50
Alpha <- .10
Envelope <- KEnvelope(as.wmppp(X), r, NumberOfSimulations, Alpha)
autoplot(Envelope, ./(pi*r^2) ~ r)

# DCLF test using asymmetric scaling.
# Power is correct if enough simulations are run (say >1000).
paste(
  "p-value =",
  GoFtest(Envelope, Test = "DCLF", Scaling = "Asymmetric", Range = c(0.1, 0.2))
)

# MAD test using asymmetric scaling.
paste(
  "p-value =",
  GoFtest(Envelope, Test = "MAD", Scaling = "Asymmetric", Range = c(0.1, 0.2))
)

Estimation of the confidence envelope of the K function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of K according to the confidence level.

Usage

KEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
          ReferenceType = "", NeighborType = ReferenceType,
          SimulationType = "RandomPosition", Precision = 0, Global = FALSE,
          verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

ReferenceType

One of the point types. Default is all point types.

NeighborType

One of the point types. By default, the same as reference type.

SimulationType

A string describing the null hypothesis to simulate. The null hypothesis may be "RandomPosition": points are drawn in a Poisson process (default); "RandomLabeling": randomizes point types, keeping locations unchanged; "PopulationIndependence": keeps reference points unchanged, shifts other point locations.

Precision

Accuracy of point coordinates, measured as a part of distance unit. See rRandomPositionK. Default is 0 for no approximation.

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Silverman, B. W. (1986). Density estimation for statistics and data analysis. Chapman and Hall, London.

Examples

data(paracou16)
# Keep only 20% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.2))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Calculate confidence envelope (should be 1000 simulations, reduced to 20 to save time)
r <- 0:30
NumberOfSimulations <- 20
# Plot the envelope
autoplot(KEnvelope(X, r, NumberOfSimulations), ./(pi*r^2) ~ r)

Estimation of the confidence envelope of the Kd function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of Kd according to the confidence level.

Usage

KdEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05, ReferenceType,
           NeighborType = ReferenceType, Weighted = FALSE, Original = TRUE,
           Approximate = ifelse(X$n < 10000, 0, 1), Adjust = 1, MaxRange = "ThirdW",
           StartFromMinR = FALSE,
           SimulationType = "RandomLocation", Global = FALSE,
           verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object) or a Dtable object.

r

A vector of distances. If NULL, a default value is set: 512 equally spaced values are used, and the first 256 are returned, corresponding to half the maximum distance between points (following Duranton and Overman, 2005).

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

ReferenceType

One of the point types.

NeighborType

One of the point types. By default, the same as reference type.

Weighted

Logical; if TRUE, estimates the Kemp function.

Original

Logical; if TRUE (by default), the original bandwidth selection by Duranton and Overman (2005) following Silverman (2006: eq 3.31) is used. If FALSE, it is calculated following Sheather and Jones (1991), i.e. the state of the art. See bw.SJ for more details.

Approximate

if not 0 (1 is a good choice), exact distances between pairs of points are rounded to 1024 times Approximate single values equally spaced between 0 and the largest distance. This technique (Scholl and Brenner, 2015) allows saving a lot of memory when addressing large point sets (the default value is 1 over 10000 points). Increasing Approximate allows better precision at the cost of proportional memory use. Ignored if X is a Dtable object.

Adjust

Force the automatically selected bandwidth (following Silverman, 1986) to be multiplied by Adjust. Setting it to values lower than one (1/2 for example) will sharpen the estimation. If not 1, Original is ignored.

MaxRange

The maximum value of r to consider, ignored if r is not NULL. Default is "ThirdW", one third of the diameter of the window. Other choices are "HalfW", and "QuarterW" and "D02005". "HalfW", and "QuarterW" are for half or the quarter of the diameter of the window. "D02005" is for the median distance observed between points, following Duranton and Overman (2005). "ThirdW" should be close to "DO2005" but has the advantage to be independent of the point types chosen as ReferenceType and NeighborType, to simplify comparisons between different types. "D02005" is approximated by "ThirdW" if Approximate is not 0. if X is a Dtable object, the diameter of the window is taken as the max distance between points.

StartFromMinR

Logical; if TRUE, points are assumed to be further from each other than the minimum observed distance, So Kd will not be estimated below it: it is assumed to be 0. If FALSE, by default, distances are smoothed down to r=0. Ignored if Approximate is not 0: then, estimation always starts from r=0.

SimulationType

A string describing the null hypothesis to simulate. The null hypothesis may be "RandomLocation": points are redistributed on the actual locations (default); "RandomLabeling": randomizes point types, keeping locations and weights unchanged; "PopulationIndependence": keeps reference points unchanged, randomizes other point locations.

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Scholl, T. and Brenner, T. (2015) Optimizing distance-based methods for large data sets, Journal of Geographical Systems 17(4): 333-351.

Silverman, B. W. (1986). Density estimation for statistics and data analysis. Chapman and Hall, London.

Examples

data(paracou16)
autoplot(paracou16[marks(paracou16)$PointType=="Q. Rosea"])

# Calculate confidence envelope
plot(KdEnvelope(paracou16, , ReferenceType="Q. Rosea", Global=TRUE))

# Center of the confidence interval
Kdhat(paracou16, ReferenceType="") -> kd
lines(kd$Kd ~ kd$r, lty=3, col="green")

Estimation of the Kd function

Description

Estimates the Kd function

Usage

 Kdhat(X, r = NULL, ReferenceType, NeighborType = ReferenceType, Weighted = FALSE,
       Original = TRUE, Approximate = ifelse(X$n < 10000, 0, 1), Adjust = 1,
       MaxRange = "ThirdW", StartFromMinR = FALSE, CheckArguments = TRUE)

Arguments

X

A weighted, marked planar point pattern (wmppp.object) or a Dtable object.

r

A vector of distances. If NULL, a default value is set: 512 equally spaced values are used, from the smallest distance between points to half the diameter of the window.

ReferenceType

One of the point types. If "", all points are considered (this is not the default value; NeighborType is ignored then) to estimate the average value of simulated Kd values under the null hypothesis of RandomLocation (Marcon and Puech, 2012).

NeighborType

One of the point types. By default, the same as reference type.

Weighted

Logical; if TRUE, estimates the Kemp function.

Original

Logical; if TRUE (by default), the original bandwidth selection by Duranton and Overman (2005) following Silverman (1986: eq 3.31) is used. If FALSE, it is calculated following Sheather and Jones (1991), i.e. the state of the art. See bw.SJ for more details.

Approximate

Adjust

Force the automatically selected bandwidth (following Original) to be multiplied by Adjust. Setting it to values lower than one (1/2 for example) will sharpen the estimation.

MaxRange

StartFromMinR

Logical; if TRUE, points are assumed to be further from each other than the minimum observed distance, so Kd will not be estimated below it: it is assumed to be 0. If FALSE, distances are smoothed down to r=0. Ignored if Approximate is not 0: then, estimation always starts from r=0.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Kd is a density, absolute measure of a point pattern structure. Kd is computed efficiently by building a matrix of distances between point pairs and calculating the density of their distribution (the default values of r are those of the density function). The kernel estimator is Gaussian.

The weighted Kd function has been named Kemp (emp is for employees) by Duranton and Overman (2005).

If X is not a Dtable object, the maximum value of r is obtained from the geometry of the window rather than caculating the median distance between points as suggested by Duranton and Overman (2005) to save (a lot of) calculation time.

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

Note

Estimating Kd relies on calculating distances, exactly or approximately (if Approximate is not 0). Then distances are smoothed by estimating their probability density. Reflection is used to estimate density close to the lowest distance, that is the minimum observed distance (if StartFromMinR is TRUE) or 0: all distances below 4 times the estimation kernel bandwith apart from the lowest distance are duplicated (symmetrically with respect to the lowest distance) to avoid edge effects (underestimation of the density close to the lowest distance).

Density estimation heavily relies on the bandwith. Starting from version 2.7, the optimal bandwith is computed from the distribution of distances between pairs of points up to twice the maximum distance considered. The consequence is that choosing a smaller range of distances in argument r results in less smoothed Kd values. The default values (r = NULL, MaxRange = "ThirdW") are such that almost all the pairs of points (except those more than 2/3 of the window diameter apart) are taken into account to determine the bandwith.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Scholl, T. and Brenner, T. (2015) Optimizing distance-based methods for large data sets, Journal of Geographical Systems 17(4): 333-351.

Sheather, S. J. and Jones, M. C. (1991) A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society series B, 53, 683-690.

Silverman, B. W. (1986). Density estimation for statistics and data analysis. Chapman and Hall, London.

Examples

data(paracou16)
autoplot(paracou16)

# Calculate Kd
(Paracou <- Kdhat(paracou16, , "Q. Rosea", "V. Americana"))
# Plot
autoplot(Paracou)

Estimation of the K function

Description

Estimates the K function

Usage

Khat(X, r = NULL, ReferenceType = "", NeighborType = ReferenceType, CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

ReferenceType

One of the point types. Default is all point types.

NeighborType

One of the point types. By default, the same as reference type.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

K is a cumulative, topographic measure of a point pattern structure.

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

Note

The computation of Khat relies on spatstat functions Kest and Kcross.

References

Ripley, B. D. (1976). The Foundations of Stochastic Geometry. Annals of Probability 4(6): 995-998.

Ripley, B. D. (1977). Modelling Spatial Patterns. Journal of the Royal Statistical Society B 39(2): 172-212.

Examples

data(paracou16)
autoplot(paracou16)

# Calculate K
r <- 0:30
(Paracou <- Khat(paracou16, r))

# Plot (after normalization by pi.r^2)
autoplot(Paracou, ./(pi*r^2) ~ r)

Estimation of the confidence envelope of the Kinhom function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of Kinhom according to the confidence level.

Usage

KinhomEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
               ReferenceType = "", lambda = NULL,
               SimulationType = "RandomPosition", Global = FALSE,
               verbose = interactive(), parallel = FALSE,
               parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

NumberOfSimulations

The number of simulations to run.

Alpha

The risk level.

ReferenceType

One of the point types. Default is all point types.

lambda

An estimation of the point pattern density, obtained by the density.ppp function.

SimulationType

A string describing the null hypothesis to simulate. The null hypothesis, may be "RandomPosition": points are drawn in an inhomogenous Poisson process (intensity is either lambda or estimated from X); "RandomLocation": points are redistributed across actual locations; "RandomLabeling": randomizes point types, keeping locations unchanged; "PopulationIndependence": keeps reference points unchanged, redistributes others across actual locations.

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

The random location null hypothesis is that of Duranton and Overman (2005). It is appropriate to test the univariate Kinhom function of a single point type, redistributing it over all point locations. It allows fixing lambda along simulations so the warning message can be ignored.

The random labeling hypothesis is appropriate for the bivariate Kinhom function.

The population independence hypothesis is that of Marcon and Puech (2010).

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and Puech, F. (2010). Measures of the Geographic Concentration of Industries: Improving Distance-Based Methods. Journal of Economic Geography 10(5): 745-762.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Examples

data(paracou16)
# Keep only 20% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.2))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Density of all trees
lambda <- density.ppp(X, bw.diggle(X))
plot(lambda)
V.americana <- X[marks(X)$PointType=="V. Americana"]
plot(V.americana, add=TRUE)

# Calculate Kinhom according to the density of all trees
# and confidence envelope (should be 1000 simulations, reduced to 4 to save time)
r <- 0:30
NumberOfSimulations <- 4
Alpha <- .10
autoplot(KinhomEnvelope(X, r,NumberOfSimulations, Alpha, ,
    SimulationType="RandomPosition", lambda=lambda), ./(pi*r^2) ~ r)

Estimation of the inhomogenous K function

Description

Estimates the Kinhom function

Usage

Kinhomhat(X, r = NULL, ReferenceType = "", lambda = NULL, CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

ReferenceType

One of the point types. Default is all point types.

lambda

An estimation of the point pattern density, obtained by the density.ppp function.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Kinhom is a cumulative, topographic measure of an inhomogenous point pattern structure.

By default, density estimation is performed at points by density.ppp using the optimal bandwith (bw.diggle). It can be calculated separately (see example), including at pixels if the point pattern is too large for the default estimation to succeed, and provided as the argument lambda: Arbia et al. (2012) for example use another point pattern as a reference to estimate density.

Bivariate Kinhom is not currently supported.

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

Note

The computation of Kinhomhat relies on spatstat functions Kinhom, density.ppp and bw.diggle.

References

Baddeley, A. J., J. Moller, et al. (2000). Non- and semi-parametric estimation of interaction in inhomogeneous point patterns. Statistica Neerlandica 54(3): 329-350.

Arbia, G., G. Espa, et al. (2012). Clusters of firms in an inhomogeneous space: The high-tech industries in Milan. Economic Modelling 29(1): 3-11.

Examples

data(paracou16)

# Density of all trees
lambda <- density.ppp(paracou16, bw.diggle(paracou16))
plot(lambda)
# Reduce the point pattern to one type of trees
V.americana <- paracou16[marks(paracou16)$PointType=="V. Americana"]
plot(V.americana, add=TRUE)

# Calculate Kinhom according to the density of all trees
r <- 0:30
autoplot(Kinhomhat(paracou16, r, "V. Americana", lambda), ./(pi*r^2) ~ r)

Estimation of the confidence envelope of the Lmm function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of Lmm according to the confidence level.

Usage

KmmEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
            ReferenceType = "", Global = FALSE,
            verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

ReferenceType

One of the point types. Others are ignored. Default is all point types.

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Examples

data(paracou16)
# Keep only 20% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.2))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Calculate confidence envelope (should be 1000 simulations, reduced to 4 to save time)
r <- seq(0, 30, 2)
NumberOfSimulations <- 4
Alpha <- .10
autoplot(KmmEnvelope(X, r, NumberOfSimulations, Alpha), ./(pi*r^2) ~ r)

Estimation of the Kmm function

Description

Estimates of the Kmm function

Usage

Kmmhat(X, r = NULL, ReferenceType = "", CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

ReferenceType

One of the point types. Others are ignored. Default is all point types.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

The Kmm function is used to test the independence of marks.

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

Note

The function is computed using markcorrint in spatstat.

References

Penttinen, A., Stoyan, D. and Henttonen, H. M. (1992). Marked Point Processes in Forest Statistics. Forest Science 38(4): 806-824.

Penttinen, A. (2006). Statistics for Marked Point Patterns. in The Yearbook of the Finnish Statistical Society. The Finnish Statistical Society, Helsinki: 70-91.

Examples

data(paracou16)
# Keep only 50% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.5))
autoplot(X, 
  labelSize = expression("Basal area (" ~cm^2~ ")"), 
  labelColor = "Species")

# Calculate Kmm
r <- seq(0, 30, 2)
(Paracou <- Kmmhat(X, r))

# Plot
autoplot(Paracou, ./(pi*r^2) ~ r)

Test of a point pattern against Complete Spatial Randomness

Description

Tests the point pattern against CSR using values of the K function

Usage

Ktest(X, r)

Arguments

X

A point pattern (ppp.object). Marks are ignored. The window must be a rectangle sensu spatstat (tested by is.rectangle).

r

A vector of distances.

Details

The test returns the risk to reject CSR erroneously, i.e. the p-value of the test, based on the distribution of the K function.

If r includes 0, it will be silently removed because no neighbor point can be found at distance 0. The longer r, the more accurate the test is in theory but at the cost of computation time first, and of computation accuracy then because a matrix of size the length of r must be inverted. 10 values in r seems to be a reasonable choice.

Value

A p-value.

Author(s)

Gabriel Lang <Gabriel.Lang@agroparistech.fr>, Eric Marcon<Eric.Marcon@agroparistech.fr>

References

Lang, G. and Marcon, E. (2013). Testing randomness of spatial point patterns with the Ripley statistic. ESAIM: Probability and Statistics. 17: 767-788.

Marcon, E., S. Traissac, and Lang, G. (2013). A Statistical Test for Ripley's Function Rejection of Poisson Null Hypothesis. ISRN Ecology 2013(Article ID 753475): 9.

Examples

# Simulate a Matern (Neyman Scott) point pattern
nclust <- function(x0, y0, radius, n) {
  return(runifdisc(n, radius, centre=c(x0, y0)))
}
X <- rNeymanScott(20, 0.1, nclust, radius=0.2, n=5)
autoplot(as.wmppp(X))

# Test it
Ktest(X, r=seq(0.1, .5, .1))

Estimation of the confidence envelope of the L function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of L according to the confidence level.

Usage

LEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
          ReferenceType = "", NeighborType = "",
          SimulationType = "RandomPosition", Precision = 0, Global = FALSE,
          verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

ReferenceType

One of the point types. Default is all point types.

NeighborType

One of the point types. Default is all point types.

SimulationType

Precision

Accuracy of point coordinates, measured as a part of distance unit. See rRandomPositionK. Default is 0 for no approximation.

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Examples

data(paracou16)
# Keep only 20% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.2))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Calculate confidence envelope (should be 1000 simulations, reduced to 20 to save time)
r <- 0:30
NumberOfSimulations <- 20
# Plot the envelope
autoplot(LEnvelope(X, r, NumberOfSimulations))

Estimation of the L function

Description

Estimates the L function

Usage

Lhat(X, r = NULL, ReferenceType = "", NeighborType = "", CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

ReferenceType

One of the point types. Default is all point types.

NeighborType

One of the point types. Default is all point types.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

L is the normalized version of K: L(r)=\sqrt{\frac{K}{\pi}}-r.

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

Note

L was originally defined as L(r)=\sqrt{\frac{K}{\pi}}. It has been used as L(r)=\sqrt{\frac{K}{\pi}}-r in a part of the literature because this normalization is easier to plot.

References

Besag, J. E. (1977). Comments on Ripley's paper. Journal of the Royal Statistical Society B 39(2): 193-195.

Examples

data(paracou16)
autoplot(paracou16)

# Calculate L
r <- 0:30
(Paracou <- Lhat(paracou16, r))

# Plot
autoplot(Paracou)

Estimation of the confidence envelope of the Lmm function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of Lmm according to the confidence level.

Usage

LmmEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
            ReferenceType = "", Global = FALSE,
            verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

ReferenceType

One of the point types. Others are ignored. Default is all point types.

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Examples

data(paracou16)
# Keep only 20% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.2))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Calculate confidence envelope (should be 1000 simulations, reduced to 4 to save time)
r <- seq(0, 30, 2)
NumberOfSimulations <- 4
Alpha <- .10
autoplot(LmmEnvelope(X, r, NumberOfSimulations, Alpha))

Estimation of the Lmm function

Description

Estimates the Lmm function

Usage

Lmmhat(X, r = NULL, ReferenceType = "", CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

ReferenceType

One of the point types. Others are ignored. Default is all point types.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Lmm is the normalized version of Kmm: Lmm(r)=\sqrt{\frac{Kmm}{\pi}}-r.

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

References

Penttinen, A., Stoyan, D. and Henttonen, H. M. (1992). Marked Point Processes in Forest Statistics. Forest Science 38(4): 806-824.

Espa, G., Giuliani, D. and Arbia, G. (2010). Weighting Ripley's K-function to account for the firm dimension in the analysis of spatial concentration. Discussion Papers, 12/2010. Universita di Trento, Trento: 26.

Examples

data(paracou16)
# Keep only 50% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.5))
autoplot(X, 
  labelSize = expression("Basal area (" ~cm^2~ ")"), 
  labelColor = "Species")

# Calculate Lmm
r <- seq(0, 30, 2)
(Paracou <- Lmmhat(X, r))

# Plot
autoplot(Paracou)

Estimation of the confidence envelope of the M function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of M according to the confidence level.

Usage

MEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
          ReferenceType, NeighborType = ReferenceType,
          CaseControl = FALSE, SimulationType = "RandomLocation", Global = FALSE,
          verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object) or a Dtable object.

r

A vector of distances. If NULL, a default value is set: 32 unequally spaced values are used up to half the maximum distance between points d_m. The first value is 0, first steps are small (d_m/200) then incresase progressively up to d_m/20.

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

ReferenceType

One of the point types.

NeighborType

One of the point types, equal to the reference type by default to caculate univariate M.

CaseControl

Logical; if TRUE, the case-control version of M is computed. ReferenceType points are cases, NeighborType points are controls.

SimulationType

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Examples

data(paracou16)
# Keep only 50% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.5))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Calculate confidence envelope (should be 1000 simulations, reduced to 4 to save time)
NumberOfSimulations <- 4
Alpha <- .10
autoplot(MEnvelope(X, , NumberOfSimulations, Alpha,
    "V. Americana", "Q. Rosea", FALSE, "RandomLabeling"))

Estimation of the M function

Description

Estimates the M function

Usage

Mhat(X, r = NULL, ReferenceType, NeighborType = ReferenceType,
    CaseControl = FALSE, Individual = FALSE, CheckArguments = TRUE)

Arguments

X

A weighted, marked planar point pattern (wmppp.object) or a Dtable object.

r

A vector of distances. If NULL, a default value is set: 64 unequally spaced values are used up to half the maximum distance between points d_m. The first value is 0, first steps are small (d_m/800) then increase progressively up to d_m/40.

ReferenceType

One of the point types.

NeighborType

One of the point types. By default, the same as reference type.

CaseControl

Logical; if TRUE, the case-control version of M is computed. ReferenceType points are cases, NeighborType points are controls.

Individual

Logical; if TRUE, values of the function around each individual point are returned.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

M is a weighted, cumulative, relative measure of a point pattern structure. Its value at any distance is the ratio of neighbors of the NeighborType to all points around ReferenceType points, normalized by its value over the windows.

If CaseControl is TRUE, then ReferenceType points are cases and NeighborType points are controls. The univariate concentration of cases is calculated as if NeighborType was equal to ReferenceType, but only controls are considered when counting all points around cases (Marcon et al., 2012). This makes sense when the sampling design is such that all points of ReferenceType (the cases) but only a sample of the other points (the controls) are recorded. Then, the whole distribution of points is better represented by the controls alone.

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

If Individual is set to TRUE, the object also contains the value of the function around each individual ReferenceType point taken as the only reference point. The column names of the fv are "M_" followed by the point names, i.e. the row names of the marks of the point pattern.

References

Marcon, E. and Puech, F. (2010). Measures of the Geographic Concentration of Industries: Improving Distance-Based Methods. Journal of Economic Geography 10(5): 745-762.

Marcon, E., F. Puech and S. Traissac (2012). Characterizing the relative spatial structure of point patterns. International Journal of Ecology 2012(Article ID 619281): 11.

Marcon, E., and Puech, F. (2017). A Typology of Distance-Based Measures of Spatial Concentration. Regional Science and Urban Economics 62:56-67

Examples

data(paracou16)
autoplot(paracou16)

# Calculate M
autoplot(Mhat(paracou16, , "V. Americana", "Q. Rosea"))

Spatial smoothing of individual dbmss's

Description

Performs spatial smoothing of the individual values of distance-based measures computed in the neighborhood of each point (Marcon and Puech, 2023).

Usage

  ## S3 method for class 'wmppp'
 Smooth(X, fvind, distance = NULL, Quantiles = FALSE, 
      sigma = bw.scott(X, isotropic = TRUE), Weighted = TRUE, Adjust = 1, 
      Nbx = 128, Nby = 128, ..., CheckArguments = TRUE)

Arguments

X

A point pattern (wmppp.object).

fvind

An object of class fv, see fv.object, obtained a distance-based method, such as Mhat with individual values (argument Individual = TRUE).

distance

The distance at which the function value must be considered. The default value is the median distance used to calculate the function values.

Quantiles

If FALSE (default), the dbmss is smoothed to produce a map of the measure. If TRUE, its quantiles (computed by Mhat with argument Quantiles = TRUE) are smoothed to produce a map of the confidence level of the measure.

Weighted

If TRUE (default), the point weights are taken into account for smoothing.

sigma

The bandwidth used for smoothing. A Gaussian kernel is used (see Smooth.ppp). Its bandwidth is chosen by default according to Scott's rule (see bw.scott).

Adjust

Force the selected bandwidth (sigma) to be multiplied by Adjust. Setting it to values smaller than one (1/2 for example) will sharpen the estimation.

Nbx, Nby

The number of columns and rows (pixels) of the resulting map, 128 by default. Increase it for quality, paid by increasing computing time.

...

Extra arguments, passed to Smooth.ppp.

CheckArguments

If TRUE (default), the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Value

An image that can be plotted. If quantiles have been computed in fvind, attributes "High" and "Low" contain logical vectors to indentify significantly high and low quantiles.

References

Marcon, E. and Puech, F. (2023). Mapping distributions in non-homogeneous space with distance-based methods. Journal of Spatial Econometrics 4(1), 13.

Examples

  ReferenceType <- "V. Americana"
  NeighborType <- "Q. Rosea"
  # Calculate individual intertype M(distance) values
  fvind <- Mhat(paracou16, r=c(0, 30), ReferenceType, NeighborType, Individual=TRUE)
  # Plot the point pattern with values of M(30 meters)
  p16_map <- Smooth(paracou16, fvind, distance=30)
  plot(p16_map, main = "")
  # Add the reference points to the plot
  is.ReferenceType <- marks(paracou16)$PointType == ReferenceType
  points(x=paracou16$x[is.ReferenceType], y=paracou16$y[is.ReferenceType], pch=20)
  # Add contour lines
  contour(p16_map, nlevels = 5, add = TRUE)

Converts data to class Dtable

Description

Creates an object of class "Dtable" representing a set of points with weights and labels and the distances between them.. This is a generic method.

Usage

as.Dtable(X, ...)
  ## S3 method for class 'ppp'
as.Dtable(X, ...)
  ## S3 method for class 'data.frame'
as.Dtable(X, ...)

Arguments

X

Data to be converted into a "Dtable".

...

Extra arguments.

Details

This is a generic method, implemented for ppp and data.frame.

Data is first converted to a (wmppp.object). Then, the distance matrix between points is calculated and the marks are kept.

Value

An object of class "Dtable".

Converts data to class wmppp

Description

Creates a Weighted, Marked, Planar Point Pattern, i.e. an object of class "wmppp" representing a two-dimensional point pattern with weights and labels. This is a generic method.

Usage

as.wmppp(X, ...)
  ## S3 method for class 'ppp'
as.wmppp(X, ...)
  ## S3 method for class 'data.frame'
as.wmppp(X, window = NULL, unitname = NULL, ...)

Arguments

X

Data to be converted into a weighted, marked, planar point pattern (wmppp.object)

window

An object of calls "owin" (owin.object).

unitname

Name of unit of length. Either a single character string, or a vector of two character strings giving the singular and plural forms, respectively.

...

Extra arguments.

Details

This is a generic method, implemented for ppp and data.frame:

If the dataset X is an object of class "ppp" (ppp.object), the marks are converted to point weights if they are numeric or to point types if they are factors. Default weights are set to 1, default types to "All". If marks are a dataframe with column names equal to PointType and PointWeight, they are not modified. Row names of the dataframe are preserved as row names of the marks, to identify points.
If the dataset X is a dataframe, see wmppp.

Value

An object of class "wmppp".

ggplot methods to plot dbmss objects

Description

S3 methods for the autoplot generic.

Usage

  ## S3 method for class 'envelope'
autoplot(object, fmla, ..., ObsColor = "black",
        H0Color = "red", ShadeColor = "grey75", alpha = 0.3, main = NULL,
        xlab = NULL, ylab = NULL, LegendLabels = NULL)
  ## S3 method for class 'fv'
autoplot(object, fmla, ..., ObsColor = "black",
        H0Color = "red", ShadeColor = "grey75", alpha = 0.3, main = NULL,
        xlab = NULL, ylab = NULL, LegendLabels = NULL)
  ## S3 method for class 'wmppp'
autoplot(object, ..., show.window = TRUE,
        MaxPointTypes = 6, Other = "Other",
        main = NULL, xlab = NULL, ylab = NULL, LegendLabels = NULL,
        labelSize = "Weight", labelColor = "Type", palette="Set1",
        windowColor = "black", windowFill = "transparent", alpha = 1)

Arguments

object

An object to be plotted.

fmla

An R language formula determining which variables or expressions are plotted. Either a formula object, or a string that can be parsed as a formula. See plot.fv.

...

Extra arguments, currently unused.

ObsColor

The color of the line representing observed values of the function.

H0Color

The color of the line representing the null hypothesis values of the function.

ShadeColor

The color of the confidence envelope.

alpha

The opacity of the confidence envelope (in function values) or the points (in maps), between 0 and 1.

main

The title of the plot.

xlab, ylab

The axes labels.

LegendLabels

A vector of characters. The first two items describe the observed and null-hypothesis curves, the third and last item the confidence interval. To be used only in plots with two curves (typically observed and expected values). The default is 'NULL' to display the full description of functions.

show.window

if 'TRUE', the borders of the window containing the points are shown on the point map.

MaxPointTypes

The maximum number of different point types to show. If the point set contains more of them, the less frequent ones are gathered as "Other". This number must be limited for readability and not to exceed the number of colors offered by the palette.

Other

The name of the point types gathered as "Other".

labelSize

The guide of the point size legend in point maps, i.e. what the 'PointSize' mark represents.

labelColor

The guide of the point color legend in point maps, i.e. what the 'PointType' mark represents.

palette

The color palette used to display point types in maps. See scale_color_brewer

windowColor

The color used to draw the limits of the windows in point maps.

windowFill

The color used to fill the windows in point maps.

Details

Plots of 'wmppp' objects are a single representation of both point types and point weights. Rectangular and polygonal windows (see owin.object) are supported but mask windows are ignored (use the 'plot' method if necessary).

Value

A ggplot object.

Author(s)

Eric Marcon <Eric.Marcon@agroparistech.fr>, parts of the code from spatstat.explore::plot.fv.

Examples

data(paracou16)
# Keep only 20% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.2))
autoplot(X)

# Plot the envelope (should be 1000 simulations, reduced to 20 to save time)
autoplot(KdEnvelope(X, ReferenceType="Q. Rosea", NumberOfSimulations=20))

# With a formula and a compact legend
autoplot(KEnvelope(X, NumberOfSimulations=20),
    ./(pi*r^2) ~ r,
    LegendLabels=c("Observed", "Expected", "Confidence\n enveloppe"))

Class of envelope of function values (fv)

Description

A class "dbmssEnvelope", i.e. a particular type of see envelope to represent several estimates of the same function and its confidence envelope.

Details

"dbmssEnvelope" objects are similar to envelope objects. The differences are that the risk level is chosen (instead of the simulation rank to use as the envelope), so the rank is calculated (interpolation is used if necessary), and a global envelope can be calculated following Duranton and Overman (2005).

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106

Computes simulation envelopes of a summary function.

Description

Prints a useful summary of a confidence envelope of class "dbmssEnvelope"

Usage

## S3 method for class 'Dtable'
envelope(Y, fun = Kest, nsim = 99, nrank = 1, ..., 
          funargs = list(), funYargs = funargs, simulate = NULL, 
          verbose = TRUE, savefuns = FALSE, Yname = NULL, envir.simul = NULL)

Arguments

Y

An object of class Dtable.

fun

Function that computes the desired summary statistic for Y.

nsim

Number of simulated point patterns to be generated when computing the envelopes.

nrank

Integer. Rank of the envelope value amongst the nsim simulated values. A rank of 1 means that the minimum and maximum simulated values will be used.

...

Extra arguments passed to fun.

funargs

A list, containing extra arguments to be passed to fun.

funYargs

Optional. A list, containing extra arguments to be passed to fun when applied to the original data Y only.

simulate

Optional. Specifies how to generate the simulated point patterns.

verbose

Logical flag indicating whether to print progress reports during the simulations.

savefuns

Logical flag indicating whether to save all the simulated function values.

Yname

Character string that should be used as the name of the data Y when printing or plotting the results.

envir.simul

Environment in which to evaluate the expression simulate, if not the current environment.

Details

This is the S3 method envelope for Dtable objects.

Author(s)

Eric Marcon <Eric.Marcon@agroparistech.fr>. Relies on the envelope engine of spatstat.

Estimation of the confidence envelope of the g function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of g according to the confidence level.

Usage

gEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
         ReferenceType = "", NeighborType = "",
         SimulationType = "RandomPosition", Precision = 0, Global = FALSE,
         verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

ReferenceType

One of the point types. Default is all point types.

NeighborType

One of the point types. Default is all point types.

SimulationType

Precision

Accuracy of point coordinates, measured as a part of distance unit. See rRandomPositionK. Default is 0 for no approximation.

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Examples

data(paracou16)
# Keep only 20% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.2))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Calculate confidence envelope (should be 1000 simulations, reduced to 10 to save time)
r <- 0:40
NumberOfSimulations <- 10
# Plot the envelope
autoplot(gEnvelope(X, r, NumberOfSimulations))

Estimation of the g function

Description

Estimates the g function

Usage

ghat(X, r = NULL, ReferenceType = "", NeighborType = "", CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

r

A vector of distances. If NULL, a sensible default value is chosen (512 intervals, from 0 to half the diameter of the window) following spatstat.

ReferenceType

One of the point types. Default is all point types.

NeighborType

One of the point types. Default is all point types.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

The computation of ghat relies on spatstat function sewpcf. The kernel estimation of the number of neighbors follows Stoyan and Stoyan (1994, pages 284–285).

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

References

Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.

Examples

data(paracou16)
autoplot(paracou16)

# Calculate g
r <- 0:30
(Paracou <- ghat(paracou16, r, "Q. Rosea", "V. Americana"))

# Plot
autoplot(Paracou)

Test whether an object is a weighted, marked, planar point pattern

Description

Check whether its argument is an object of class "wmppp" (wmppp.object).

Usage

is.wmppp(X)

Arguments

X

Any object

Value

TRUE if X is a weighted, marked, planar point pattern, otherwise FALSE.

Estimation of the confidence envelope of the m function under its null hypothesis

Description

Simulates point patterns according to the null hypothesis and returns the envelope of m according to the confidence level.

Usage

mEnvelope(X, r = NULL, NumberOfSimulations = 100, Alpha = 0.05,
          ReferenceType, NeighborType = ReferenceType, CaseControl = FALSE,
          Original = TRUE, Approximate = ifelse(X$n < 10000, 0, 1), Adjust = 1,
          MaxRange = "ThirdW", SimulationType = "RandomLocation", Global = FALSE,
          verbose = interactive(), parallel = FALSE, parallel_pgb_refresh = 1/10)

Arguments

X

A point pattern (wmppp.object).

r

A vector of distances. If NULL, a default value is set: 512 equally spaced values are used up to the median distance between points (following Duranton and Overman, 2005).

NumberOfSimulations

The number of simulations to run, 100 by default.

Alpha

The risk level, 5% by default.

ReferenceType

One of the point types.

NeighborType

One of the point types, equal to the reference type by default to caculate univariate M.

CaseControl

Logical; if TRUE, the case-control version of M is computed. ReferenceType points are cases, NeighborType points are controls.

Original

Approximate

Adjust

Force the automatically selected bandwidth (following Original) to be multiplied by Adjust. Setting it to values lower than one (1/2 for example) will sharpen the estimation.

MaxRange

SimulationType

Global

Logical; if TRUE, a global envelope sensu Duranton and Overman (2005) is calculated.

verbose

Logical; if TRUE, print progress reports during the simulations.

parallel

Logical; if TRUE, simulations can be run in parallel, see details.

parallel_pgb_refresh

The proportion of simulations steps to be displayed by the parallel progress bar. 1 will show all but may slow down the computing, 1/100 only one out of a hundred.

Details

This envelope is local by default, that is to say it is computed separately at each distance. See Loosmore and Ford (2006) for a discussion.

Value

An envelope object (envelope). There are methods for print and plot for this class.

The fv contains the observed value of the function, its average simulated value and the confidence envelope.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Kenkel, N. C. (1988). Pattern of Self-Thinning in Jack Pine: Testing the Random Mortality Hypothesis. Ecology 69(4): 1017-1024.

Lang G., Marcon E. and Puech F. (2014) Distance-Based Measures of Spatial Concentration: Introducing a Relative Density Function. HAL 01082178, 1-18.

Loosmore, N. B. and Ford, E. D. (2006). Statistical inference using the G or K point pattern spatial statistics. Ecology 87(8): 1925-1931.

Marcon, E. and F. Puech (2017). A typology of distance-based measures of spatial concentration. Regional Science and Urban Economics. 62:56-67.

Scholl, T. and Brenner, T. (2015) Optimizing distance-based methods for large data sets, Journal of Geographical Systems 17(4): 333-351.

Silverman, B. W. (1986). Density estimation for statistics and data analysis. Chapman and Hall, London.

Examples

data(paracou16)
# Keep only 50% of points to run this example
X <- as.wmppp(rthin(paracou16, 0.5))
autoplot(X,
  labelSize = expression("Basal area (" ~cm^2~ ")"),
  labelColor = "Species")

# Calculate confidence envelope (should be 1000 simulations, reduced to 4 to save time)
NumberOfSimulations <- 4
Alpha <- .10
autoplot(mEnvelope(X, , NumberOfSimulations, Alpha,
    "V. Americana", "Q. Rosea", Original = FALSE, SimulationType = "RandomLabeling"))

marks method for Dtable and wmppp objects

Description

S3 methods for the marks generic.

Usage

  ## S3 method for class 'Dtable'
marks(x, ...)
  ## S3 replacement method for class 'Dtable'
marks(x, ...) <- value
  ## S3 replacement method for class 'wmppp'
marks(x, ..., dfok = TRUE, drop = TRUE) <- value

Arguments

x

A Dtable or wmppp.object object.

...

Extra arguments, currently unused.

value

The value to set.

dfok

Ignored.

drop

Ignored.

Details

These functions extract or modify the marks of a Dtable.

'marks<-.wmppp()' just calls 'marks<-.ppp()' and keeps the class of the wmppp object. The conformity of the marks with the definition of the class "wmppp", i.e. a dataframe with columns "PointType" and "PointWeight" of the same length as the number of points, is not checked.

Value

A dataframe with columns "PointType" and "PointWeight".

Author(s)

Eric Marcon <Eric.Marcon@agroparistech.fr>

Examples

# A Dtable containing two points
Dmatrix <- matrix(c(0,1,1,0), nrow=2)
PointType <- c("Type1", "Type2")
PointWeight <- c(2,3)
X <- Dtable(Dmatrix, PointType, PointWeight)
# Extract the marks
marks(X)

Estimation of the m function

Description

Estimates the m function

Usage

mhat(X, r = NULL, ReferenceType, NeighborType = ReferenceType,
    CaseControl = FALSE, Original = TRUE, Approximate = ifelse(X$n < 10000, 0, 1),
    Adjust = 1, MaxRange = "ThirdW", Individual = FALSE, CheckArguments = TRUE)

Arguments

X

A weighted, marked planar point pattern (wmppp.object) or a Dtable object.

r

A vector of distances. If NULL, a default value is set: 512 equally spaced values are used, from the smallest distance to the range defined by MaxRange. the between points to half the diameter of the window.

ReferenceType

One of the point types.

NeighborType

One of the point types. By default, the same as reference type.

CaseControl

Logical; if TRUE, the case-control version of M is computed. ReferenceType points are cases, NeighborType points are controls.

Original

Approximate

Adjust

Force the automatically selected bandwidth (following Original) to be multiplied by Adjust. Setting it to values lower than one (1/2 for example) will sharpen the estimation.

MaxRange

The maximum value of r to consider, ignored if r is not NULL. Default is "ThirdW", one third of the diameter of the window. Other choices are "HalfW", and "QuarterW" and "D02005". "HalfW", and "QuarterW" are for half or the quarter of the diameter of the window. "D02005" is for the median distance observed between points, following Duranton and Overman (2005). "ThirdW" should be close to "DO2005" but has the advantage to be independent of the point types chosen as ReferenceType and NeighborType, to simplify comparisons between different types. "D02005" is approximated by "ThirdW" if Approximate is not 0. If X is a Dtable object, the diameter of the window is taken as the max distance between points.

Individual

Logical; if TRUE, values of the function around each individual point are returned.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

m is a weighted, density, relative measure of a point pattern structure (Lang et al., 2014). Its value at any distance is the ratio of neighbors of the NeighborType to all points around ReferenceType points, normalized by its value over the windows.

The number of neighbors at each distance is estimated by a Gaussian kernel whose bandwith is chosen optimally according to Silverman (1986: eq 3.31). It can be sharpened or smoothed by multiplying it by Adjust. The bandwidth of Sheather and Jones (1991) would be better but it is very slow to calculate for large point patterns and it sometimes fails. It is often sharper than that of Silverman.

Value

An object of class fv, see fv.object, which can be plotted directly using plot.fv.

If Individual is set to TRUE, the object also contains the value of the function around each individual ReferenceType point taken as the only reference point. The column names of the fv are "m_" followed by the point names, i.e. the row names of the marks of the point pattern.

Note

Estimating m relies on calculating distances, exactly or approximately (if Approximate is not 0). Then distances are smoothed by estimating their probability density. In contrast with Kdhat, reflection is not used to estimate density close to the lowest distance. The same kernel estimation is applied to the distances from reference points of neighbor points and of all points. Since m is a relative function, a ratio of densities is calculated, that makes the features of the estimation vanish.

Density estimation heavily relies on the bandwith. Starting from version 2.7, the optimal bandwith is computed from the distribution of distances between pairs of points up to twice the maximum distance considered. The consequence is that choosing a smaller range of distances in argument r results in less smoothed m values. The default values (r = NULL, MaxRange = "ThirdW") are such that almost all the pairs of points (except those more than 2/3 of the window diameter apart) are taken into account to determine the bandwith.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Lang G., Marcon E. and Puech F. (2014) Distance-Based Measures of Spatial Concentration: Introducing a Relative Density Function. HAL 01082178, 1-18.

Marcon, E., F. Puech and S. Traissac (2012). Characterizing the relative spatial structure of point patterns. International Journal of Ecology 2012(Article ID 619281): 11.

Scholl, T. and Brenner, T. (2015) Optimizing distance-based methods for large data sets, Journal of Geographical Systems 17(4): 333-351.

Sheather, S. J. and Jones, M. C. (1991) A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society series B, 53, 683-690.

Silverman, B. W. (1986). Density estimation for statistics and data analysis. Chapman and Hall, London.

Examples

data(paracou16)
autoplot(paracou16)

# Calculate M
autoplot(mhat(paracou16, , "V. Americana", "Q. Rosea"))

Paracou field station plot 16, partial map

Description

This point pattern is from Paracou field station, French Guiana, managed by Cirad.

Usage

data(paracou16)

Format

An object of class ppp.object representing the point pattern of tree locations in a 250 x 300 meter sampling region. Each tree is marked with its species ("Q. Rosea", "V. Americana" or "Other"), and basal area (square centimeters).

Source

Permanent data census of Paracou and Marcon et al. (2012).

References

Gourlet-Fleury, S., Guehl, J. M. and Laroussinie, O., Eds. (2004). Ecology & management of a neotropical rainforest. Lessons drawn from Paracou, a long-term experimental research site in French Guiana. Paris, Elsevier.

Marcon, E., F. Puech and S. Traissac (2012). Characterizing the relative spatial structure of point patterns. International Journal of Ecology 2012(Article ID 619281): 11.

Examples

data(paracou16)
# Plot (second column of marks is Point Types) 
autoplot(paracou16, which.marks=2, leg.side="right")

Auxiliary functions to count point neighbors

Description

C++ routines used for fast count of neighbors.

Usage

parallelCountNbd(r, x, y, Weight, IsReferenceType, IsNeighborType)
parallelCountNbdCC(r, x, y, Weight, IsReferenceType, IsNeighborType)
parallelCountNbdm(x, y, ReferencePoints)
parallelCountNbdDt(r, Dmatrix, Weight, IsReferenceType, IsNeighborType)
parallelCountNbdDtCC(r, Dmatrix, Weight, IsReferenceType, IsNeighborType)
DistKd(x, y, PointWeight, Weight, Dist, IsReferenceType, IsNeighborType)
CountNbdKd(r, x, y, Weight, Nbd, IsReferenceType, IsNeighborType)

Arguments

r

A vector of distances.

x

A vector of x coordinates.

y

A vector of y coordinates.

Dmatrix

A matrix of distances.

PointWeight

A vector of point weights.

Weight

A vector of point weights.

ReferencePoints

A vector of pont indices.

Nbd

A matrix to store the weight of neighbors.

Dist

A vector of point-pair distances.

IsReferenceType

A boolean vector to recognize reference points.

IsNeighborType

A boolean vector to recognize neighbor points.

Details

These routines are called internally.

Print a confidence envelope

Description

Prints useful information of a confidence envelope of class "dbmssEnvelope"

Usage

## S3 method for class 'dbmssEnvelope'
print(x, ...)

Arguments

x

An object of class "dbmssEnvelope".

...

Ignored.

Details

"dbmssEnvelope" objects are similar to envelope objects. The way they are printed is different to take into account the possibility of building global envelope following Duranton and Overman (2005): the global envelope is calculated by iteration: the simulations reaching one of the upper or lower values at any distance are eliminated at each step. The process is repeated until Alpha / Number of simulations simulations are dropped. The remaining upper and lower bounds at all distances constitute the global envelope. Interpolation is used if the exact ratio cannot be reached.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106

Examples

data(paracou16)
autoplot(paracou16)

# Calculate intertype K envelope
Envelope <- KEnvelope(paracou16, NumberOfSimulations = 20, Global = TRUE,
  ReferenceType = "V. Americana", NeighborType = "Q. Rosea")
autoplot(Envelope)
# print
print(Envelope)

Simulations of a point pattern according to the null hypothesis of population independence defined for K

Description

Simulates of a point pattern according to the null hypothesis of population independence defined for K.

Usage

rPopulationIndependenceK(X, ReferenceType, NeighborType, CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

ReferenceType

One of the point types.

NeighborType

One of the point types.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Reference points are kept unchanged, neighbor type point positions are shifted by rshift. Other points are lost and point weights are not kept (they are set to 1) since the K function ignores them.

Value

A new weighted, marked, planar point pattern (an object of class wmppp, see wmppp.object).

References

Goreaud, F. et Pelissier, R. (2003). Avoiding misinterpretation of biotic interactions with the intertype K12 fonction: population independence vs random labelling hypotheses. Journal of Vegetation Science 14(5): 681-692.

Examples

# Simulate a point pattern with three types
X <- rpoispp(50) 
PointType   <- sample(c("A", "B", "C"), X$n, replace=TRUE)
PointWeight <- runif(X$n, min=1, max=10)
marks(X) <- data.frame(PointType, PointWeight)
X <- as.wmppp(X)

# Plot the point pattern, using PointType as marks
autoplot(X, main="Original pattern")

# Randomize it
Y <- rPopulationIndependenceK(X, "A", "B")
# Points of type "A" are unchanged, points of type "B" have been moved altogether
# Other points are lost and point weights are set to 1
autoplot(Y, main="Randomized pattern")

Simulations of a point pattern according to the null hypothesis of population independence defined for M

Description

Simulates of a point pattern according to the null hypothesis of population independence defined for M

Usage

rPopulationIndependenceM(X, ReferenceType, CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

ReferenceType

One of the point types.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Reference points are kept unchanged, other points are redistributed randomly across locations.

Value

A new weighted, marked, planar point pattern (an object of class wmppp, see wmppp.object).

References

Marcon, E. and Puech, F. (2010). Measures of the Geographic Concentration of Industries: Improving Distance-Based Methods. Journal of Economic Geography 10(5): 745-762.

Marcon, E., F. Puech and S. Traissac (2012). Characterizing the relative spatial structure of point patterns. International Journal of Ecology 2012(Article ID 619281): 11.

Examples

# Simulate a point pattern with five types
X <- rpoispp(50) 
PointType   <- sample(c("A", "B", "C", "D", "E"), X$n, replace=TRUE)
PointWeight <- runif(X$n, min=1, max=10)
marks(X) <- data.frame(PointType, PointWeight)
X <- as.wmppp(X)


autoplot(X, main="Original pattern")

# Randomize it
Y <- rPopulationIndependenceM(X, "A")
# Points of type "A" are unchanged, 
# all other points have been redistributed randomly across locations
autoplot(Y, main="Randomized pattern")

Simulations of a point pattern according to the null hypothesis of random labeling

Description

Simulates of a point pattern according to the null hypothesis of random labeling.

Usage

rRandomLabeling(X, CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Marks are redistributed randomly across the original point pattern.

Value

A new weighted, marked, planar point pattern (an object of class wmppp, see wmppp.object).

References

Examples

# Simulate a point pattern with five types
X <- rpoispp(50) 
PointType   <- sample(c("A", "B", "C", "D", "E"), X$n, replace=TRUE)
PointWeight <- runif(X$n, min=1, max=10)
marks(X) <- data.frame(PointType, PointWeight)
X <- as.wmppp(X)

autoplot(X, main="Original pattern")

# Randomize it
Y <- rRandomLabeling(X)
# Types and weights have been redistributed randomly across locations
autoplot(Y, main="Randomized pattern")

Simulations of a point pattern according to the null hypothesis of random labelling defined for M

Description

Simulates of a point pattern according to the null hypothesis of random labelling defined for M

Usage

rRandomLabelingM(X, CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object) or a Dtable object.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Point types are randomized. Locations and weights are kept unchanged. If both types and weights must be randomized together (Duranton and Overman, 2005; Marcon and Puech, 2010), use rRandomLocation.

Value

A new weighted, marked, planar point pattern (an object of class wmppp, see wmppp.object).

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Marcon, E. and Puech, F. (2010). Measures of the Geographic Concentration of Industries: Improving Distance-Based Methods. Journal of Economic Geography 10(5): 745-762.

Marcon, E., F. Puech and S. Traissac (2012). Characterizing the relative spatial structure of point patterns. International Journal of Ecology 2012(Article ID 619281): 11.

Examples

# Simulate a point pattern with five types
X <- rpoispp(50) 
PointType   <- sample(c("A", "B", "C", "D", "E"), X$n, replace=TRUE)
PointWeight <- runif(X$n, min=1, max=10)
marks(X) <- data.frame(PointType, PointWeight)
X <- as.wmppp(X)

autoplot(X, main="Original pattern")

# Randomize it
Y <- rRandomLabelingM(X)
# Labels have been redistributed randomly across locations
# But weights are unchanged
autoplot(Y, main="Randomized pattern")

Simulations of a point pattern according to the null hypothesis of random location

Description

Simulates of a point pattern according to the null hypothesis of random location.

Usage

rRandomLocation(X, ReferenceType = "", CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

ReferenceType

One of the point types.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Points are redistributed randomly across the locations of the original point pattern. This randomization is equivalent to random labeling, considering the label is both point type and point weight. If ReferenceType is specified, then only reference type points are kept in the orginal point pattern before randomization.

Value

A new weighted, marked, planar point pattern (an object of class wmppp, see wmppp.object).

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106.

Marcon, E. and Puech, F. (2010). Measures of the Geographic Concentration of Industries: Improving Distance-Based Methods. Journal of Economic Geography 10(5): 745-762.

Examples

# Simulate a point pattern with five types
X <- rpoispp(50) 
PointType   <- sample(c("A", "B", "C", "D", "E"), X$n, replace=TRUE)
PointWeight <- runif(X$n, min=1, max=10)
marks(X) <- data.frame(PointType, PointWeight)
X <- as.wmppp(X)

autoplot(X, main="Original pattern")

# Randomize it
Y <- rRandomLocation(X)
# Points have been redistributed randomly across locations
autoplot(Y, main="Randomized pattern")

Simulations of a point pattern according to the null hypothesis of random position defined for K

Description

Simulations of a point pattern according to the null hypothesis of random position defined for K.

Usage

rRandomPositionK(X, Precision = 0, CheckArguments = TRUE)

Arguments

X

A weighted, marked, planar point pattern (wmppp.object).

Precision

Accuracy of point coordinates, measured as a part of distance unit. See notes. Default is 0 for no approximation.

CheckArguments

Logical; if TRUE, the function arguments are verified. Should be set to FALSE to save time in simulations for example, when the arguments have been checked elsewhere.

Details

Points marks are kept unchanged and their position is drawn in a binomial process by runifpoint.

Value

A new weighted, marked, planar point pattern (an object of class wmppp, see wmppp.object).

Note

Simulations in a binomial process keeps the same number of points, so that marks can be redistributed. If a real CSR simulation is needed and marks are useless, use rpoispp.

Actual data coordinates are often rounded. Use the Precision argument to simulate point patterns with the same rounding procedure. For example, if point coordinates are in meters and rounded to the nearest half meter, use Precision = 0.5 so that the same approximation is applied to the simulated point patterns.

Examples

# Simulate a point pattern with two types
X <- rpoispp(5) 
PointType   <- sample(c("A", "B"), X$n, replace=TRUE)
PointWeight <- runif(X$n, min=1, max=10)
marks(X) <- data.frame(PointType, PointWeight)
X <- as.wmppp(X)

autoplot(X, main="Original pattern")

# Randomize it
Y <- rRandomPositionK(X)
# Points are randomly distributed
autoplot(Y, main="Randomized pattern")

Objects exported from other packages

Description

These objects are imported from other packages. Follow the links below to see their documentation.

ggplot2: autoplot

spatstat.geom: marks

Methods for weighted, marked planar point patterns (of class wmppp) from spatstat

Description

spatstat methods for a ppp.object applied to a wmppp.object.

Usage

## S3 method for class 'wmppp'
sharpen(X, ...)
## S3 method for class 'wmppp'
superimpose(...)
## S3 method for class 'wmppp'
unique(x, ...)
## S3 method for class 'wmppp'
i[j, drop=FALSE, ..., clip=FALSE]

Arguments

X, x

A two-dimensional point pattern. An object of class "wmppp".

...

Arguments passed to the ppp.object method.

i

Subset index. Either a valid subset index in the usual R sense, indicating which points should be retained, or a window (an object of class "owin") delineating a subset of the original observation window, or a pixel image with logical values defining a subset of the original observation window.

j

Redundant. Included for backward compatibility.

drop

Logical value indicating whether to remove unused levels of the marks, if the marks are a factor.

clip

Logical value indicating how to form the window of the resulting point pattern, when i is a window. If clip=FALSE (the default), the result has window equal to i. If clip=TRUE, the resulting window is the intersection between the window of x and the window i.

Details

spatstat methods for ppp objects returning a ppp object can be applied to a wmppp and return a wpppp with these methods which just call the ppp.object method and change the class of the result for convenience.

Some spatstat functions such as rthin are not generic so they always return a ppp.object when applied to a wmppp.object. Their result may be converted by as.wmppp.

Value

An object of class "wmppp".

Summary of a confidence envelope

Description

Prints a useful summary of a confidence envelope of class "dbmssEnvelope"

Usage

## S3 method for class 'dbmssEnvelope'
summary(object, ...)

Arguments

object

An object of class "dbmssEnvelope".

...

Ignored.

Details

"dbmssEnvelope" objects are similar to envelope objects. Their summary is different to take into account the possibility of building global envelope following Duranton and Overman (2005): the global envelope is calculated by iteration: the simulations reaching one of the upper or lower values at any distance are eliminated at each step. The process is repeated until Alpha / Number of simulations simulations are dropped. The remaining upper and lower bounds at all distances constitute the global envelope. Interpolation is used if the exact ratio cannot be reached.

References

Duranton, G. and Overman, H. G. (2005). Testing for Localisation Using Micro-Geographic Data. Review of Economic Studies 72(4): 1077-1106

Examples

data(paracou16)
autoplot(paracou16)

# Calculate intertype K envelope
Envelope <- KEnvelope(paracou16, NumberOfSimulations = 20, Global = TRUE,
  ReferenceType = "V. Americana", NeighborType = "Q. Rosea")
autoplot(Envelope)
summary(Envelope)

Create a Weighted, Marked, Planar Point Pattern

Description

Creates an object of class "wmppp" representing a two-dimensional point pattern with weights and labels.

Usage

wmppp(df, window = NULL, unitname = NULL)

Arguments

df

A dataframe with at least two columns containing point coordinates.

window

An object of calls "owin" (owin.object).

unitname

Name of unit of length. Either a single character string, or a vector of two character strings giving the singular and plural forms, respectively. Ignored if window is not NULL.

Details

Columns named "X", "Y", "PointType", "PointWeight" (capitalization is ignored) are searched to build the "wmppp" object and set the point coordinates, type and weight. If they are not found, columns are used in this order. If columns are missing, PointType is set to "All" and PointWeight to 1. If a "PointName" column is found, it is used to set the row names of the marks, else the original row names are used.

If the window is not specified, a rectangle containing all points is used, and unitname is used.

Value

An object of class "wmppp".

Examples

# Draw the coordinates of 10 points
X <- runif(10)
Y <- runif(10)
# Draw the point types.
PointType   <- sample(c("A", "B"), 10, replace=TRUE)
# Plot the point pattern. Weights are set to 1 ant the window is adjusted.
plot(wmppp(data.frame(X, Y, PointType)), , which.marks=2)

Class of Weighted, Marked, Planar Point Patterns

Description

A class "wmppp" to represent a two-dimensional point pattern of class ppp whose marks are a dataframe with two columns:

PointType: labels, as factors
PointWeight: weights.

Details

This class represents a two-dimensional point pattern dataset. wmppp objects are also of class ppp.

Objects of class wmppp may be created by the function wmppp and converted from other types of data by the function as.wmppp.

Examples

# Draw the coordinates of 10 points
X <- runif(10)
Y <- runif(10)
# Draw the point types and weights
PointType   <- sample(c("A", "B"), 10, replace=TRUE)
PointWeight <- runif(10)
# Build the point pattern
X <- wmppp(data.frame(X, Y, PointType, PointWeight), owin())

# Plot the point pattern. which.marks=1 for point weights, 2 for point types
par(mfrow=c(1,2))
plot(X, which.marks=1, main="Point weights")
plot(X, which.marks=2, main="Point types")

# Or use autoplot for a ggplot
autoplot(X)

Distance Based Measures of Spatial Structures

Description

Author(s)

References

Checks the arguments of a function of the package dbmss

Description

Usage

Details

Value

Estimation of the confidence envelope of the D function under its null hypothesis

Description

Usage

Arguments

Details

Value

References

See Also

Examples

Estimation of the D function

Description

Usage

Arguments

Details

Value

Note

References

See Also

Examples

Create a Distance table object.

Description

Usage

Arguments

Details

Value

See Also

Examples

Transform simulation values to an fv

Description

Usage

Arguments

Value

Estimation of the global confidence interval of simulations

Description

Usage

Arguments

Details

Value

References

Goodness of Fit test between a distance based measure of spatial structure and simulations of its null hypothesis

Description

Usage

Arguments

Details

Value

Note

References

See Also

Examples

Estimation of the confidence envelope of the K function under its null hypothesis

Description

Usage

Arguments

Details

Value

References

See Also

Examples

Estimation of the confidence envelope of the Kd function under its null hypothesis

Description

Usage

Arguments

Details

Value

References

See Also

Examples

Estimation of the Kd function

Description

Usage

Arguments