Review
Gaussian Markov Random Fields (GMRFs) and copulas are two powerful statistical tools, each offering unique strengths in modeling complex data structures. GMRFs excel in capturing spatial and temporal dependencies, particularly in fields such as environmental science, epidemiology, and image analysis (Havard Rue and Held 2005; Knorr-Held 2000; Håvard Rue, Martino, and Chopin 2009). Their ability to represent local dependencies through sparse precision matrices makes them computationally attractive for high-dimensional problems. Copulas, on the other hand, provide a flexible framework for modeling multivariate dependencies, allowing separate specification of marginal distributions and their joint behavior (Sklar 1959; Joe 1997; Nelsen 2006).
The Gaussian copula, in particular, has gained popularity due to its interpretability and connection to the multivariate normal distribution. However, combining GMRFs with copulas has historically been computationally challenging, limiting their joint application to smaller datasets or simpler models.
Let \(\mathbf{X} = (X_1, X_2, \ldots, X_n)\) be a multivariate random vector with marginal distribution functions \(F_i\) for \(i = 1, 2, \ldots, n\). The joint distribution function of \(\mathbf{X}\) can be written as:
\[
F_{\mathbf{X}}(\mathbf{x}) = C(F_1(x_1), F_2(x_2), \ldots, F_n(x_n)),
\]
where \(C\) is the Gaussian copula defined by the GMRF precision matrix \(\mathbf{Q}\). The Gaussian copula \(C\) is given by:
\[
C(u_1, u_2, \ldots, u_n) = \Phi_\mathbf{Q}(\Phi^{-1}(u_1), \Phi^{-1}(u_2), \ldots, \Phi^{-1}(u_n)),
\]
where \(\Phi_\mathbf{Q}\) is the joint cumulative distribution function of a multivariate normal distribution with mean vector \(\mathbf{0}\) and precision matrix \(\mathbf{Q}\), and \(\Phi^{-1}\) is the inverse of the standard normal cumulative distribution function.
A critical requirement for the precision matrix \(\mathbf{Q}\) governing the GMRF copula \(C\) is that \(\mathbf{\Sigma} = \mathbf{Q}^{-1}\) should have a unit diagonal, i.e. the marginal variance is equal to one everywhere. This ensures it operates on the same scale as the transformed data, \(\Phi^{-1}(u_i)\). However, this can be challenging as GMRFs are typically defined in terms of precision matrices that often imply non-unit marginal variances. While related scaling issues have been addressed in spatial statistics literature (Sørbye and Rue 2014; Riebler et al. 2016), their focus was on scaling for use in priors for the BYM2 model, aiming for a consistent interpretation of the precision parameter across different graph structures. In contrast, our work requires exact unit marginal variance at each point, a more stringent condition necessitated by the copula framework.
Similarly to Håvard Rue (2005), this paper proposes a way to efficiently calculate the marginal variances in GMRFs, but instead of working with the Cholesky decomposition of any general GMRF precision matrix, we work with a family of precision matrices that are similar to the sparse approximation to the Gaussian field with Matérn coveriance defined in Lindgren, Rue, and Lindström (2011). The precision matrices are defined as
\[
\mathbf{Q} = \left( \mathbf{Q}_{\rho_1} \otimes \mathbf{I_{n_2}} + \mathbf{I_{n_1}} \otimes \mathbf{Q}_{\rho_2}\right)^{\nu+1},
\]
where \(\mathbf{Q}_\rho\) is the precision matrix of a standardized one-dimensional AR(1) process with correlation \(\rho\), \(\nu\) is a smoothness parameter, and \(\otimes\) denotes the Kronecker product.
By focusing on this type of matrix, we can utilize known results on the eigendecomposition of \(\mathbf Q\) and how it relates directly to the eigendecompositions of \(\mathbf{Q}_{\rho_1}\) and \(\mathbf{Q}_{\rho_2}\). This lets us avoid explicit formation and inversion of the large precision matrix \(\mathbf{Q}\), making it particularly suitable for high-dimensional spatial data. In addition to the exact method, we show how the precision matrix can be approximated by a folded circulant matrix wich gives a large speed-up while preserving suitable boundary conditions (Kent and Mardia 2022; Mondal 2018; Besag and Mondal 2005).