## Introduction

In order to help summarize and integrate the results of the ever-growing number of neuroimaging studies, some groups have developed methods to conduct voxel-based meta-analyses solely relying on the information reported in the papers, namely the peaks of the clusters where there were statistically significant activations or where patients and controls showed statistically significant differences. Activation likelihood estimation (ALE) (1–4), (effect-size) signed differential mapping (ES-SDM) (5–7) and (multilevel) kernel density analysis (M-KDA) (8, 9) are commonly used methods that have already been applied to meta-analyze a wide range of normal brain functions (10–12) and abnormalities in neurological (13–15) and psychiatric disorders (16–18). As briefly introduced in Figure 1, these methods differ substantially in their algorithms [for a deeper review, see Ref. (19)], but one characteristic they share is that all convolve an isotropic kernel with the peak. In this context, a “kernel” is a small matrix convolved with the peaks in order that voxels surrounding a peak have a value similar to, but slightly lower than that of the peak. An “isotropic kernel” is identical in all directions. In simple terms, the effect size of a voxel close to a peak would only depend on the effect size of the peak and the Euclidian distance between the voxel and the peak (Figure 2). All voxels at 1 cm of a peak would have the same effect size, independently of whether they are in the same brain region or not.

**Figure 1. Main steps of activation likelihood estimation (ALE) and effect-size signed differential mapping (ES-SDM)**. ALE (left approach) aims to estimate the likelihood that a peak lies in any given voxel. To this end, it first applies a Gaussian kernel so that the likelihood is high in the voxel where the peak is reported and similar but slightly lower in the close voxels. Afterward, it calculates the probability of the union of the likelihoods estimated from the different peaks and studies. ES-SDM algorithms (middle and right approach) are different, as this method aims to estimate the effect size rather than the peak likelihood. However, the first step also consists in applying an (un-normalized) Gaussian kernel, this time to achieve that voxels around a reported peak have an estimated effect size which is similar but slightly smaller to that of the peak. Afterward, effect-sizes recreated from the different peaks of a study are combined using a weighted average, i.e., when a voxel is close to two peaks, it has an effect size that depends on both peaks. Finally, the effect size maps as well as their variance maps are introduced in a meta-analytic random-effects general linear model.

**Figure 2. Recreation of clusters using isotropic kernels in previous versions of effect-size signed differential mapping (ES-SDM) and activation likelihood estimation (ALE)**. Note that the recreation of the effect size (or the estimation of the activation likelihood) does *not* depend on the strength of the spatial correlations, but only on the Euclidean distance between each voxel and the peak.

However, such perfect spheres of effect size around the peak are probably implausible. Independently on the distance, voxels in the same brain region as the peak are more likely to be part of the cluster of significant activation or difference. Conversely, voxels in other brain regions, or separated from the peak by cerebrospinal fluid, are less likely to be part of the cluster.

Applying an isotropic kernel may thus underestimate the effect size of voxels in the same brain region as the peak, whereas it may overestimate the effect size of voxels from other brain regions. Some groups have recommended the use of large kernels, thus minimizing the underestimation of the effect size, though at the cost of a potential overestimation in some voxels (6, 20). Other developers have recommended the use of narrow kernels, thus minimizing the overestimation of the effect size, though at the cost of potential underestimation in some other voxels (4).

The aim of this study was to develop anisotropic kernels for coordinate-based meta-analyses, which would assign different values to the different neighboring voxels based on the spatial correlation between them. This was specifically developed for ES-SDM, although it might be easily implemented in other widely used meta-analytical programs such as ALE. The paper also includes the creation of new tissue-specific templates and a validation of the new method. We hypothesized that the recreation of effect size maps using anisotropic kernels would be more accurate than using isotropic kernels.

## Theory

Previous versions of ES-SDM adopted the ALE Gaussian kernel with the aim that, in the recreated statistical map, the voxels close to a peak have slightly smaller effect sizes than that of the peak, and progressively further voxels have progressively smaller effect sizes. Specifically, the effect size of a voxel close to a peak depended on the effect size of the peak and on the Euclidean distance between the voxel and the peak by means of an un-normalized Gaussian function:

where *d* is the effect size, *D* is the distance, and σ is the standard deviation of the kernel (approximately 0.425 of its full-width at half-maximum, FWHM).

The new method described here is based on the correlation between close voxels in the underlying structural image. Note that correlated voxels (e.g., individuals with much gray matter in one voxel tend to also have much gray matter in the other voxel) are more likely to be from the same brain region. The method consists of virtually deforming the distance so that highly correlated voxels are brought closer, while uncorrelated voxels are moved further away. A Gaussian kernel is then applied to the deformed space. When the original space is restored, highly correlated voxels are estimated to have larger effect sizes whereas uncorrelated voxels are estimated to have smaller or null effect sizes (Figure 3).

**Figure 3. Recreation of clusters using the anisotropic kernel in the updated version of effect-size signed differential mapping (ES-SDM)**. Note that the recreation of the effect size *does* depend on the strength of the spatial correlations, with the cluster being stretched toward voxels highly correlated with the peak.

### Deformation of the Space

Space is deformed to match the correlation of each voxel with its neighbors. To match distances and correlations, the expression of *d*_{voxel} in Eq. 1 is made equal the expression of *d*_{voxel} in Eq. 2, obtaining Eq. 3 from which *D* may be isolated:

where ρ is the coefficient of correlation between the voxel and the peak.

The distance between a peak and its adjacent voxels is thus deformed according to the Eq. 4, the only variables of which are the constant standard deviation of the kernel (or equivalently the FWHM) and the correlation between the two voxels. Figure 4 shows this correspondence between correlation and deformed space.

**Figure 4. Deformed distance between two adjacent voxels depending on the correlation between them**. Deformed distances in this example have been calculated for σ = 8.5 mm (FWHM = 20 mm). Note, however, that the recreation of effect size map does not indeed depend on FWHM when full anisotropy is used [see text, Eq. 2 and Figure 6].

To estimate the distance between the peak and a non-contiguous voxel *v*, the software must sum the distances between the pairs of contiguous voxels along the shortest path between the peak and the voxel *v*. However it is difficult to know, in the deformed space, which is the shortest path between two voxels. A path composed of 10 voxels may be shorter than a path composed of 6 voxels, if the sum of the distances between the nine pairs of contiguous voxels of the former is smaller than the sum of the distances between the five pairs of contiguous voxels of the latter. A Dijkstra’s algorithm (21) is used in the new method to find the shortest distance between the peak and each of its surrounding voxels. Specifically, the algorithm first calculates the distances between the peak (“initial node”) and each of its’ 26 adjacent voxels. Second, it calculates the distances between one of these adjacent voxels (“current node” in this step) and each its’ adjacent voxels. This step is repeated for each of the voxels adjacent to the peak. Third, it calculates the distances between each of the voxels adjacent to the voxels adjacent to the peak and their adjacent voxels. This is repeated until: (a) the total distance between the peak and a voxel following a path of voxels is not shorter to that previously calculated following another path; or (b) the total distance is longer than the FWHM – which would correspond to the effect size of the peak divided by 16, already negligible.

This algorithm does not restrict its calculations to voxels within a mask of, e.g., gray matter. This is important because neuroimaging studies not uncommonly show significant results outside the expected tissue, due to, e.g., registration mismatches during pre-processing. With this unrestricted spatial propagation: (a) peaks outside the selected mask may be also used in the recreation of the effect size; and (b) researchers can check whether peaks of the recreated map match with those reported in the manuscript – which may be outside the mask. However, spuriously strong correlations outside the tissue might potentially cause artifacts such as “bridges” between two separate brain regions. In order to avoid such artifacts, correlations in voxels with a tissue probability lower than 0.1 in the smoothed average (see Creation of correlation templates below) are decreased proportionally, e.g., are divided by 2 in voxels with a tissue probability of 0.1/2 = 0.05, and by 10 in voxels with a tissue probability of 0.1/10 = 0.01.

### Generalization to Variable Degrees of Anisotropy

Equation 2 can be generalized to:

where α is the degree of anisotropy and ρ_{isotropic} is a theoretical spatially constant correlation in the isotropic scenario. Note that α = 0 corresponds to the isotropic scenario, α = 1 to the fully anisotropic scenario, and 0 < α < 1 to variable degrees of anisotropic scenarios.

The theoretical spatially constant correlation in the isotropic scenario (ρ_{isotropic}) may be isolated from (3):

where *D*_{real} is the real Euclidean distance between the two voxels.

*D* may be again isolated following the same steps as outlined above:

### Subsequent Processing Steps

Once the deformed distances from a peak have been calculated, these are used by the Gaussian kernel to estimate the effect size of the voxels surrounding a peak. Remaining ES-SDM steps have not been modified: (a) combination of the effect sizes of nearby peaks by means of a weighted average; (b) estimation of the variances associated to these effect sizes; and (c) combination of the effect sizes of the studies included in the meta-analysis by fitting random-effects general linear models. Step (a) is conducted throughout the whole volume for diagnostic purposes, but voxels outside the tissue mask are subsequently discarded.

In ALE, the deformed distances could be used by the Gaussian kernel to estimate the likelihood of a peak, and remaining ALE steps (e.g., the estimation of the probability of the union) would not need to be modified.

## Creation of Correlation Templates

In order to apply the method described above, we needed to create correlation templates for gray matter, white matter, cerebrospinal fluid and fractional anisotropy (FA).

Raw magnetic resonance imaging (MRI) data were obtained from the IXI dataset^{1}. This dataset includes nearly 600 MR images from normal, healthy subjects acquired in three different hospitals in London. In order to avoid scanner-related differences, we only used those MR images acquired at Hammersmith Hospital, where a Philips 3 T device was used. T1 parameters were as follows: repetition time = 9.6, echo time = 4.6, 208 phase encoding steps, echo train length = 208, reconstruction diameter = 240, acquisition matrix = 208 × 208, and flip angle = 8. Diffusion tensor imaging (DTI) parameters were as follows: repetition time = 11894, echo time = 51, two averages, 110 phase encoding steps, echo train length = 0, reconstruction diameter = 224, acquisition matrix = 112 × 110, and flip angle = 90.

After exclusion of individuals younger than 20 years or older than 80 years, the Hammersmith Hospital sample included 181 scans. A minimization script was used to select 120 of them in order to obtain 6 equal-sized demographic groups (20–40-year-old males, 40–60-year-old males, 60–80-year-old males, 20–40-year-old females, 40–60-year-old females, and 60–80-year-old females) and a relatively lower frequency of the over-represented white and university-educated individuals (73 and 52% respectively in the original sample, 60 and 40% in the selected sample). The same individuals were used to create the FA template, although DTI data were missing for four of them.

T1 scans were pre-processed following a standard voxel-based morphometry (VBM) algorithm with FSL^{2}
, with the exception that no study-specific template was estimated in order that the final images were exactly in Montreal Neurological Institute (MNI) space: brain-extraction (22), tissue segmentation (23), non-linear registration to MNI space, and smoothing (σ = 4mm; FWHM = 9.4 mm). DTI scans were also pre-processed following a standard voxel-based FA algorithm with FSL: brain-extraction, Eddy correction, FA estimation, linear registration to the T1 scans, non-linear registration to MNI space, and smoothing (σ = 3mm; FWHM = 7.1 mm). Non-linear registrations were based on the warp parameters estimated for the T1 gray matter segments.

Finally, individual values in each voxel were correlated with the individual values in its contiguous voxels using R (24). The gray matter density of a voxel *x* in the 120 individuals, for example, was correlated with the gray matter density of its right-contiguous voxel *y* in the same individuals, with *x* and *y* being variables with one value per subject. A strong correlation would indicate that individuals with more gray matter density in one of the voxels also had more gray matter density in the other voxel. For computational and memory purposes, only 13 correlations were calculated for each voxel, as the other complementary 13 correlation were indeed also calculated for the corresponding neighboring voxel, e.g., “correlation with the voxel at the left” had been already calculated when calculating the “correlation with the voxel at the right” in the voxel at the left. An example of final template is shown in Figure 5.

**Figure 5. Main correlation maps for white matter volume**. For illustrative purposes, this Figure only shows correlations along the three main directions (left-right, back-front and bottom-up). The templates created in this study include the correlations with all 26 voxels surrounding each voxel.

## Validation of the New Kernel

### Method

In order to validate the new method, six voxel-based effect size maps were recreated from peak information using different degrees of anisotropy and FWHM, and compared to the effect size maps directly obtained from the raw statistical parametric maps (“true” effect size maps). The idea is that the lower the difference between peak-recreated and true effect size maps, the better the recreation.

The 120 individuals from the IXI dataset were divided six times in two groups of 60 individuals each, with these divisions being orthogonal between them (25). A statistical parametric map was obtained from the comparison of the registered and smoothed gray matter segments between the two groups of each pair of groups. The six independent statistical parametric maps were thresholded liberally (*p* = 0.001, with a minimum extent of only 10 voxels) in order to obtain significant differences. The mean (±standard deviation) number of clusters was 22 ± 28, and the median (±absolute deviation) was 16 ± 14.

Effect-size signed differential mapping pre-processing of peak information was then conducted with different degrees of anisotropy (0.0, 0.2… 1.0) and different FWHMs (5, 10… 100 mm). Differences between peak-recreated and true effect size maps were summarized with the relative mean square error (MSE), i.e., the MSE obtained under the current degree of anisotropy and FWHM, divided by the MSE obtained under default ES-SDM isotropic FWHM (20 mm) (6). A relative MSE <100% would indicate an improvement of the recreation. A set of six MSEs (one per statistical map) was obtained for each combination of parameters, and we assessed whether these were lower than the MSEs obtained from the same statistical maps under default ES-SDM isotropic FWHM by means of a non-parametric repeated-measures Wilcoxon signed-rank test.

### Results

As shown in Figure 6, the optimal FWHM in this particular dataset ranged from 40–45 mm in the absence of anisotropy (relative MSE = 80%, *p* = 0.053), to 100 mm (or more) when anisotropy was 0.4 or higher (relative MSE = 78–92%, *p* = 0.030–0.053). Narrower FWHMs were associated to substantial increases of the MSE (relative MSE = 120–128%). Wider FWHM also seem to be associated to substantial increases of the MSE, at least in the absence of anisotropy (relative MSE = 127%).

**Figure 6. Relative mean square error (MSE) of the recreation of the statistical maps used in this study depending on the degree of anisotropy and the full-width at half-maximum (FWHM)**. Relative MSE was defined as the MSE obtained with the current set of parameters divided by the MSE obtained after applying effect-size signed differential mapping (ES-SDM) standard isotropic kernel (FWHM = 20 mm). Please note that optimal degree of anisotropy and FWHM were different when using other datasets (not reported here), but use of full anisotropy was still associated to a substantial decrease of MSE.

As expected from Eq. 2, the effects of FWHM were null when recreations were conducted with full anisotropy, whereas there was still a substantial decrease of MSE (MSE = 92%, *p* = 0.030).

Optimal degree of anisotropy and FWHM were different when using other datasets (not reported here), but use of full anisotropy was still associated to a substantial decrease of MSE.

## Discussion

This manuscript presents anisotropic kernels for peak-based meta-analytic methods based on the spatial correlation between neighboring voxels, as well as the creation of the required templates for gray matter, white matter, cerebrospinal and FA. The empirical validation showed that the optimal FWHM in the particular dataset used was substantially larger than in previous validations (6, 20), indicating that optimal FWHM might vary largely depending on the data. However, it also showed that the recreation substantially improved and did not depend on the FWHM when full anisotropy was used. Both the method and the templates are readily available with SDM software^{3}.

These findings support our hypothesis that isotropic kernels may underestimate the effect size in voxels strongly correlated with the peak (e.g., more likely to be from the same brain region), whereas they may overestimate the effect size in voxels weakly correlated (e.g., less likely to be from the same brain region). In this regard, it must be noted that anisotropic kernels have also already successfully been applied in other neuroimaging fields (26).

In the absence of anisotropy, the optimal FWHM in this validation was found to be 45 mm in the present study, whilst reported to be 20–25 mm in previous work. Such difference might be related to the extent and smoothness of the brain activations, differences or abnormalities. Spatially large and smooth effects may be better recreated with large kernels, whilst small and circumscribed effects with small kernels. Thus, recreation could be optimized for each specific meta-analysis based on the degree of anisotropy and FWHM found to optimally recreate the available statistical maps. This optimization may be achieved following a series of steps analogous to those conducted in Section “Method,” namely: (a) threshold the statistical parametric maps; (b) conduct command-line ES-SDM pre-processing with different degrees of anisotropy and FWHM; (c) calculate the MSE of the differences between each peak-recreated map and the corresponding effect size map under each combination of parameters; (d) choose the optimal parameters based on a plot similar to Figure 6. Robustness should be taken into account when deciding which is the optimal combination of parameters, e.g., ensuring that slight variations in anisotropy or FWHM are not associated with large increase in MSE. On the absence of available statistical parametric maps, however, the use of full anisotropy may represent a robust choice given that results do not depend on the FWHM.

There is at least one situation in which isotropic and anisotropic kernels may be probably equivalent, namely when meta-analyzing studies using tract-based spatial statistics (TBSS) (27). These studies limit their statistical analysis to a FA skeleton, but skeletons of different studies do not overlap. To overcome this difficulty, TBSS protocol in ES-SDM consists in retrieving a mass number of liberally thresholded local peaks from the statistical maps and incorporating them into the ES-SDM TBSS map in order to reconstruct the effect size maps in a common skeleton (28). Given the extreme proximity of the retrieved local peaks, no difference is expected between using one or another kernel. Conversely, the effects of anisotropy may be larger than those found in this paper, in studies reporting few but high peaks, as the shape and intensity of the recreated clusters may differ substantially.

Selecting one or another kernel is obviously irrelevant when the ES-SDM meta-analysis does not include any effect size map recreation from peak information. This could be the case in the rare situations in which statistical parametric maps can be obtained from all the studies in a field. Similarly, this is also the case in the more common situation in which a meta-analytic approach is used to combine data from different sites. This “mega-analytic” approach improves upon simpler covariate-based mega-analyses in that results may be extrapolated to sites other than those included in the multi-site study. Finally, selecting one or another kernel is also irrelevant when combining meta-analytic maps from different modalities (e.g., gray matter volume and BOLD response) to obtain a multi-modal meta-analysis (29–31), as again it does not involve any map recreation from peak information.

Two limitations of this study must be acknowledged. First, the validation showed that the use of full anisotropy may be sub-optimal as compared to some combinations of degree of anisotropy and FWHM. However, full anisotropy is still associated with a significant improvement as compared to default isotropic kernels whilst it is more robust because results do not depend on the FWHM. Second, we did not create a specific correlation template for functional MRI (fMRI) or positron emission tomography (PET). Unfortunately, creation of this template is not straightforward because functional correlations between voxels may depend on the state of mind. The optimal template is likely to be different for each specific fMRI task. Also, we do not know to which extent the functional connectivity abnormalities reported in patients (32) may bias the recreation of the effect size maps of task-based fMRI studies when comparing patients with controls. Fortunately, the use of the structural gray matter template may provide a general correlation template, which seems unlikely to depend on the state of mind or the functional connectivity.

## Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

## Footnotes

**^**http://www.brain-development.org/, accessed on Oct 31 2013**^**http://fsl.fmrib.ox.ac.uk/fsl/fslwiki/**^**http://www.sdmproject.com

## References

1. Eickhoff SB, Bzdok D, Laird AR, Kurth F, Fox PT. Activation likelihood estimation meta-analysis revisited. *Neuroimage* (2012) **59**:2349–61. doi: 10.1016/j.neuroimage.2011.09.017

2. Turkeltaub PE, Eden GF, Jones KM, Zeffiro TA. Meta-analysis of the functional neuroanatomy of single-word reading: method and validation. *Neuroimage* (2002) **16**:765–80. doi:10.1006/nimg.2002.1131

3. Laird AR, Fox PM, Price CJ, Glahn DC, Uecker AM, Lancaster JL, et al. ALE meta-analysis: controlling the false discovery rate and performing statistical contrasts. *Hum Brain Mapp* (2005) **25**:155–64. doi:10.1002/hbm.20136

4. Eickhoff SB, Laird AR, Grefkes C, Wang LE, Zilles K, Fox PT. Coordinate-based activation likelihood estimation meta-analysis of neuroimaging data: a random-effects approach based on empirical estimates of spatial uncertainty. *Hum Brain Mapp* (2009) **30**:2907–26. doi:10.1002/hbm.20718

5. Radua J, Mataix-Cols D. Voxel-wise meta-analysis of grey matter changes in obsessive-compulsive disorder. *Br J Psychiatry* (2009) **195**:393–402. doi:10.1192/bjp.bp.108.055046

6. Radua J, Mataix-Cols D, Phillips ML, El-Hage W, Kronhaus DM, Cardoner N, et al. A new meta-analytic method for neuroimaging studies that combines reported peak coordinates and statistical parametric maps. *Eur Psychiatry* (2012) **27**:605–11. doi:10.1016/j.eurpsy.2011.04.001

7. Radua J, van den Heuvel OA, Surguladze S, Mataix-Cols D. Meta-analytical comparison of voxel-based morphometry studies in obsessive-compulsive disorder vs other anxiety disorders. *Arch Gen Psychiatry* (2010) **67**:701–11. doi:10.1001/archgenpsychiatry.2010.70

8. Wager TD, Barrett LF, Bliss-Moreau E, Lindquist K, Duncan S, Kober H, et al. The neuroimaging of emotion. In: Lewis M, Haviland-Jones JM, Barrett LF, editors. *The Handbook of Emotion*, 3rd Edn. New York: Guilford (2008). p. 249–71.

9. Wager TD, Phan KL, Liberzon I, Taylor SF. Valence, gender, and lateralization of functional brain anatomy in emotion: a meta-analysis of findings from neuroimaging. *Neuroimage* (2003) **19**:513–31. doi:10.1016/S1053-8119(03)00078-8

10. Kober H, Barrett LF, Joseph J, Bliss-Moreau E, Lindquist K, Wager TD. Functional grouping and cortical-subcortical interactions in emotion: a meta-analysis of neuroimaging studies. *Neuroimage* (2008) **42**:998–1031. doi:10.1016/j.neuroimage.2008.03.059

11. Rottschy C, Langner R, Dogan I, Reetz K, Laird AR, Schulz JB, et al. Modelling neural correlates of working memory: a coordinate-based meta-analysis. *Neuroimage* (2012) **60**:830–46. doi:10.1016/j.neuroimage.2011.11.050

12. Bohrn IC, Altmann U, Jacobs AM. Looking at the brains behind figurative language – a quantitative meta-analysis of neuroimaging studies on metaphor, idiom, and irony processing. *Neuropsychologia* (2013) **50**:2669–83. doi:10.1016/j.neuropsychologia.2012.07.021

13. Jacobs HI, Radua J, Luckmann HC, Sack AT. Meta-analysis of functional network alterations in Alzheimer’s disease: toward a network biomarker. *Neurosci Biobehav Rev* (2013) **37**:753–65. doi:10.1016/j.neubiorev.2013.03.009

14. Lansley J, Mataix-Cols D, Grau M, Radua J, Sastre-Garriga J. Localized grey matter atrophy in multiple sclerosis: a meta-analysis of voxel-based morphometry studies and associations with functional disability. *Neurosci Biobehav Rev* (2013) **37**:819–30. doi:10.1016/j.neubiorev.2013.03.006

15. Dogan I, Eickhoff SB, Schulz JB, Shah NJ, Laird AR, Fox PT, et al. Consistent neurodegeneration and its association with clinical progression in Huntington’s disease: a coordinate-based meta-analysis. *Neurodegener Dis* (2013) **12**:23–35. doi:10.1159/000339528

16. Hart H, Radua J, Nakao T, Mataix-Cols D, Rubia K. Meta-analysis of functional magnetic resonance imaging studies of inhibition and attention in attention-deficit/hyperactivity disorder: exploring task-specific, stimulant medication, and age effects. *JAMA Psychiatry* (2013) **70**:185–98. doi:10.1001/jamapsychiatry.2013.277

17. Nortje G, Stein DJ, Radua J, Mataix-Cols D, Horn N. Systematic review and voxel-based meta-analysis of diffusion tensor imaging studies in bipolar disorder. *J Affect Disord* (2013) **150**:192–200. doi:10.1016/j.jad.2013.05.034

18. Nickl-Jockschat T, Schneider F, Pagel AD, Laird AR, Fox PT, Eickhoff SB. Progressive pathology is functionally linked to the domains of language and emotion: meta-analysis of brain structure changes in schizophrenia patients. *Eur Arch Psychiatry Clin Neurosci* (2011) **261**(Suppl 2):S166–71. doi:10.1007/s00406-011-0249-8

19. Radua J, Mataix-Cols D. Meta-analytic methods for neuroimaging data explained. *Biol Mood Anxiety Disord* (2012) **2**:6. doi:10.1186/2045-5380-2-6

20. Salimi-Khorshidi G, Smith SM, Keltner JR, Wager TD, Nichols TE. Meta-analysis of neuroimaging data: a comparison of image-based and coordinate-based Pooling of studies. *Neuroimage* (2009) **45**:810–23. doi:10.1016/j.neuroimage.2008.12.039

21. Dijkstra EW. A note on two problems in connexion with graphs. *Numer Math* (1959) **1**:269–71. doi:10.1007/BF01386390

22. Smith SM. Fast robust automated brain extraction. *Hum Brain Mapp* (2002) **17**:143–55. doi:10.1002/hbm.10062

23. Zhang Y, Brady M, Smith S. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. *IEEE Trans Med Imaging* (2001) **20**:45–57. doi:10.1109/42.906424

24. R_Core_Team. *R: A Language and Environment for Statistical Computing*. Vienna: R Foundation for Statistical Computing (2013).

25. Radua J, Canales-Rodríguez EJ, Pomarol-Clotet E, Salvador R. Validity of modulation and optimal settings for advanced voxel-based morphometry. *Neuroimage* (2014) **86**:81–90. doi:10.1016/j.neuroimage.2013.07.084

26. Zhang T, Davatzikos C. Optimally-Discriminative Voxel-Based Morphometry significantly increases the ability to detect group differences in schizophrenia, mild cognitive impairment, and Alzheimer’s disease. *Neuroimage* (2013) **79**:94–110. doi:10.1016/j.neuroimage.2013.04.063

27. Smith SM, Jenkinson M, Johansen-Berg H, Rueckert D, Nichols TE, Mackay CE, et al. Tract-based spatial statistics: voxelwise analysis of multi-subject diffusion data. *Neuroimage* (2006) **31**:1487–505. doi:10.1016/j.neuroimage.2006.02.024

28. Peters BD, Blaas J, de Haan L. Diffusion tensor imaging in the early phase of schizophrenia: what have we learned? *J Psychiatr Res* (2010) **44**:993–1004. doi:10.1016/j.jpsychires.2010.05.003

29. Cooper D, Barker V, Radua J, Fusar-Poli P, Lawrie S. Multimodal voxel-based meta-analysis of structural and functional magnetic resonance imaging studies in those at elevated genetic risk of developing schizophrenia. *Psychiatry Res* (2014) **221**:69–77. doi:10.1016/j.pscychresns.2013.07.008

30. Radua J, Borgwardt S, Crescini A, Mataix-Cols D, Meyer-Lindenberg A, McGuire PK, et al. Multimodal meta-analysis of structural and functional brain changes in first episode psychosis and the effects of antipsychotic medication. *Neurosci Biobehav Rev* (2012) **36**:2325–33. doi:10.1016/j.neubiorev.2012.07.012

31. Radua J, Romeo M, Mataix-Cols D, Fusar-Poli P. A general approach for combining voxel-based meta-analyses conducted in different neuroimaging modalities. *Curr Med Chem* (2013) **20**:462–6. doi:10.2174/0929867311320030017