Pavel Dvurechenskii, Klaus-Robert Müller, Shinichi Nakajima, Vladimir Spokoiny
01.01.2021 − 31.12.2022
We focus on analyzing neuroimaging data (EEG/MEG/fMRI) to find correlates of brain activity for a better understanding of e.g. ageing processes. We develop a Bayesian optimal transport framework to detect and statistically validate clusters and differences of brain activities taking into account the spatio-temporal structure of neuroimaging data.
Detecting spatio-temporal differences or changes of brain states is a fundamental question in computational and clinical neuroscience. For instance, one line of research aims to characterize healthy cognitive ageing by studying potential factors that may be of importance to maintain cognitive functionality across long lifespan (e.g. ). Brain Computer Interfacing (BCI), the research field aiming to decode brain states (e.g. ) in real-time and to translate them into control signals aim to find robust discriminators of various cognitive brain states or transients. To study these scientific goals, various neuroimaging data set have been collected [20,13,18], recently a trend has been towards analyzing multimodal brain signals . Multimodality is important in brain signal analysis, because there is no single non-invasive acquisition method that has both sufficient temporal and spatial resolution. For example, fMRI and MRI have high spatial resolution but low temporal resolution to capture reactions of brains to stimuli, while EEG and MEG have sufficient temporal resolution but suffer from low spatial resolution and low signal-to-noise ratio.
One of the established analytic procedures is source localization, where the EEG/MEG signal generation process is defined by a linear model with the leadfield matrix, and the signal source in the brain is inferred from data samples by solving the inverse problem (e.g.). Novel approaches may benefit from the complementary nature of modalities. For example, one can build a joint model of EEG and MEG signals to infer a common source signal, and use the MRI measurements, which identify the cortical structure of brains, to define subject-specific leadfield matrices such that the source space is aligned to all subjects .
Conventionally, differences in brain activities are detected and validated by statistical test in the source space with the Euclidean metric. However, recent research revealed that optimal transport (OT), which takes the anatomical distance between voxels into account, has emerged as a candidate for a more appropriate metric, avoiding undesired blurring in reconstructed source signals . This implies that the statistical test should also be conducted with the OT metric.
On the neuroscientific side, some of the questions on healthy cognitive ageing—maintenance view and compensation mechanism mentioned above have been partially supported by EEG/MEG signal analyses [4,14]. However, more accurate and robust analysis techniques may be required for furthering our understanding of healthy cognitive ageing. On the BCI side, previous machine learning techniques for EEG/MEG have shown broadly that cognitive states and their transients can be detected, including also fatigues, drowsiness, and attention (e.g.[27,11]). However, high intra and inter-subject variability leave still ample challenges (e.g. ). Clearly, novel robust statistical tools may help to overcome the discussed limitations and will thus facilitate further progress in multimodal analysis of neuroimaging data and cognitive neuroscience.
Our main goal for this project is to establish tools to detect and statistically validate differences in neuroimaging data with an appropriate metric taken into account. More specifically, we aim to establish a statistical method, called Bayesian optimal transport (BOT), with which one can analyse and judge whether groups of signal samples form separate clusters or a single joint cluster based on OT as the metric.
To proceed towards that goal, we will consider a selection of established datasets of multimodal brain data recordings, e.g., Cam-CAN , and DS117 . Those datasets contain EEG/MEG/fMRI/MRI signals acquired multiple times (trials) under different stimuli from different subjects (individuals) belonging to different age groups. Those differences, as well as hidden brain states between trials, are potential (hypothetical) clusters we aim to identify in a statistically grounded manner. Following best practices, EEG/MEG signals are epoched relative to stimulus onset, and epochs during which blinks or eye-movements are detected by EOG will be cleaned or discarded. The remaining epochs are the data samples to be analyzed.
Our goal will then be to demonstrate that clustering/discrimination based on BOT can be useful in many aspects: When it is applied at epoch-level under the same conditions (same subject and stimulus), it can detect hidden brain state changes (e.g., fatigue and drowsiness). When applied at subject-level, it enhances multi-subject learning by providing information on which subjects should be grouped and regularized as a single group—clustering allows us to extend multi-task regression approaches to mixture of regressions approaches . When applied stimulus-wise, it can directly assess statistical significance of the brain activity dependence on the stimulus. Most interesting is group-wise analysis, where hypothetical grouping is provided based on age, gender, and cognitive ability. Statistical test by BOT could potentially contribute to answer to neuroscientific questions (see , e.g., at what age the brain activity changes and exhibits flexibility for healthy cognitive ageing? Is there gender dependence? How much the change-point depends on the individual?
Ideally, BOT should be applied not in the signal space but in the source space, and clustering should be carried out simultaneously when the inverse problem is solved. This is in order to take the advantage of multi-subject analysis with OT as an appropriate metric, avoiding undesired blurring caused by noisy signals and less robust inference procedures.
Another goal of this project is to “explain” the cluster decisions in the source domain. Providing a visual explanation allows to check the consistency of the result with neuroscientific knowledge, e.g., neurosynth label and aparc.a2009s segmentation. In order to draw meaningful insights from the data, the Bayesian OT approach is particularly suitable here as it provides a natural disentanglement of the effects that can be explained by the data (what we are truly interested in) and those that are due to the model and its prior. Practically, we will extend methods for explaining clusters  to BOT, and also integrate recent progress on Bayesian uncertainty quantification of explanations .
Mathematically, we consider activation pattern in the source space as probability distribution on the space of voxels or on the brain surface with probability mass at each point being the relative activation strength at this point. An enhanced leadfield operator will map the signal space to the space of probability measures. Then the space of probability measures will be equipped with an OT distance, in particular, Wasserstein distance, or its unbalanced variant. Observed measures will be considered as samples from some unknown probability distribution on the space of these. These second-level distributions correspond to subjects, age, stimuli, trials, etc.
Based on this model and extending the framework of , different statistical questions will be answered via Bayesian OT barycenter of probability measures. The key aspect is to use prior in the space of log-density for the barycenter and relax the constraints in the definition of an OT distance by proposing a new relaxation method, which we call calming, and adding priors in the space of log-density for transportation plans. This will allow to obtain a variant of Bernstein-von Mises theorem for barycenters. As an output, the framework allows not only to produce the barycenter, but also to equip it with confidence or credible set.
To be more specific, we will consider clustering a) barycenters over epochs for each individual to group subjects with similar activation patterns, b) barycenters over subjects to find subject-independent task-specific activation zones and explain which zones in the brain are responsible for a particular tasks. Detecting change-point by comparing a) barycenters and clusters between group ages to understand brain changes connected to ageing, b) barycenters over trials for different tasks to find task-specific changes in activation patterns and apply this knowledge for better BCI. Explainability of the obtained results by finding a specific activation pattern (see ) for each task and each group of subjects, including different age groups. The main benefit of the BOT in comparison with standard OT is that the new approach comes with statistical guarantees for the obtained results and that these results are in the finite sample setting, which is crucial as the experiments are costly and the provided data has small sample size.
Extending the methods in [8,10] we will address the computational hardness of the barycenter problem to increase scalability of computations and obtain computational complexity bounds. As an alternative, we consider gradient and Hesssian-free optimization approach based on MCMC iterative sampling from posterior to simultaneously solve the optimization problem and statistical estimation problem. Finally, estimating effective dimension and effective subspace will allow to make dimension reduction leading to faster optimization algorithms.