Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Multimodal data integration combines different data modalities to improve predictive and classification performance. In biology, multi-omics profiling has become a powerful tool for applications such as cancer patient stratification. However, integration of multi-omics data remains challenging because of missingness and inherent heterogeneity. Methods such as imputation and sample exclusion often rely on strong assumptions that could lead to information loss or distortion. To address these limitations, we propose MIND (Multimodal Integration with Neighbourhood-aware Distributions), which learns patient-specific embeddings from incomplete multi-omics data using a multimodal Variational Autoencoder with a data-driven prior. We inject neighbourhood structure of the observed dataset, encoded as affinity matrices, into the prior, penalising latent configurations when neighbourhood structures in data and latent spaces diverge. MIND handles high missing rates, unbalanced missingness patterns, and low signal-to-noise ratios robustly. Compared with existing integration methods, MIND achieves better performance on downstream tasks on both synthetic and real data.

More information Original publication

DOI

10.1038/s41467-026-74413-1

Type

Journal article

Publication Date

2026-06-24T00:00:00+00:00