Katalog Plus
Bibliothek der Frankfurt UAS
Bald neuer Katalog: sichern Sie sich schon vorab Ihre persönlichen Merklisten im Nutzerkonto: Anleitung.
Dieses Ergebnis aus BASE kann Gästen nicht angezeigt werden.  Login für vollen Zugriff.

Statistical modelling of an outcome variable with integrated multi-omics

Title: Statistical modelling of an outcome variable with integrated multi-omics
Authors: Li,He; Gu,Zander; el Bouhaddani, Said; Houwing-Duistermaat,Jeanine; Datascience; Cancer
Publication Year: 2026
Subject Terms: Latent variables; Low-dimensional representation; Metabolomics; Multivariate analysis; Polygenic score; Structural Biology; Biochemistry; Molecular Biology; Computer Science Applications; Applied Mathematics
Description: Background: In studies that aim to model the relationship between an outcome variable and multiple omics datasets, it is often desirable to reduce the dimensionality of these datasets or to represent one omics dataset in terms of another. Several approaches exist for this purpose, including univariate methods such as polygenic scores, and multivariate methods. Multivariate approaches offer advantages by producing lower-dimensional integrative scores, capturing joint structures across datasets, and filtering out dataset-specific noise. In this paper, we describe one univariate and two multivariate methods, and evaluate their performance through simulations involving two correlated multivariate normally distributed omics datasets, as well as a combination of one multivariate normal and one fixed categorical dataset. Results: We assess method performance using the root mean squared error (RMSE) when modelling the outcome variable as a function of the reduced omics representations. Multivariate methods generally perform well, particularly when a slightly higher number of components is used for integration. They outperform the univariate method in scenarios involving two normally distributed omics datasets and perform comparably in settings with one normal and one categorical dataset. In real data applications, including two metabolomics datasets from TwinsUK and a metabolomics-genetic dataset from ORCADES, all methods show similar performance in modelling body mass index. Conclusions: Multivariate methods provide a valuable framework for summarizing multi-omics datasets into low-dimensional components suitable for outcome modelling. Even in the presence of non-normal data, these methods offer a promising alternative to high-dimensional univariate approaches.
Document Type: article in journal/newspaper
File Description: application/pdf
Language: English
ISSN: 1471-2105
Relation: https://dspace.library.uu.nl/handle/1874/469368
Availability: https://dspace.library.uu.nl/handle/1874/469368
Rights: info:eu-repo/semantics/OpenAccess
Accession Number: edsbas.91C647A5
Database: BASE