Katalog Plus
Bibliothek der Frankfurt UAS
Bald neuer Katalog: sichern Sie sich schon vorab Ihre persönlichen Merklisten im Nutzerkonto: Anleitung.
Dieses Ergebnis aus BASE kann Gästen nicht angezeigt werden.  Login für vollen Zugriff.

Data Sheet 1_Analysis of follow-up data in large biobank cohorts: a review of methodology.pdf

Title: Data Sheet 1_Analysis of follow-up data in large biobank cohorts: a review of methodology.pdf
Authors: Anastassia Kolde; Merli Koitmäe; Meelis Käärik; Märt Möls; Krista Fischer
Publication Year: 2025
Collection: The University of Auckland: Figshare
Subject Terms: Genetics; survival analysis; genome-wide association study; populationbased biobank data; martingale residuals; cox proportional hazards model
Description: This study focuses on key methodological challenges in genome-wide association studies (GWAS) of biobank data with time-to-event outcomes, analyzed using the Cox proportional hazards (CPH) model. We address four primary issues: left-truncation of the data, computational inefficiency of standard model-fitting algorithms, relatedness among individuals, and model misspecification. To manage left-truncation, the common practice is to use age as the timescale, with individuals entering the risk set at their age of recruitment. We assess how this choice of timescale influences bias and statistical power, under realistic GWAS conditions of varying effect sizes and censoring rates. In addition, to alleviate the computational burden typical in large-scale data, we propose and evaluate a two-step martingale residual (MR) approach for high-dimensional CPH modeling. Our results show that the timescale choice has minimal effect on accuracy for small hazard ratios, though using time since birth as the timescale – ignoring recruitment age – yields the highest power for association detection. We find that relatedness, when ignored, does not substantially bias effect size estimates, while omitting key covariates introduces significant bias. The two-step MR approach proves to be computationally efficient, retaining power for detecting small effect sizes, making it suitable for large-scale association studies. However, when precise effect size estimates are critical, particularly for moderate or larger effect sizes, we recommend recalculating these estimates using the conventional CPH model, with careful attention to left-truncation and relatedness. These conclusions are drawn from simulations and illustrated with data from the Estonian Biobank cohort.
Document Type: dataset
Language: unknown
DOI: 10.3389/fgene.2025.1534726.s001
Availability: https://doi.org/10.3389/fgene.2025.1534726.s001; https://figshare.com/articles/dataset/Data_Sheet_1_Analysis_of_follow-up_data_in_large_biobank_cohorts_a_review_of_methodology_pdf/29399525
Rights: CC BY 4.0
Accession Number: edsbas.1208395A
Database: BASE