Katalog Plus
Bibliothek der Frankfurt UAS
Bald neuer Katalog: sichern Sie sich schon vorab Ihre persönlichen Merklisten im Nutzerkonto: Anleitung.
Dieses Ergebnis aus BASE kann Gästen nicht angezeigt werden.  Login für vollen Zugriff.

A benchmarking study of individual somatic variant callers and voting-based ensembles for whole-exome sequencing

Title: A benchmarking study of individual somatic variant callers and voting-based ensembles for whole-exome sequencing
Authors: Guille, Arnaud; Adélaïde, José; Finetti, Pascal; Andre, Fabrice; Birnbaum, Daniel; Mamessier, Emilie; Bertucci, François; Chaffanet, Max
Contributors: Fondation Groupe EDF
Source: Briefings in Bioinformatics ; volume 26, issue 1 ; ISSN 1467-5463 1477-4054
Publisher Information: Oxford University Press (OUP)
Publication Year: 2024
Description: By identifying somatic mutations, whole-exome sequencing (WES) has become a technology of choice for the diagnosis and guiding treatment decisions in many cancers. Despite advances in the field of somatic variant detection and the emergence of sophisticated tools incorporating machine learning, accurately identifying somatic variants remains challenging. Each new somatic variant caller is often accompanied by claims of superior performance compared to predecessors. Furthermore, most comparative studies focus on a limited set of tools and reference datasets, leading to inconsistent results and making it difficult for laboratories to select the optimal solution. Our study comprehensively evaluated 20 somatic variant callers across four reference WES datasets. We subsequently assessed the performance of ensemble approaches by exploring all possible combinations of these callers, generating 8178 and 1013 combinations for single-nucleotide variants (SNVs) and indels, respectively, with varying voting thresholds. Our analysis identified five high-performing individual somatic variant callers: Muse, Mutect2, Dragen, TNScope, and NeuSomatic. For somatic SNVs, an ensemble combining LoFreq, Muse, Mutect2, SomaticSniper, Strelka, and Lancet outperformed the top-performing caller (Dragen) by >3.6% (mean F1 score = 0.927). Similarly, for somatic indels, an ensemble of Mutect2, Strelka, Varscan2, and Pindel outperformed the best individual caller (Neusomatic) by >3.5% (mean F1 score = 0.867). By considering the computational costs of each combination, we were able to identify an optimal solution involving four somatic variant callers, Muse, Mutect2, and Strelka for the SNVs and Mutect2, Strelka, and Varscan2 for the indels, enabling accurate and cost-effective somatic variant detection in whole exome.
Document Type: article in journal/newspaper
Language: English
DOI: 10.1093/bib/bbae697
Availability: https://doi.org/10.1093/bib/bbae697; https://academic.oup.com/bib/article-pdf/26/1/bbae697/61491609/bbae697.pdf
Rights: https://creativecommons.org/licenses/by/4.0/
Accession Number: edsbas.EBD4ECD1
Database: BASE