Benchmarking scRNA-seq copy number variation callers

Research output: Contribution to journalArticleAcademicpeer-review

10 Downloads (Pure)

Abstract

Copy number variations (CNVs), the gain or loss of genomic regions, are associated with disease, especially cancer. Single cell technologies offer new possibilities to capture within-sample heterogeneity of CNVs and identify subclones relevant for tumor progression and treatment outcome. Several computational tools have been developed to identify CNVs from scRNA-seq data. However, an independent benchmarking of them is lacking. Here, we evaluate six popular methods in their ability to correctly identify ground truth CNVs, euploid cells and subclonal structures in 21 scRNA-seq datasets. We discover dataset-specific factors influencing the performance, including dataset size, the number and type of CNVs in the sample and the choice of the reference dataset. Methods which include allelic information perform more robustly for large droplet-based datasets, but require higher runtime. Furthermore, the methods differ in their additional functionalities. We offer a benchmarking pipeline to identify the optimal method for new datasets, and improve methods' performance.

Original languageEnglish
Article number8777
Number of pages17
JournalNature Communications
Volume16
Issue number1
DOIs
Publication statusPublished - 2-Oct-2025

Keywords

  • DNA Copy Number Variations/genetics
  • Humans
  • Benchmarking
  • Single-Cell Analysis/methods
  • Neoplasms/genetics
  • RNA-Seq/methods
  • Computational Biology/methods
  • Single-Cell Gene Expression Analysis

Fingerprint

Dive into the research topics of 'Benchmarking scRNA-seq copy number variation callers'. Together they form a unique fingerprint.

Cite this