CellPhy: accurate and fast probabilistic inference of single-cell phylogenies from scDNA-seq data
DATE:
2022-01-26
UNIVERSAL IDENTIFIER: http://hdl.handle.net/11093/5559
EDITED VERSION: https://doi.org/10.1186/s13059-021-02583-w
UNESCO SUBJECT: 2409 Genética
DOCUMENT TYPE: article
ABSTRACT
We introduce CellPhy, a maximum likelihood framework for inferring phylogenetic trees from somatic single-cell single-nucleotide variants. CellPhy leverages a finite-site Markov genotype model with 16 diploid states and considers amplification error and allelic dropout. We implement CellPhy into RAxML-NG, a widely used phylogenetic inference package that provides statistical confidence measurements and scales well on large datasets with hundreds or thousands of cells. Comprehensive simulations suggest that CellPhy is more robust to single-cell genomics errors and outperforms state-of-the-art methods under realistic scenarios, both in accuracy and speed.