BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching

Igor Mandric*, Brian L. Hill, Malika K. Freund, Michael Thompson, Eran Halperin

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Single-cell RNA-sequencing (scRNA-seq) is a set of technologies used to profile gene expression at the level of individual cells. Although the throughput of scRNA-seq experiments is steadily growing in terms of the number of cells, large datasets are not yet commonly generated owing to prohibitively high costs. Integrating multiple datasets into one can improve power in scRNA-seq experiments, and efficient integration is very important for downstream analyses such as identifying cell-type-specific eQTLs. State-of-the-art scRNA-seq integration methods are based on the mutual nearest neighbor paradigm and fail to both correct for batch effects and maintain the local structure of the datasets. In this paper, we propose a novel scRNA-seq dataset integration method called BATMAN (BATch integration via minimum-weight MAtchiNg). Across multiple simulations and real datasets, we show that our method significantly outperforms state-of-the-art tools with respect to existing metrics for batch effects by up to 80% while retaining cell-to-cell relationships.

Original languageEnglish
Article number101185
JournaliScience
Volume23
Issue number6
DOIs
StatePublished - 26 Jun 2020
Externally publishedYes

Funding

FundersFunder number
National Science Foundation
National Institutes of Health
National Human Genome Research Institute1R56MD013312, 5UL1TR001881, / NHGRI HG010505-02, 1R01MH115979, 5R25GM112625, R01MH115676, R01HG009120, U01CA194393
Directorate for Computer and Information Science and Engineering1705197

    Keywords

    • Algorithms
    • Bioinformatics
    • Transcriptomics

    Fingerprint

    Dive into the research topics of 'BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching'. Together they form a unique fingerprint.

    Cite this