SCAPP: an algorithm for improved plasmid assembly in metagenomes

David Pellow*, Alvah Zorea, Maraike Probst, Ori Furman, Arik Segal, Itzhak Mizrahi, Ron Shamir

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

Background: Metagenomic sequencing has led to the identification and assembly of many new bacterial genome sequences. These bacteria often contain plasmids: usually small, circular double-stranded DNA molecules that may transfer across bacterial species and confer antibiotic resistance. These plasmids are generally less studied and understood than their bacterial hosts. Part of the reason for this is insufficient computational tools enabling the analysis of plasmids in metagenomic samples. Results: We developed SCAPP (Sequence Contents-Aware Plasmid Peeler)—an algorithm and tool to assemble plasmid sequences from metagenomic sequencing. SCAPP builds on some key ideas from the Recycler algorithm while improving plasmid assemblies by integrating biological knowledge about plasmids. We compared the performance of SCAPP to Recycler and metaplasmidSPAdes on simulated metagenomes, real human gut microbiome samples, and a human gut plasmidome dataset that we generated. We also created plasmidome and metagenome data from the same cow rumen sample and used the parallel sequencing data to create a novel assessment procedure. Overall, SCAPP outperformed Recycler and metaplasmidSPAdes across this wide range of datasets. Conclusions: SCAPP is an easy to use Python package that enables the assembly of full plasmid sequences from metagenomic samples. It outperformed existing metagenomic plasmid assemblers in most cases and assembled novel and clinically relevant plasmids in samples we generated such as a human gut plasmidome. SCAPP is open-source software available from: https://github.com/Shamir-Lab/SCAPP. [MediaObject not available: see fulltext.].

Original languageEnglish
Article number144
JournalMicrobiome
Volume9
Issue number1
DOIs
StatePublished - Dec 2021

Funding

FundersFunder number
ERC Horizon 2020
National Science Foundation1947/19, 2016694
Horizon 2020 Framework Programme640384
United States-Israel Binational Science Foundation
Israel Science Foundation1339/18

    Keywords

    • Assembly
    • Plasmids

    Fingerprint

    Dive into the research topics of 'SCAPP: an algorithm for improved plasmid assembly in metagenomes'. Together they form a unique fingerprint.

    Cite this