GePMI: A statistical model for personal intestinal microbiome identification

Zicheng Wang, Huazhe Lou, Ying Wang, Ron Shamir, Rui Jiang*, Ting Chen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

Human gut microbiomes consist of a large number of microbial genomes, which vary by diet and health conditions and from individual to individual. In the present work, we asked whether such variation or similarity could be measured and, if so, whether the results could be used for personal microbiome identification (PMI). To address this question, we herein propose a method to estimate the significance of similarity among human gut metagenomic samples based on reference-free, long k-mer features. Using these features, we find that pairwise similarities between the metagenomes of any two individuals obey a beta distribution and that a p value derived accordingly well characterizes whether two samples are from the same individual or not. We develop a computational framework called GePMI (Generating inter-individual similarity distribution for Personal Microbiome Identification) and apply it to several human gut metagenomic datasets (>300 individuals and >600 samples in total). From the results of GePMI, most of the human gut microbiomes can be identified (auROC = 0.9470, auPRC = 0.8702). Even after antibiotic treatment or fecal microbiota transplantation, the individual k-mer signature still maintains a certain specificity.

Original languageEnglish
Article number20
Journalnpj Biofilms and Microbiomes
Volume4
Issue number1
DOIs
StatePublished - 1 Dec 2018

Funding

FundersFunder number
National Natural Science Foundation of China31600096, 61561146396, 61721003, 61673241
Tsinghua National Laboratory for Information Science and Technology

    Fingerprint

    Dive into the research topics of 'GePMI: A statistical model for personal intestinal microbiome identification'. Together they form a unique fingerprint.

    Cite this