TY - JOUR
T1 - MUSiCC
T2 - A marker genes based framework for metagenomic normalization and accurate profiling of gene abundances in the microbiome
AU - Manor, Ohad
AU - Borenstein, Elhanan
N1 - Publisher Copyright: © 2015 Manor and Borenstein; licensee BioMed Central.
PY - 2015/3/25
Y1 - 2015/3/25
N2 - Functional metagenomic analyses commonly involve a normalization step, where measured levels of genes or pathways are converted into relative abundances. Here, we demonstrate that this normalization scheme introduces marked biases both across and within human microbiome samples, and identify sample- and gene-specific properties that contribute to these biases. We introduce an alternative normalization paradigm, MUSiCC, which combines universal single-copy genes with machine learning methods to correct these biases and to obtain an accurate and biologically meaningful measure of gene abundances. Finally, we demonstrate that MUSiCC significantly improves downstream discovery of functional shifts in the microbiome. MUSiCC is available at http://elbo.gs.washington.edu/software.html.
UR - https://www.scopus.com/pages/publications/84939202778
U2 - 10.1186/s13059-015-0610-8
DO - 10.1186/s13059-015-0610-8
M3 - Article
C2 - 25885687
AN - SCOPUS:84939202778
SN - 1474-7596
VL - 16
JO - Genome Biology
JF - Genome Biology
IS - 1
M1 - 53
ER -