Statistical modeling of coverage in high-throughput data

David Golan, Saharon Rosset

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

1 Scopus citations

Abstract

In high-throughput sequencing experiments, the number of reads mapping to a genomic region, also known as the "coverage" or "coverage depth," is often used as a proxy for the abundance of the underlying genomic region in the sample. The abundance, in turn, can be used for many purposes including calling SNPs, estimating the allele frequency in a pool of individuals, identifying copy number variations, and identifying differentially expressed shRNAs in shRNA-seq experiments. In this chapter we describe the fundamentals of statistical modeling of coverage depth and discuss the problems of estimation and inference in the relevant experimental scenarios.

Original languageEnglish
Title of host publicationDeep Sequencing Data Analysis
PublisherHumana Press Inc.
Pages61-79
Number of pages19
ISBN (Print)9781627035132
DOIs
StatePublished - 2013

Publication series

NameMethods in Molecular Biology
Volume1038
ISSN (Print)1064-3745

Keywords

  • CNV calling
  • Coverage
  • Differential expression
  • High-throughput sequencing
  • Modeling
  • Next-generation sequencing
  • SNP calling
  • Statistical modeling of coverage

Fingerprint

Dive into the research topics of 'Statistical modeling of coverage in high-throughput data'. Together they form a unique fingerprint.

Cite this