TY - JOUR
T1 - Panoramic
T2 - A package for constructing eukaryotic pan-genomes
AU - Glick, Lior
AU - Mayrose, Itay
N1 - Publisher Copyright:
© 2021 John Wiley & Sons Ltd
PY - 2021/5
Y1 - 2021/5
N2 - The study of intraspecific genomic variation in eukaryotic species has been the focus of numerous genome resequencing projects in recent years. One emerging approach for the analysis of intraspecific diversity uses the concept of a pan-genome, which theoretically represents the full set of genomic sequences and coding genes from all individuals of a given species. This approach has many advantages over reference-based methods and has been successfully applied to study both prokaryotic and eukaryotic species. However, the process of pan-genome construction still presents considerable scientific and technical challenges, especially for eukaryotic species with large and complex genomes. Although general approaches for the construction of pan-genomes have been devised, currently available software tools implement only certain modules of the entire computational procedure. Therefore, each pan-genome project requires the development of tailored analysis pipelines, thus complicating and prolonging the process and impairing research reproducibility and comparison across studies. Here, we present Panoramic, a software package for the automatic construction of eukaryotic pan-genomes. Panoramic takes raw sequencing reads as input and applies two alternative approaches for pan-genome construction. Panoramic makes pan-genome construction a considerably easier task by providing simple user interface and efficient data processing algorithms. We demonstrate the use of Panoramic by constructing the pan-genome of the model plant species Arabidopsis thaliana from sequencing data of 20 diverse ecotypes.
AB - The study of intraspecific genomic variation in eukaryotic species has been the focus of numerous genome resequencing projects in recent years. One emerging approach for the analysis of intraspecific diversity uses the concept of a pan-genome, which theoretically represents the full set of genomic sequences and coding genes from all individuals of a given species. This approach has many advantages over reference-based methods and has been successfully applied to study both prokaryotic and eukaryotic species. However, the process of pan-genome construction still presents considerable scientific and technical challenges, especially for eukaryotic species with large and complex genomes. Although general approaches for the construction of pan-genomes have been devised, currently available software tools implement only certain modules of the entire computational procedure. Therefore, each pan-genome project requires the development of tailored analysis pipelines, thus complicating and prolonging the process and impairing research reproducibility and comparison across studies. Here, we present Panoramic, a software package for the automatic construction of eukaryotic pan-genomes. Panoramic takes raw sequencing reads as input and applies two alternative approaches for pan-genome construction. Panoramic makes pan-genome construction a considerably easier task by providing simple user interface and efficient data processing algorithms. We demonstrate the use of Panoramic by constructing the pan-genome of the model plant species Arabidopsis thaliana from sequencing data of 20 diverse ecotypes.
KW - Arabidopsis thaliana
KW - Pan-genome
KW - assembly
KW - gene content
KW - genomics
KW - presence/absence variation
UR - http://www.scopus.com/inward/record.url?scp=85102249983&partnerID=8YFLogxK
U2 - 10.1111/1755-0998.13344
DO - 10.1111/1755-0998.13344
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 33533167
AN - SCOPUS:85102249983
SN - 1755-098X
VL - 21
SP - 1393
EP - 1403
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
IS - 4
ER -