TY - JOUR
T1 - GenomeFLTR
T2 - Filtering reads made easy
AU - Dotan, Edo
AU - Alburquerque, Michael
AU - Wygoda, Elya
AU - Huchon, Dorothee
AU - Pupko, Tal
N1 - Publisher Copyright:
© 2023 The Author(s). Published by Oxford University Press on behalf of Nucleic Acids Research.
PY - 2023/7/5
Y1 - 2023/7/5
N2 - In the last decade, advances in sequencing technology have led to an exponential increase in genomic data. These new data have dramatically changed our understanding of the evolution and function of genes and genomes. Despite improvements in sequencing technologies, identifying contaminated reads remains a complex task for many research groups. Here, we introduce GenomeFLTR, a new web server to filter contaminated reads. Reads are compared against existing sequence databases from various representative organisms to detect potential contaminants. The main features implemented in GenomeFLTR are: (i) automated updating of the relevant databases; (ii) fast comparison of each read against the database; (iii) the ability to create user-specified databases; (iv) a user-friendly interactive dashboard to investigate the origin and frequency of the contaminations; (v) the generation of a contamination-free file. Availability: https://genomefltr.tau.ac.il/.
AB - In the last decade, advances in sequencing technology have led to an exponential increase in genomic data. These new data have dramatically changed our understanding of the evolution and function of genes and genomes. Despite improvements in sequencing technologies, identifying contaminated reads remains a complex task for many research groups. Here, we introduce GenomeFLTR, a new web server to filter contaminated reads. Reads are compared against existing sequence databases from various representative organisms to detect potential contaminants. The main features implemented in GenomeFLTR are: (i) automated updating of the relevant databases; (ii) fast comparison of each read against the database; (iii) the ability to create user-specified databases; (iv) a user-friendly interactive dashboard to investigate the origin and frequency of the contaminations; (v) the generation of a contamination-free file. Availability: https://genomefltr.tau.ac.il/.
UR - http://www.scopus.com/inward/record.url?scp=85164235810&partnerID=8YFLogxK
U2 - 10.1093/nar/gkad410
DO - 10.1093/nar/gkad410
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 37177997
AN - SCOPUS:85164235810
SN - 0305-1048
VL - 51
SP - W232-W236
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 1 W
ER -