Web of Science: 6 citations, Scopus: 8 citations, Google Scholar: citations,
T-lex3 : An accurate tool to genotype and estimate population frequencies of transposable elements using the latest short-read whole genome sequencing data
Bogaerts-Márquez, María (Institut de Biologia Evolutiva (UPF-CSIC) (Barcelona))
Barrón Aduriz, Maite Garazi (Institut de Biologia Evolutiva (UPF-CSIC) (Barcelona))
Fiston-Lavier, Anna-Sophie (Institut des Sciences de l'Evolution de Montpellier)
Vendrell Mir, Pol (Centre de Recerca en Agrigenòmica)
Castanera, Raúl (Centre de Recerca en Agrigenòmica)
Casacuberta i Suñer, Josep M. 1962- (Centre de Recerca en Agrigenòmica)
González, Josefa (Institut de Biologia Evolutiva (UPF-CSIC) (Barcelona))

Date: 2020
Abstract: Motivation: Transposable elements (TEs) constitute a significant proportion of the majority of genomes sequenced to date. TEs are responsible for a considerable fraction of the genetic variation within and among species. Accurate genotyping of TEs in genomes is therefore crucial for a complete identification of the genetic differences among individuals, populations and species. Results: In this work, we present a new version of T-lex, a computational pipeline that accurately genotypes and estimates the population frequencies of reference TE insertions using short-read high-throughput sequencing data. In this new version, we have re-designed the T-lex algorithm to integrate the BWA-MEM short-read aligner, which is one of the most accurate short-read mappers and can be launched on longer short-reads (e. g. reads >150 bp). We have added new filtering steps to increase the accuracy of the genotyping, and new parameters that allow the user to control both the minimum and maximum number of reads, and the minimum number of strains to genotype a TE insertion. We also showed for the first time that T-lex3 provides accurate TE calls in a plant genome. Availability and implementation: To test the accuracy of T-lex3, we called 1630 individual TE insertions in Drosophila melanogaster, 1600 individual TE insertions in humans, and 3067 individual TE insertions in the rice genome. We showed that this new version of T-lex is a broadly applicable and accurate tool for genotyping and estimating TE frequencies in organisms with different genome sizes and different TE contents. T-lex3 is available at Github: https://github. com/GonzalezLab/T-lex3.
Grants: European Commission 647900
Agència de Gestió d'Ajuts Universitaris i de Recerca 2017/SGR-880
Ministerio de Economía y Competitividad AGL2016-78992-R
Rights: Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, la comunicació pública de l'obra i la creació d'obres derivades, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. Creative Commons
Language: Anglès
Document: Article ; recerca ; Versió publicada
Published in: Bioinformatics, Vol. 36, Issue 4 (February 2020) , p. 1191-1197, ISSN 1367-4811

DOI: 10.1093/bioinformatics/btz727
PMID: 31580402


7 p, 1.9 MB

The record appears in these collections:
Research literature > UAB research groups literature > Research Centres and Groups (research output) > Experimental sciences > CRAG (Centre for Research in Agricultural Genomics)
Articles > Research articles
Articles > Published articles

 Record created 2020-04-09, last modified 2022-05-13



   Favorit i Compartir