Sequence- vs. chip-assisted genomic selection : accurate biological information is advised

Perez-Enciso, Miguel; Rincón, Juan C.; Legarra, Andres

doi:10.1186/s12711-015-0117-5

Cita bibliogràfica -- Enllaç permanent: https://ddd.uab.cat/record/181901

Google Scholar: cites

Sequence- vs. chip-assisted genomic selection : accurate biological information is advised
Perez-Enciso, Miguel

(Centre de Recerca en Agrigenòmica)
Rincón, Juan C. (Centre de Recerca en Agrigenòmica)
Legarra, Andres (Institut National de la Recherche Agronomique (França))

Data:	2015
Resum:	Background: the development of next-generation sequencing technologies (NGS) has made the use of whole-genome sequence data for routine genetic evaluations possible, which has triggered a considerable interest in animal and plant breeding fields. Here, we investigated whether complete or partial sequence data can improve upon existing SNP (single nucleotide polymorphism) array-based selection strategies by simulation using a mixed coalescence - gene-dropping approach. - Results: we simulated 20 or 100 causal mutations (quantitative trait nucleotides, QTN) within 65 predefined 'gene' regions, each 10 kb long, within a genome composed of ten 3-Mb chromosomes. We compared prediction accuracy by cross-validation using a medium-density chip (7. 5 k SNPs), a high-density (HD, 17 k) and sequence data (335 k). Genetic evaluation was based on a GBLUP method. The simulations showed: (1) a law of diminishing returns with increasing number of SNPs; (2) a modest effect of SNP ascertainment bias in arrays; (3) a small advantage of using whole-genome sequence data vs. HD arrays i. e. ~4%; (4) a minor effect of NGS errors except when imputation error rates are high (≥20%); and (5) if QTN were known, prediction accuracy approached 1. Since this is obviously unrealistic, we explored milder assumptions. We showed that, if all SNPs within causal genes were included in the prediction model, accuracy could also dramatically increase by ~40%. However, this criterion was highly sensitive to either misspecification (including wrong genes) or to the use of an incomplete gene list; in these cases, accuracy fell rapidly towards that reached when all SNPs from sequence data were blindly included in the model. - Conclusions: our study shows that, unless an accurate prior estimate on the functionality of SNPs can be included in the predictor, there is a law of diminishing returns with increasing SNP density. As a result, use of whole-genome sequence data may not result in a highly increased selection response over high-density genotyping.
Ajuts:	Ministerio de Ciencia e Innovación AGL2010-14822 Ministerio de Economía y Competitividad AGL2013-41834-R
Drets:	Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, la comunicació pública de l'obra i la creació d'obres derivades, fins i tot amb finalitats comercials, sempre i quan es reconegui l'autoria de l'obra original.
Llengua:	Anglès
Document:	Article ; recerca ; Versió publicada
Publicat a:	Genetics, selection, evolution, Vol. 47, no. 1 (May 2015) , art. 53, ISSN 1297-9686

DOI: 10.1186/s12711-015-0117-5
PMID: 25956961

14 p, 1.0 MB

El registre apareix a les col·leccions:
Documents de recerca > Documents dels grups de recerca de la UAB > Centres i grups de recerca (producció científica) > Ciències > CRAG (Centre de Recerca en Agrigenòmica)
Articles > Articles de recerca
Articles > Articles publicats

Registre creat el 2017-10-31, darrera modificació el 2022-03-26

Registres semblants

Afegeix-lo al cistell personal
Anomena i desa Citation, BibTeX, MARC, MARCXML, DC, EDM OpenAire4