☆ 4.3 Article

Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits

G3-GENES GENOMES GENETICS (2019)

期刊

G3-GENES GENOMES GENETICS

卷 9, 期 11, 页码 3691-3702

出版社

OXFORD UNIV PRESS INC

DOI: 10.1534/g3.119.400498

关键词

Genomic selection; artificial neural network; genotype-to-phenotype; Genomic Prediction; GenPred; Shared Data Resources

类别

Genetics & Heredity

资金

National Science Foundation (NSF) [2015196719]
Graduate Research Opportunities Abroad (GROW) Fellowship
NSF PlantGenomics Research Experiences for Undergraduate
U.S. Department of Energy Great Lakes Bioenergy Research Center [BER DE-SC0018409]
National Science Foundation [IOS-1546617, DEB-1655386]
National Institutes of Health [R01GM099992, R01FM101219]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The usefulness of genomic prediction in crop and livestock breeding programs has prompted efforts to develop new and improved genomic prediction algorithms, such as artificial neural networks and gradient tree boosting. However, the performance of these algorithms has not been compared in a systematic manner using a wide range of datasets and models. Using data of 18 traits across six plant species with different marker densities and training population sizes, we compared the performance of six linear and six non-linear algorithms. First, we found that hyperparameter selection was necessary for all non-linear algorithms and that feature selection prior to model training was critical for artificial neural networks when the markers greatly outnumbered the number of training lines. Across all species and trait combinations, no one algorithm performed best, however predictions based on a combination of results from multiple algorithms (i.e., ensemble predictions) performed consistently well. While linear and non-linear algorithms performed best for a similar number of traits, the performance of non-linear algorithms vary more between traits. Although artificial neural networks did not perform best for any trait, we identified strategies (i.e., feature selection, seeded starting weights) that boosted their performance to near the level of other algorithms. Our results highlight the importance of algorithm selection for the prediction of trait values.

Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits

期刊

G3-GENES GENOMES GENETICS

出版社

OXFORD UNIV PRESS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits

期刊

G3-GENES GENOMES GENETICS

出版社

OXFORD UNIV PRESS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文