A Genomic Bayesian Multi-trait and Multi-environment Model

Journal

G3-GENES GENOMES GENETICS
Volume 6, Issue 9, Pages 2725-2744

Publisher

OXFORD UNIV PRESS INC
DOI: 10.1534/g3.116.032359

Keywords

multi-trait; multi-environment; Bayesian estimation; genome-enabled prediction; genomic selection; GenPred; shared data resource

When information on multiple genotypes evaluated in multiple environments is recorded, a multi-environment single-trait model for assessing genotype × environment interaction (G × E) is usually employed. Comprehensive models that simultaneously account for correlated traits and trait × genotype × environment interaction (T × G × E) are lacking. In this research, we propose a Bayesian whole-genome prediction (WGP) model for analyzing multiple traits and multiple environments. For this model, we used half-t priors on each standard deviation term and uniform priors on each correlation of the covariance matrix. These priors were noninformative and led to posterior inferences that were insensitive to the choice of hyperparameters. We also developed a computationally efficient Markov chain Monte Carlo (MCMC) algorithm under the above priors, which allowed us to obtain all required full conditional distributions of the parameters, leading to an exact Gibbs sampler for the posterior distribution. We used two real data sets to implement and evaluate the proposed Bayesian method and found that when the correlation between traits was high (>0.5), the proposed model (with unstructured variance-covariance) improved prediction accuracy compared to the model with diagonal and standard variance-covariance structures. The R software package Bayesian Multi-Trait and Multi-Environment (BMTME) offers optimized C++ routines to efficiently perform the analyses.
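
To make the half-t prior on a standard deviation concrete, the following is a minimal Python sketch of its density and of drawing from it. It is an illustration only, not the BMTME package's API or the authors' sampler: the function names `half_t_pdf` and `sample_half_t` are hypothetical, and the degrees of freedom `nu` and scale `scale` are placeholder values, since the abstract does not state the hyperparameters used.

```python
import math

import numpy as np


def half_t_pdf(sigma, nu, scale):
    """Density of a half-t distribution at sigma >= 0.

    nu is the degrees of freedom and scale is the scale parameter.
    The density is twice the Student-t density restricted to sigma >= 0.
    """
    const = 2.0 * math.gamma((nu + 1.0) / 2.0) / (
        math.gamma(nu / 2.0) * math.sqrt(nu * math.pi) * scale
    )
    return const * (1.0 + (sigma / scale) ** 2 / nu) ** (-(nu + 1.0) / 2.0)


def sample_half_t(nu, scale, size, seed=0):
    """Draw half-t variates as |scale * T|, with T ~ Student-t(nu)."""
    rng = np.random.default_rng(seed)
    return np.abs(scale * rng.standard_t(nu, size=size))


# Hypothetical example: prior draws for the standard deviations of three traits.
# nu=2 and scale=10 are illustrative choices, not values taken from the paper.
trait_sds = sample_half_t(nu=2.0, scale=10.0, size=3)
print(trait_sds)
```

A large scale relative to the data makes the prior nearly flat over plausible standard deviations, which is one common way such a prior is made weakly informative.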

Recommended

Article Genetics & Heredity

Bayesian multitrait kernel methods improve multienvironment genome-based prediction

Osval Antonio Montesinos-Lopez, Jose Cricelio Montesinos-Lopez, Abelardo Montesinos-Lopez, Juan Manuel Ramirez-Alcaraz, Jesse Poland, Ravi Singh, Susanne Dreisigacker, Leonardo Crespo, Sushismita Mondal, Velu Govidan, Philomin Juliana, Julio Huerta Espino, Sandesh Shrestha, Rajeev K. Varshney, Jose Crossa

Summary: This study explores Bayesian multitrait kernel methods for genomic prediction and finds that the Gaussian kernel method outperforms traditional methods in prediction performance, capturing nonlinear patterns more efficiently. Evaluating multiple kernels to select the best one is recommended.

G3-GENES GENOMES GENETICS (2022)

Article Plant Sciences

Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding

Osval Antonio Montesinos-Lopez, Abelardo Montesinos-Lopez, Ricardo Acosta, Rajeev K. Varshney, Alison Bentley, Jose Crossa

Summary: Genomic selection is a predictive method used in plant breeding that trains machine learning models with a reference population to predict new lines. This study proposes using incomplete block designs for allocating lines to locations, which outperforms random allocation in terms of predictive performance.

PLANT GENOME (2022)

Article Environmental Sciences

Prediction of Root Biomass in Cassava Based on Ground Penetrating Radar Phenomics

Afolabi Agbona, Brody Teare, Henry Ruiz-Guzman, Iliyana D. Dobreva, Mark E. Everett, Tyler Adams, Osval A. Montesinos-Lopez, Peter A. Kulakow, Dirk B. Hays

Summary: Inadequate means to measure early storage root bulking in cassava have prompted a study to evaluate the capability of ground penetrating radar (GPR) for non-destructive assessment of cassava root biomass. Different methods of processing the GPR radargram were tested, with a simple model without interaction producing the best prediction accuracy. Results demonstrate the potential for GPR technology to be adopted by cassava breeding programs for selecting early stage root bulking to increase crop yield.

REMOTE SENSING (2021)

Article Plant Sciences

Comparing gradient boosting machine and Bayesian threshold BLUP for genome-based prediction of categorical traits in wheat breeding

Osval Antonio Montesinos-Lopez, Henry Nicole Gonzalez, Abelardo Montesinos-Lopez, Maria Daza-Torres, Morten Lillemo, Jose Cricelio Montesinos-Lopez, Jose Crossa

Summary: Genomic selection is a predictive methodology that is changing plant breeding. In this study, the performance of two algorithms (TGBLUP and GBM) was compared on wheat datasets, and GBM outperformed TGBLUP in terms of prediction accuracy. Further research is encouraged to explore the virtues of GBM in genomic selection.

PLANT GENOME (2022)

Article Genetics & Heredity

A General-Purpose Machine Learning R Library for Sparse Kernels Methods With an Application for Genome-Based Prediction

Osval Antonio Montesinos Lopez, Brandon Alejandro Mosqueda Gonzalez, Abel Palafox Gonzalez, Abelardo Montesinos Lopez, Jose Crossa

Summary: This paper presents a new software package (SKM) for implementing six popular supervised machine learning algorithms with the optional use of sparse kernels, as well as a function for computing seven different kernels. SKM focuses on user simplicity and computational efficiency, providing a user-friendly format for algorithms and reducing resources needed for kernel machine learning methods.

FRONTIERS IN GENETICS (2022)

Article Genetics & Heredity

Partial Least Squares Enhances Genomic Prediction of New Environments

Osval A. Montesinos-Lopez, Abelardo Montesinos-Lopez, Kismiantini, Armando Roman-Gallardo, Keith Gardner, Morten Lillemo, Roberto Fritsche-Neto, Jose Crossa

Summary: Improved prediction of future seasons or new environments is crucial for plant breeding. This study demonstrates that the partial least squares regression method outperforms the Bayesian genomic best linear unbiased predictor method in predicting future seasons or new environments.

FRONTIERS IN GENETICS (2022)

Article Genetics & Heredity

A Comparison of Three Machine Learning Methods for Multivariate Genomic Prediction Using the Sparse Kernels Method (SKM) Library

Osval A. Montesinos-Lopez, Abelardo Montesinos-Lopez, Bernabe Cano-Paez, Carlos Moises Hernandez-Suarez, Pedro C. Santana-Mancilla, Jose Crossa

Summary: Genomic selection has revolutionized the way plant breeders select genotypes, using statistical machine learning models to predict phenotypic values of new lines. Multi-trait genomic prediction models leverage correlated traits to improve accuracy. This paper compares the performance of three multi-trait methods and finds that their performance varies under different predictors.

GENES (2022)

Article Genetics & Heredity

Multi-trait genome prediction of new environments with partial least squares

Osval A. Montesinos-Lopez, Abelardo Montesinos-Lopez, David Alejandro Bernal Sandoval, Brandon Alejandro Mosqueda-Gonzalez, Marco Alberto Valenzo-Jimenez, Jose Crossa

Summary: The genomic selection methodology has revolutionized plant breeding by using statistical machine learning algorithms to predict candidate individuals. However, it faces challenges when predicting future seasons or new environments. This study compared the performance of the multi-trait partial least squares (MT-PLS) regression method with the Bayesian Multi-trait Genomic Best Linear Unbiased Predictor (MT-GBLUP) method and found that MT-PLS outperforms MT-GBLUP in predicting future seasons or new environments.

FRONTIERS IN GENETICS (2022)

Article Plant Sciences

Enviromic-based kernels may optimize resource allocation with multi-trait multi-environment genomic prediction for tropical Maize

Raysa Gevartosky, Humberto Fanelli Carvalho, Germano Costa-Neto, Osval A. Montesinos-Lopez, Jose Crossa, Roberto Fritsche-Neto

Summary: This study aimed to design optimized training sets for genomic prediction in multi-trait multi-environment trials and to assess how those methods may increase accuracy while reducing phenotyping costs. The combined use of genomic and enviromic data efficiently designs optimized training sets for genomic prediction, improving the response to selection per dollar invested.

BMC PLANT BIOLOGY (2023)

Article Genetics & Heredity

Integrating Parental Phenotypic Data Enhances Prediction Accuracy of Hybrids in Wheat Traits

Osval A. Montesinos-Lopez, Alison R. Bentley, Carolina Saint Pierre, Leonardo Crespo-Herrera, Josafhat Salinas Ruiz, Patricia Edwigis Valladares-Celis, Abelardo Montesinos-Lopez, Jose Crossa

Summary: Genomic selection (GS) is a revolutionary plant breeding method that allows the selection of candidate genotypes without the need for field phenotypic evaluation. This study investigated the genomic prediction accuracy of wheat hybrids by incorporating covariates with parental phenotypic information into the model. The results showed that the models with parental information outperformed those without parental information, and the inclusion of covariates significantly improved prediction accuracy compared to marker information. However, the use of parental phenotypic information as covariates is expensive and not always available.

GENES (2023)

Article Biotechnology & Applied Microbiology

Two simple methods to improve the accuracy of the genomic selection methodology

Osval A. Montesinos-Lopez, Abelardo Kismiantini, Abelardo Montesinos-Lopez

Summary: Genomic selection (GS) is revolutionizing plant and animal breeding, but its practical implementation faces challenges due to uncontrolled factors. To improve prediction accuracy, this paper proposes two methods: reformulating GS as a binary classification problem, and applying postprocessing to adjust the classification threshold. Both methods outperformed the conventional regression model, with the postprocessing method showing better results.

BMC GENOMICS (2023)

Article Genetics & Heredity

Multimodal deep learning methods enhance genomic prediction of wheat breeding

Abelardo Montesinos-Lopez, Carolina Rivera, Francisco Pinto, Francisco Pinera, David Gonzalez, Mathew Reynolds, Paulino Perez-Rodriguez, H. Li, Osval A. Montesinos-Lopez, Jose Crossa

Summary: By comparing a novel deep learning (DL) method with conventional genomic prediction (GP) models, this study found that the DL method has higher accuracy in predicting phenotypes in plant breeding research and can account for the complexity of genotype-environment interaction. However, traditional GP models can also achieve high accuracy in certain situations.

G3-GENES GENOMES GENETICS (2023)

Article Plant Sciences

Efficacy of plant breeding using genomic information

Osval A. Montesinos-Lopez, Alison R. Bentley, Carolina Saint Pierre, Leonardo Crespo-Herrera, Leonardo Rebollar-Ruellas, Patricia Edwigis Valladares-Celis, Morten Lillemo, Abelardo Montesinos-Lopez, Jose Crossa

Summary: Genomic selection (GS), proposed by Meuwissen et al. more than 20 years ago, is revolutionizing plant and animal breeding. In our study of 14 real datasets, we found that the average gain in prediction accuracy when genomic information is considered was 26.31%. The quality of the markers and relatedness of the individuals can greatly impact the increase in prediction accuracy.

PLANT GENOME (2023)

Article Environmental Sciences

Yield Adjustment Using GPR-Derived Spatial Covariance Structure in Cassava Field: A Preliminary Investigation

Afolabi Agbona, Osval A. Montesinos-Lopez, Mark E. Everett, Henry Ruiz-Guzman, Dirk B. Hays

Summary: Many aspects of below-ground plant performance are not fully understood, including their spatial and temporal dynamics in relation to environmental factors. In this study, Ground-Penetrating Radar (GPR) was evaluated for its potential in normalizing spatial heterogeneity and estimating fresh root yield in a cassava field trial. The results showed that the GPR-based autoregressive (AR) model outperformed other models, indicating the potential of GPR in non-destructive yield estimation and field spatial heterogeneity normalization in root and tuber crop programs.

REMOTE SENSING (2023)

Article Agronomy

Designing optimal training sets for genomic prediction using adversarial validation with probit regression

Osval Montesinos-Lopez, Kismiantini, Abelardo Montesinos-Lopez

Summary: Genomic selection is revolutionizing animal and plant breeding, but its implementation faces challenges due to mismatch in training and testing set distributions. This research used the adversarial validation method with probit regression to address the distribution mismatch and select optimal training sets. Evaluations showed that the proposed method effectively detected the mismatch and outperformed existing methods, achieving higher prediction accuracy.

PLANT BREEDING (2023)
