4.3 Article

Genomic Bayesian Prediction Model for Count Data with Genotype x Environment Interaction

期刊

G3-GENES GENOMES GENETICS
卷 6, 期 5, 页码 1165-1177

出版社

OXFORD UNIV PRESS INC
DOI: 10.1534/g3.116.028118

关键词

Bayesian model; count data; genome enabled prediction; Gibbs sampler; GenPred; shared data resource; genomic selection

向作者/读者索取更多资源

Genomic tools allow the study of the whole genome, and facilitate the study of genotype-environment combinations and their relationship with phenotype. However, most genomic prediction models developed so far are appropriate for Gaussian phenotypes. For this reason, appropriate genomic prediction models are needed for count data, since the conventional regression models used on count data with a large sample size (n(T)) and a small number of parameters (p) cannot be used for genomic-enabled prediction where the number of parameters (p) is larger than the sample size (n(T)). Here, we propose a Bayesian mixed-negative binomial (BMNB) genomic regression model for counts that takes into account genotype by environment (GxE) interaction. We also provide all the full conditional distributions to implement a Gibbs sampler. We evaluated the proposed model using a simulated data set, and a real wheat data set from the International Maize and Wheat Improvement Center (CIMMYT) and collaborators. Results indicate that our BMNB model provides a viable option for analyzing count data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

A feature selection model for speech emotion recognition using clustering-based population generation with hybrid of equilibrium optimizer and atom search optimization algorithm

Soham Chattopadhyay, Arijit Dey, Pawan Kumar Singh, Ali Ahmadian, Ram Sarkar

Summary: Speech is crucial in human communication and human-computer interaction. In the field of AI and ML, it has been extensively studied to recognize human emotions from speech signals. To address the challenge of large feature dimension, a hybrid feature selection algorithm called CEOAS is proposed. By extracting LPC and LPCC features, the proposed model reduces feature dimension and improves classification accuracy. Impressive recognition accuracies have been achieved on four benchmark datasets, surpassing state-of-the-art algorithms.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

Automatic spoken language identification using MFCC based time series features

Mainak Biswas, Saif Rahaman, Ali Ahmadian, Kamalularifin Subari, Pawan Kumar Singh

Summary: Spoken Language Identification (SLID) is a well-researched field and an important first step in multilingual speech recognition systems. This study proposes a model for Indian and foreign language recognition, which enhances data to make it robust against everyday life noise and selects relevant features through feature extraction and selection algorithms. The model achieves high accuracy on three standard datasets, indicating that these features capture language specific characteristics of speech and can be used as standard features for SLID task.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Business, Finance

Impact of pandemic on development and demography in different continents and nations

Pawan Kumar Singh, Alok Kumar Pandey, Ravi Kiran, Rajiv Kumar Bhatt, Anushka Chouhan

Summary: This study collected information from 145 countries to predict the impact of COVID-19 cases, tests per million, and the proportion of people aged 65 and above on deaths per million at country and continent levels. It also evaluated the economic cost of these indicators in terms of reduction in GDP growth rate. The study found significant differences across continents and a negative association between tests per million and deaths per million. It provides valuable insights for assessing the impact of these indicators in the pandemic and informing policy formation and decision-making strategies.

INTERNATIONAL JOURNAL OF FINANCE & ECONOMICS (2023)

Article Environmental Sciences

Forecasting of non-renewable and renewable energy production in India using optimized discrete grey model

Alok Kumar Pandey, Pawan Kumar Singh, Muhammad Nawaz, Amrendra Kumar Kushwaha

Summary: Renewable energy plays an important role in providing reliable power supplies and diversifying fuel sources, while also helping to conserve natural resources. Solar energy has become increasingly prominent in India. This study forecasts the development of renewable energy and finds that wind power is growing faster than hydropower, solar energy, and bioenergy.

ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH (2023)

Article Plant Sciences

Enviromic-based kernels may optimize resource allocation with multi-trait multi-environment genomic prediction for tropical Maize

Raysa Gevartosky, Humberto Fanelli Carvalho, Germano Costa-Neto, Osval A. Montesinos-Lopez, Jose Crossa, Roberto Fritsche-Neto

Summary: This study aimed to design optimized training sets for genomic prediction considering multi-trait multi-environment trials and how those methods may increase accuracy reducing phenotyping costs. The combined use of genomic and enviromic data efficiently designs optimized training sets for genomic prediction, improving the response to selection per dollar invested.

BMC PLANT BIOLOGY (2023)

Article Biotechnology & Applied Microbiology

Two simple methods to improve the accuracy of the genomic selection methodology

Osval A. Montesinos-Lopez, Abelardo Kismiantini, Abelardo Montesinos-Lopez

Summary: Genomic selection (GS) is being revolutionized in plant and animal breeding, but its practical implementation faces challenges due to uncontrolled factors. To improve prediction accuracy, this paper proposes two methods: reformulating GS as a binary classification problem, and applying postprocessing to adjust the classification threshold. Both methods outperformed the conventional regression model, with the postprocessing method showing better results.

BMC GENOMICS (2023)

Article Biochemistry & Molecular Biology

Genomic Prediction of Resistance to Tan Spot, Spot Blotch and Septoria Nodorum Blotch in Synthetic Hexaploid Wheat

Guillermo Garcia-Barrios, Jose Crossa, Serafin Cruz-Izquierdo, Victor Heber Aguilar-Rincon, J. Sergio Sandoval-Islas, Tarsicio Corona-Torres, Nerida Lozano-Ramirez, Susanne Dreisigacker, Xinyao He, Pawan Kumar Singh, Rosa Angela Pacheco-Gil

Summary: Genomic prediction is used to predict breeding values based on molecular and phenotypic data. This study evaluated the performance of different models in predicting disease resistance in synthetic hexaploid wheat. The results showed that the combination of genomic and pedigree information (A+G BLUP) had the highest prediction accuracy, while the single trait and multi-trait models had similar accuracies. This suggests that the use of genomic information can improve breeding programs.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2023)

Article Genetics & Heredity

Multimodal deep learning methods enhance genomic prediction of wheat breeding

Abelardo Montesinos-Lopez, Carolina Rivera, Francisco Pinto, Francisco Pinera, David Gonzalez, Mathew Reynolds, Paulino Perez-Rodriguez, H. Li, Osval A. Montesinos-Lopez, Jose Crossa

Summary: By comparing a novel DL method with conventional GP models, this study found that DL method has higher accuracy in predicting genomic phenotypes in plant breeding research and can account for the complexity of genotype-environment interaction. However, traditional GP models can also achieve high accuracy in certain situations.

G3-GENES GENOMES GENETICS (2023)

Article Plant Sciences

Efficacy of plant breeding using genomic information

Osval A. Montesinos-Lopez, Alison R. Bentley, Carolina Saint Pierre, Leonardo Crespo-Herrera, Leonardo Rebollar-Ruellas, Patricia Edwigis Valladares-Celis, Morten Lillemo, Abelardo Montesinos-Lopez, Jose Crossa

Summary: Genomic selection (GS), proposed by Meuwissen et al. more than 20 years ago, is revolutionizing plant and animal breeding. In our study of 14 real datasets, we found that the average gain in prediction accuracy when genomic information is considered was 26.31%. The quality of the markers and relatedness of the individuals can greatly impact the increase in prediction accuracy.

PLANT GENOME (2023)

Article Environmental Sciences

Yield Adjustment Using GPR-Derived Spatial Covariance Structure in Cassava Field: A Preliminary Investigation

Afolabi Agbona, Osval A. Montesinos-Lopez, Mark E. Everett, Henry Ruiz-Guzman, Dirk B. Hays

Summary: Many aspects of below-ground plant performance are not fully understood, including their spatial and temporal dynamics in relation to environmental factors. In this study, Ground-Penetrating Radar (GPR) was evaluated for its potential in normalizing spatial heterogeneity and estimating fresh root yield in a cassava field trial. The results showed that the GPR-based autoregressive (AR) model outperformed other models, indicating the potential of GPR in non-destructive yield estimation and field spatial heterogeneity normalization in root and tuber crop programs.

REMOTE SENSING (2023)

Article Agronomy

Release of tepary bean cultivar 'USDA Fortuna' with improved disease and insect resistance, seed size, and culinary quality

Timothy G. Porch, Juan Carlos Rosas, Karen Cichy, Graciela Godoy Lutz, Iveth Rodriguez, Raphael W. Colbert, Gasner Demosthene, Juan Carlos Hernandez, Donna M. Winham, James S. Beaver

Summary: Tepary bean is a nutritious alternative to common bean in high temperature and drought-prone areas. USDA Fortuna cultivar has improved seed size and quality, resistant to diseases and pests, and shorter cooking time.

JOURNAL OF PLANT REGISTRATIONS (2023)

Article Agronomy

Designing optimal training sets for genomic prediction using adversarial validation with probit regression

Osval Montesinos-Lopez, Kismiantini, Abelardo Montesinos-Lopez

Summary: Genomic selection is revolutionizing animal and plant breeding, but its implementation faces challenges due to mismatch in training and testing set distributions. This research used the adversarial validation method with probit regression to address the distribution mismatch and select optimal training sets. Evaluations showed that the proposed method effectively detected the mismatch and outperformed existing methods, achieving higher prediction accuracy.

PLANT BREEDING (2023)

Article Multidisciplinary Sciences

Genetic variation among elite inbred lines suggests potential to breed for BNI-capacity in maize

Cesar D. Petroli, Guntur V. Subbarao, Juan A. Burgueno, Tadashi Yoshihashi, Huihui Li, Jorge Franco Duran, Kevin V. Pixley

Summary: A study found that maize root systems release glycosides that can inhibit the activity of nitrifiers and reduce soil nitrate formation in the root zone. Through genetic variation analysis, several maize varieties with high glycoside activity and the ability to release glycosides were identified, and genetic markers associated with these traits were found, providing the possibility of improving glycoside activity in maize through marker-assisted selection.

SCIENTIFIC REPORTS (2023)

Article Plant Sciences

Gene expression profiling of soaked dry beans (Phaseolus vulgaris L.) reveals cell wall modification plays a role in cooking time

Hannah R. R. Jeffery, Nyasha Mudukuti, Carol Robin Buell, Kevin L. L. Childs, Karen Cichy

Summary: Soaking dry beans before cooking can reduce the cooking time. This study identified gene expression patterns that are altered by soaking and compared gene expression in fast-cooking and slow-cooking beans. Genes related to cell wall growth and development, as well as hypoxic stress, were differentially expressed in slow-cooking beans after soaking.

PLANT GENOME (2023)

Article Mathematics

Identifying Genetic Signatures from Single-Cell RNA Sequencing Data by Matrix Imputation and Reduced Set Gene Clustering

Soumita Seth, Saurav Mallik, Atikul Islam, Tapas Bhadra, Arup Roy, Pawan Kumar Singh, Aimin Li, Zhongming Zhao

Summary: In this paper, a new framework is introduced to discover gene signatures from scRNA-seq data. The framework combines various strategies such as imputed matrix, MRMR feature selection, and shrinkage clustering. The results show that the proposed framework efficiently identifies differentially expressed stronger gene signatures and up-regulated markers in single-cell RNA sequencing data.

MATHEMATICS (2023)

暂无数据