4.6 Article

A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants

期刊

PLOS ONE
卷 9, 期 6, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0101025

关键词

-

资金

  1. Ministry of Agriculture of China
  2. National Natural Science Foundation of China [31370043, 31272414, 31101706]
  3. National 948 Project of China [2012-Z26, 2011-G2A]

向作者/读者索取更多资源

We report a novel algorithm, iBLUP, to impute missing genotypes by simultaneously and comprehensively using identity by descent and linkage disequilibrium information. The simulation studies showed that the algorithm exhibited drastically tolerance to high missing rate, especially for rare variants than other common imputation methods, e. g. BEAGLE and fastPHASE. At a missing rate of 70%, the accuracy of BEAGLE and fastPHASE dropped to 0.82 and 0.74 respectively while iBLUP retained an accuracy of 0.95. For minor allele, the accuracy of BEAGLE and fastPHASE decreased to -0.1 and 0.03, while iBLUP still had an accuracy of 0.61. We implemented the algorithm in a publicly available software package also named iBLUP. The application of iBLUP for processing real sequencing data in an outbred pig population was demonstrated.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Plant Sciences

E183K Mutation in Chalcone Synthase C2 Causes Protein Aggregation and Maize Colorless

Haixiao Dong, He Li, Yingjie Xue, Shengzhong Su, Shipeng Li, Xiaohui Shan, Hongkui Liu, Nan Jiang, Xuyang Wu, Zhiwu Zhang, Yaping Yuan

Summary: A novel colorless maize mutant was identified and characterized, showing reduced pigmentation throughout its life cycle. The E183K mutation in maize chalcone synthase C2 was identified as the causal mutation for the colorless phenotype. This mutation led to protein aggregation of chalcone synthase, providing insight into protein aggregation in plants.

FRONTIERS IN PLANT SCIENCE (2021)

Article Genetics & Heredity

GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction

Jiabo Wang, Zhiwu Zhang

Summary: Genome-wide association study (GWAS) and genomic prediction/selection (GP/GS) are two essential tasks in genomic research. GAPIT, a widely-used integrated tool, has recently introduced multi-locus test methods to enhance statistical power and prediction accuracy.

GENOMICS PROTEOMICS & BIOINFORMATICS (2021)

Letter Biochemistry & Molecular Biology

Genetics-inspired data-driven approaches explain and predict crop performance fluctuations attributed to changing climatic conditions

Xianra Li, Tingting Guo, Guihua Bai, Zhiwu Zhang, Deven See, Juliet Marshall, Kimberly A. Garland-Campbell, Jianming Yu

MOLECULAR PLANT (2022)

Article Plant Sciences

Comparison of Single-Trait and Multi-Trait Genome-Wide Association Models and Inclusion of Correlated Traits in the Dissection of the Genetic Architecture of a Complex Trait in a Breeding Program

Lance F. Merrick, Adrienne B. Burke, Zhiwu Zhang, Arron H. Carter

Summary: This study aimed to dissect the genetic architecture of seedling emergence in wheat using one multi-trait genome-wide association study (MT-GWAS) and three single-trait GWAS (ST-GWAS) models. The ST-GWAS models identified 107 significant markers across 19 chromosomes, while the MT-GWAS identified 82 significant markers across 14 chromosomes. The FarmCPU and BLINK models were able to identify both small effect markers and large effect markers on chromosome 5A.

FRONTIERS IN PLANT SCIENCE (2022)

Review Genetics & Heredity

Prospectus of Genomic Selection and Phenomics in Cereal, Legume and Oilseed Breeding Programs

Karansher S. Sandhu, Lance F. Merrick, Sindhuja Sankaran, Zhiwu Zhang, Arron H. Carter

Summary: The adoption of genomic selection and phenomics tools in plant breeding programs has significantly increased in the past decade. Genomic selection has shown potential in selecting superior genotypes with high precision and accelerating the breeding cycle, while phenomics aims to alleviate phenotyping bottlenecks and explore new large-scale phenotyping and data acquisition methods. This review discusses the lessons learned from genomic selection and phenomics in six self-pollinated crops and their implementation schemes, with a focus on rice, wheat, soybean, common bean, chickpea, and groundnut.

FRONTIERS IN GENETICS (2022)

Article Genetics & Heredity

Genome Assembly of Alfalfa Cultivar Zhongmu-4 and Identification of SNPs Associated with Agronomic Traits

Ruicai Long, Fan Zhang, Zhiwu Zhang, Mingna Li, Lin Chen, Xue Wang, Wenwen Liu, Tiejun Zhang, Long-Xi Yu, Fei He, Xueqian Jiang, Xijiang Yang, Changfu Yang, Zhen Wang, Junmei Kang, Qingchuan Yang

Summary: This study reported the chromosome-level genome sequence and genomic variation database of Zhongmu-4, a widely cultivated alfalfa cultivar in China, as well as the polymorphic genes associated with agronomic traits. The expansion of gene families and increase of repetitive elements observed in Zhongmu-4 genome may contribute to its larger genome size compared to Medicago truncatula. The population structure analysis revealed the influence of geography on the genetic divergence of alfalfa. The findings of this study will facilitate genetic research and genomics-assisted breeding for alfalfa variety improvement.

GENOMICS PROTEOMICS & BIOINFORMATICS (2022)

Review Food Science & Technology

As the number falls, alternatives to the Hagberg-Perten falling number method: A review

Yang Hu, Stephanie M. Sjoberg, Chunpen (James) Chen, Amber L. Hauvermale, Craig F. Morris, Stephen R. Delwiche, Ashley E. Cannon, Camille M. Steber, Zhiwu Zhang

Summary: This review examines the application, limitations, and potential alternatives to the Hagberg-Perten falling number (FN) method used in the global wheat industry. The FN test indirectly detects the presence of alpha-amylase, which is responsible for starch degradation. Low FN/high alpha-amylase in wheat grains results in poor end-product quality and significant losses for various stakeholders. The FN method has limitations such as sampling variability, high cost, labor intensiveness, destructive nature, and inability to differentiate between different causes of low FN. Seeking faster, cheaper, and more accurate alternatives can improve breeding and avoid inadvertent mixing of high- and low-FN grain, thus preserving the value of wheat grain.

COMPREHENSIVE REVIEWS IN FOOD SCIENCE AND FOOD SAFETY (2022)

Article Agriculture, Dairy & Animal Science

VTag: a semi-supervised pipeline for tracking pig activity with a single top-view camera

Chun-Peng J. Chen, Gota Morota, Kiho Lee, Zhiwu Zhang, Hao Cheng

Summary: Precision livestock farming in the swine industry focuses on using computer vision technology to automate monitoring pig activity, but the performance is highly dependent on the quality of imagery features and requires intensive effort in labeling ground truths for training datasets.

JOURNAL OF ANIMAL SCIENCE (2022)

Article Agronomy

A new strategy for using historical imbalanced yield data to conduct genome-wide association studies and develop genomic prediction models for wheat breeding

Chenggen Chu, Shichen Wang, Jackie C. Rudd, Amir M. H. Ibrahim, Qingwu Xue, Ravindra N. Devkota, Jason A. Baker, Shannon Baker, Bryan Simoneaux, Geraldine Opena, Haixiao Dong, Xiaoxiao Liu, Kirk E. Jessup, Ming-Shun Chen, Kele Hui, Richard Metz, Charles D. Johnson, Zhiwu S. Zhang, Shuyu Liu

Summary: Using genome-wide association studies and high throughput genotyping based on sequencing techniques can improve the accuracy of predicting and selecting new varieties using imbalanced historical yield data.

MOLECULAR BREEDING (2022)

Article Plant Sciences

Detection of Breeding-Relevant Fruit Cracking and Fruit Firmness Quantitative Trait Loci in Sweet Cherry via Pedigree-Based and Genome-Wide Association Approaches

William Wesley Crump, Cameron Peace, Zhiwu Zhang, Per McCord

Summary: This study aims to improve fruit cracking incidence and fruit firmness in sweet cherry through DNA-informed breeding. Four stable QTLs related to fruit cracking and firmness were identified, and the ancestral sources of functional SNP haplotypes were traced.

FRONTIERS IN PLANT SCIENCE (2022)

Article Multidisciplinary Sciences

Graph pangenome captures missing heritability and empowers tomato breeding

Yao Zhou, Zhiyang Zhang, Zhigui Bao, Hongbo Li, Yaqing Lyu, Yanjun Zan, Yaoyao Wu, Lin Cheng, Yuhan Fang, Kun Wu, Jinzhe Zhang, Hongjun Lyu, Tao Lin, Qiang Gao, Surya Saha, Lukas Mueller, Zhangjun Fei, Thomas Stadler, Shizhong Xu, Zhiwu Zhang, Doug Speed, Sanwen Huang

Summary: Constructing a graph pangenome of tomato can improve the estimation of heritability for complex traits, identify more causal structural variants, and facilitate the identification of genetic factors underlying agronomically important traits. This study advances our understanding of the heritability of complex traits and demonstrates the power of the graph pangenome in crop breeding.

NATURE (2022)

Article Plant Sciences

Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa

Fan Zhang, Junmei Kang, Ruicai Long, Mingna Li, Yan Sun, Fei He, Xueqian Jiang, Changfu Yang, Xijiang Yang, Jie Kong, Yiwen Wang, Zhen Wang, Zhiwu Zhang, Qingchuan Yang

Summary: Fall dormancy (FD) is a crucial trait for alfalfa cultivar selection. We conducted genomic prediction of FD using machine learning methods and achieved high prediction accuracy.

HORTICULTURE RESEARCH (2023)

Article Agriculture, Multidisciplinary

Affordable High Throughput Field Detection of Wheat Stripe Rust Using Deep Learning with Semi-Automated Image Labeling

Zhou Tang, Meinan Wang, Michael Schirrmann, Xianran Li, Robert Brueggeman, Sindhuja Sankaran, Arron H. Carter, Michael O. Pumphrey, Yang Hu, Xianming Chen, Zhiwu Zhang

Summary: In this study, a neural network-based image classifier called RustNet was developed to efficiently monitor wheat fields for stripe rust. The model uses deep learning to process images and videos captured from affordable devices, enabling high-throughput pheno-typing for early detection of rust and improved control efficiency.

COMPUTERS AND ELECTRONICS IN AGRICULTURE (2023)

Article Mycology

Large-scale genome investigations reveal insights into domestication of cultivated mushrooms

Y. P. Fu, Y. T. Dai, K. W. T. Chethana, Z. H. Li, L. Sun, C. T. Li, H. L. Yu, R. H. Yang, Q. Tan, D. P. Bao, Y. J. Deng, S. X. Wang, Y. F. Wang, F. H. Tian, L. L. Qi, L. L. Shu, P. S. Jia, L. C. Chen, M. Y. Chen, Q. X. Hu, H. Tan, T. T. Song, Z. W. Zhang, G. Bonito, G. Zervakis, S. J. Xiao, K. D. Hyde, Y. Li, X. H. Yuan

Summary: This study presents high-quality genomes, transcriptomes, and re-sequencing data of edible and medicinal mushrooms, and constructs an integrated omics database (MushDB) for a better understanding of mushroom domestication. Using multi-omics data, the researchers identified potential functional genes contributing to mushroom domestication, including genes involved in starch and sucrose metabolism and mitogen-activated protein kinase signaling pathway. The function of one key gene in low temperature adaptation and cultivation environment was validated using CRISPR/Cas9 system.

MYCOSPHERE (2022)

Article Multidisciplinary Sciences

Genomic mosaicism due to homoeologous exchange generates extensive phenotypic diversity in nascent allopolyploids

Ying Wu, Fan Lin, Yao Zhou, Jie Wang, Shuai Sun, Bin Wang, Zhibin Zhang, Guo Li, Xiuyun Lin, Xutong Wang, Yue Sun, Qianli Dong, Chunming Xu, Lei Gong, Jonathan F. Wendel, Zhiwu Zhang, Bao Liu

Summary: This study found that homoeologous exchanges (HEs) in allopolyploids can lead to extensive genomic variation, affecting known-function genes and their complex genetic interactions, thereby creating extraordinary phenotypic diversity at the population level.

NATIONAL SCIENCE REVIEW (2021)

暂无数据