4.7 Article

FastHiC: a fast and accurate algorithm to detect long-range chromosomal interactions from Hi-C data

Journal

BIOINFORMATICS
Volume 32, Issue 17, Pages 2692-2695

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btw240

Keywords

-

Funding

  1. NHGRI NIH HHS [R01 HG006703, R01 HG006292] Funding Source: Medline
  2. NIDDK NIH HHS [U54 DK107977] Funding Source: Medline

Ask authors/readers for more resources

Motivation: How chromatin folds in three-dimensional (3D) space is closely related to transcription regulation. As powerful tools to study such 3D chromatin conformation, the recently developed Hi-C technologies enable a genome-wide measurement of pair-wise chromatin interaction. However, methods for the detection of biologically meaningful chromatin interactions, i.e. peak calling, from Hi-C data, are still under development. In our previous work, we have developed a novel hidden Markov random field (HMRF) based Bayesian method, which through explicitly modeling the non-negligible spatial dependency among adjacent pairs of loci manifesting in high resolution Hi-C data, achieves substantially improved robustness and enhanced statistical power in peak calling. Superior to peak callers that ignore spatial dependency both methodologically and in performance, our previous Bayesian framework suffers from heavy computational costs due to intensive computation incurred by modeling the correlated peak status of neighboring loci pairs and the inference of hidden dependency structure. Results: In this work, we have developed FastHiC, a novel approach based on simulated field approximation, which approximates the joint distribution of the hidden peak status by a set of independent random variables, leading to more tractable computation. Performance comparisons in real data analysis showed that FastHiC not only speeds up our original Bayesian method by more than five times, bus also achieves higher peak calling accuracy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemistry & Molecular Biology

Copper Changes Intestinal Microbiota of the Cecum and Rectum in Female Mice by 16S rRNA Gene Sequencing

Sufang Cheng, Huirong Mao, Yezhao Ruan, Cong Wu, Zheng Xu, Guoliang Hu, Xiaoquan Guo, Caiying Zhang, Huabin Cao, Ping Liu

BIOLOGICAL TRACE ELEMENT RESEARCH (2020)

Article Integrative & Complementary Medicine

Traditional Chinese Medicine Prescriptions Decrease Diarrhea Rate by Relieving Colonic Inflammation and Ameliorating Caecum Microbiota in Piglets

Jian Chen, Yaqing Mao, Chenghong Xing, Ruiming Hu, Zheng Xu, Huabin Cao, Junrong Luo

EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE (2020)

Article Plant Sciences

Non-homology-based prediction of gene functions in maize (Zea mays ssp. mays)

Xiuru Dai, Zheng Xu, Zhikai Liang, Xiaoyu Tu, Silin Zhong, James C. Schnable, Pinghua Li

PLANT GENOME (2020)

Article Environmental Sciences

Investigation of the effects of dichlorvos poisoning on AMPK signaling pathway in chicken brain tissues

Yanyu Xiao, Xibang Zheng, Guyue Li, Changming Zhou, Cong Wu, Zheng Xu, Guoliang Hu, Xiaoquan Guo, Lin Li, Huabin Cao, Vincent Latigo, Ping Liu

ENVIRONMENTAL POLLUTION (2020)

Article Biochemistry & Molecular Biology

Robust Hi-C Maps of Enhancer-Promoter Interactions Reveal the Function of Non-coding Genome in Neural Development and Diseases

Leina Lu, Xiaoxiao Liu, Wei-Kai Huang, Paola Giusti-Rodriguez, Jian Cui, Shanshan Zhang, Wanying Xu, Zhexing Wen, Shufeng Ma, Jonathan D. Rosen, Zheng Xu, Cynthia F. Bartels, Riki Kawaguchi, Ming Hu, Peter C. Scacheri, Zhili Rong, Yun Li, Patrick F. Sullivan, Hongjun Song, Guo-li Ming, Yan Li, Fulai Jin

MOLECULAR CELL (2020)

Article Critical Care Medicine

Clinical characteristics and outcomes of critically ill patients with novel coronavirus infectious disease (COVID-19) in China: a retrospective multicenter study

Jianfeng Xie, Wenjuan Wu, Shusheng Li, Yu Hu, Ming Hu, Jinxiu Li, Yi Yang, Tingrong Huang, Kun Zheng, Yishan Wang, Hanyujie Kang, Yingzi Huang, Li Jiang, Wei Zhang, Ming Zhong, Ling Sang, Xia Zheng, Chun Pan, Ruiqiang Zheng, Xuyan Li, Zhaohui Tong, Haibo Qiu, Bin Du

INTENSIVE CARE MEDICINE (2020)

Article Agriculture, Dairy & Animal Science

Dysregulated expression of mRNA and SNP in pulmonary artery remodeling in ascites syndrome in broilers

Sufang Cheng, Xin Liu, Pei Liu, Guyue Li, Xiaoquan Guo, Guoliang Hu, Lin Li, Cong Wu, Zheng Xu, Qi Zhou, Jialin Jiang, Shixian Luo, Huajun Huang, Ping Liu

Summary: Broiler ascites syndrome is a metabolic disorder observed worldwide in fast-growing broilers, with pulmonary arterial remodeling being a key step. This study analyzed SNP loci of genes associated with pulmonary artery remodeling in AS progression, shedding light on the molecular mechanism underlying AS progression.

POULTRY SCIENCE (2021)

Article Biochemistry & Molecular Biology

An Empirical Bayes approach for the identification of long-range chromosomal interaction from Hi-C data

Qi Zhang, Zheng Xu, Yutong Lai

Summary: The study introduces an empirical Bayes approach (EBHiC) for peak detection from Hi-C data, providing flexible over-dispersion modeling by including true interaction intensities as latent variables.

STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY (2021)

Article Mathematics

Logistic Regression Based on Individual-Level Predictors and Aggregate-Level Responses

Zheng Xu

Summary: This paper proposes estimation methods for logistic regression based on individual-level predictors and aggregate-level responses. Likelihood of logistic models in this situation is derived and estimators with different optimization methods are proposed. Simulation studies and a real data-based study are conducted to evaluate and compare the performance of the different estimators.

MATHEMATICS (2023)

Article Mathematics

Next-Generation Sequencing Data-Based Association Testing of a Group of Genetic Markers for Complex Responses Using a Generalized Linear Model Framework

Zheng Xu, Song Yan, Cong Wu, Qing Duan, Sixia Chen, Yun Li

Summary: This study developed NGS data-based methods for association studies, filling the gap in the literature. Simulation studies showed that NGS data-based methods have better performance than genotype-based methods for handling binary and count responses, especially when sequencing depth is low.

MATHEMATICS (2023)

Article Mathematics

Association Testing of a Group of Genetic Markers Based on Next-Generation Sequencing Data and Continuous Response Using a Linear Model Framework

Zheng Xu

Summary: This paper extends methods for testing the association between a continuous response variable and a group of common or rare genetic variants without the need for genotype calling. The proposed NGS data-based methods, derived from a linear model framework, show better statistical power compared to genotype-based methods, with improved performance as sequencing depth increases.

MATHEMATICS (2023)

Article Mathematics, Interdisciplinary Applications

Efficient Two-Stage Analysis for Complex Trait Association with Arbitrary Depth Sequencing Data

Zheng Xu, Song Yan, Shuai Yuan, Cong Wu, Sixia Chen, Zifang Guo, Yun Li

Summary: Sequencing-based genetic association analysis typically involves creating genotype calls from sequence data and then performing association tests. This study proposes a computationally efficient two-stage approach which screens single-nucleotide polymorphisms (SNPs) in the first stage using a rapid maximum likelihood (ML)-based method directly on sequence data, and then evaluates the selected SNPs in the second stage by performing association tests on genotypes from multi-sample linkage disequilibrium (LD)-aware calling. Simulation and real data-based studies demonstrate that this approach can save computational costs by 80% while still achieving over 90% power compared to the traditional method of genotyping all markers.

STATS (2023)

Article Mathematics, Interdisciplinary Applications

Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research

Sixia Chen, Alexandra May Woodruff, Janis Campbell, Sara Vesely, Zheng Xu, Cuyler Snider

Summary: Nonprobability samples are commonly used in various fields such as public health, economics, education, and political polls. However, naive estimates based on nonprobability samples may suffer from serious selection bias. Mass imputation methods have been shown to effectively improve the representativeness of nonprobability samples.

STATS (2023)

Article Ecology

Duration of usage and farmer reported benefits of conservation tillage

A. E. Saak, T. Wang, Z. Xu, D. Kolady, J. D. Ulrich-Schad, D. Clay

Summary: The study found that it takes 5-10 years for conservation tillage to be profitable, with higher adoption and positive outcomes in areas with less precipitation. Adoption subsidies should decrease with duration of usage in adopted areas, and monetary incentives may be needed in areas with higher rainfall to encourage conservation tillage adoption.

JOURNAL OF SOIL AND WATER CONSERVATION (2021)

Article Agronomy

Semantic Segmentation of Sorghum Using Hyperspectral Data Identifies Genetic Associations

Chenyong Miao, Alejandro Pages, Zheng Xu, Eric Rodene, Jinliang Yang, James C. Schnable

PLANT PHENOMICS (2020)

No Data Available