4.7 Article

HiG2Vec: hierarchical representations of Gene Ontology and genes in the Poincare ball

期刊

BIOINFORMATICS
卷 37, 期 18, 页码 2971-2980

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab193

关键词

-

资金

  1. National Research Foundation of Korea - Korea government (MSIT) [NRF-2019R1A2C1006608]
  2. ITRC (Information Technology Research Center) support program [IITP-2020-2018-0-01431]
  3. National Institutes of Health [S10OD023495]

向作者/读者索取更多资源

The study proposes hierarchical representations of GO and genes using Poincare embedding, which can better capture the hierarchical structure and predict gene interactions compared to previous studies. HiG2Vec demonstrates superior performance in capturing the semantics of GO and genes and in data utilization, making it robust for various biological knowledge manipulation.
Motivation: Knowledge manipulation of Gene Ontology (GO) and Gene Ontology Annotation (GOA) can be done primarily by using vector representation of GO terms and genes. Previous studies have represented GO terms and genes or gene products in Euclidean space to measure their semantic similarity using an embedding method such as the Word2Vec-based method to represent entities as numeric vectors. However, this method has the limitation that embedding large graph-structured data in the Euclidean space cannot prevent a loss of information of latent hierarchies, thus precluding the semantics of GO and GOA from being captured optimally. On the other hand, hyperbolic spaces such as the Poincare balls are more suitable for modeling hierarchies, as they have a geometric property in which the distance increases exponentially as it nears the boundary because of negative curvature. Results: In this article, we propose hierarchical representations of GO and genes (HiG2Vec) by applying Poincare embedding specialized in the representation of hierarchy through a two-step procedure: GO embedding and gene embedding. Through experiments, we show that our model represents the hierarchical structure better than other approaches and predicts the interaction of genes or gene products similar to or better than previous studies. The results indicate that HiG2Vec is superior to other methods in capturing the GO and gene semantics and in data utilization as well. It can be robustly applied to manipulate various biological knowledge.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Cardiac & Cardiovascular Systems

A deep learning model for screening type 2 diabetes from retinal photographs

Jae-Seung Yun, Jaesik Kim, Sang-Hyuk Jung, Seon-Ah Cha, Seung-Hyun Ko, Yu-Bae Ahn, Hong-Hee Won, Kyung-Ah Sohn, Dokyoon Kim

Summary: This study developed and evaluated a deep learning algorithm for screening type 2 diabetes using retinal images. The algorithm showed improved risk stratification when combined with traditional risk factors, making it a useful tool for identifying individuals at high risk of type 2 diabetes.

NUTRITION METABOLISM AND CARDIOVASCULAR DISEASES (2022)

Article Biology

NETMAGE: A human disease phenotype map generator for the network-based visualization of phenome-wide association study results

Vivek Sriram, Manu Shivakumar, Sang-Hyuk Jung, Yonghyun Nam, Lisa Bang, Anurag Verma, Seunggeun Lee, Eun Kyung Choe, Dokyoon Kim

Summary: This article introduces a tool for generating disease-disease network visualizations based on summary statistics from phenome-wide association studies. The tool was used to construct a network to study the relationships between human diseases. By analyzing this network, potential genetic explanations for the relationships between diseases can be identified, and a better understanding of the underlying architecture of human diseases can be achieved. This tool can help researchers identify potential genetic targets for drug design and contribute to the exploration of personalized medicine.

GIGASCIENCE (2022)

Article Computer Science, Hardware & Architecture

An ensemble approach to anomaly detection using high- and low-variance principal components

Jeong-Hyeon Moon, Jun-Hyung Yu, Kyung-Ah Sohn

Summary: This paper proposes a novel ensemble learning approach for anomaly detection by extracting specific features to improve overall accuracy and detect diverse types of unknown attacks. The method also considers unexpressed variations in the training data and can be deployed in various CPS environments.

COMPUTERS & ELECTRICAL ENGINEERING (2022)

Article Endocrinology & Metabolism

Dissecting the clinical relevance of polygenic risk score for obesity-a cross-sectional, longitudinal analysis

Eun Kyung Choe, Manu Shivakumar, Seung Mi Lee, Anurag Verma, Dokyoon Kim

Summary: This study used genotyping and health data to investigate the association between polygenic risk score (PRS) for obesity and obesity-related diseases. The results showed that low PRS may be a risk factor for metabolically unhealthy lean body.

INTERNATIONAL JOURNAL OF OBESITY (2022)

Article Computer Science, Artificial Intelligence

Efficient deep neural network for photo-realistic image super-resolution

Namhyuk Ahn, Byungkon Kang, Kyung-Ah Sohn

Summary: This study focuses on the application of deep learning models in single-image super-resolution and proposes an architecture design that enhances network performance through cascading mechanism and feature fusion. By implementing group convolution and recursive schemes, the model achieves high efficiency and improves output quality. The research shows that the method performs well in various tasks.

PATTERN RECOGNITION (2022)

Article Computer Science, Information Systems

Attention-guided residual frame learning for video anomaly detection

Jun-Hyung Yu, Jeong-Hyeon Moon, Kyung-Ah Sohn

Summary: This paper proposes a new framework for video anomaly detection by designing a convolutional long short-term memory-based model that emphasizes semantic objects using self-attention mechanisms and concatenation operations to improve performance. The experiments demonstrated that our framework outperformed previous models on the Chinese University of Hong Kong dataset.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

Dataset Bias Prediction for Few-Shot Image Classification

Jang Wook Kim, So Yeon Kim, Kyung-Ah Sohn

Summary: Dataset bias is a significant obstacle in image classification, particularly in few-shot learning with limited samples per class. To address this, we propose a bias prediction network that recovers biases from image data features, improving few-shot image classification performance. Our method trains the framework to extract features that are difficult for the bias prediction network to recover. We evaluate our approach on multiple benchmark datasets and integrate it with existing few-shot learning models, showing improved performance in different scenarios. The proposed bias prediction model is compatible with other few-shot learning models and applicable to real-world applications with biased samples.

ELECTRONICS (2023)

Article Computer Science, Artificial Intelligence

Deep Multi-Modal Network Based Automated Depression Severity Estimation

Md Azher Uddin, Joolekha Bibi Joolee, Kyung-Ah Sohn

Summary: This paper proposes a novel deep multi-modal framework that effectively utilizes facial and verbal cues for automated depression assessment. By analyzing audio and video data and applying specific algorithms and strategies, this method can diagnose depression more accurately than other existing methods.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Article Biotechnology & Applied Microbiology

ClearF plus plus : Improved Supervised Feature Scoring Using Feature Clustering in Class-Wise Embedding and Reconstruction

Sehee Wang, So Yeon Kim, Kyung-Ah Sohn

Summary: Feature selection methods are crucial for accurate disease classification and identifying informative biomarkers. ClearF++ addresses the limitations of previous methods by using reconstruction error from low-dimensional embeddings as a proxy for entropy term and incorporating feature-wise clustering. It outperforms other commonly used methods in terms of prediction accuracy and stability, making it valuable for biomedical data analysis.

BIOENGINEERING-BASEL (2023)

Article Computer Science, Information Systems

CEAT: Curvature Feature Extractor Using Action Based Triplet Learning for Action Segmentation

Seungmin Jang, Jeong-Hyeon Moon, So Yeon Kim, Kyung-Ah Sohn

Summary: With the increasing number of untrimmed videos on the internet, there is a growing demand for advanced action segmentation methods that can accurately localize sequences within lengthy videos. Traditional approaches have tried to address the issue of over-segmentation by smoothing consecutive frame predictions, but this may overlook important spatio-temporal characteristics. To address these challenges more effectively, we propose a novel approach that constructs a geometric curve based on frame-wise embeddings and extracts curvature features. Experimental results show that incorporating curvature information into existing action segmentation models can significantly enhance performance.

IEEE ACCESS (2023)

Article Biochemical Research Methods

Discovering comorbid diseases using an inter-disease interactivity network based on biobank-scale PheWAS data

Yonghyun Nam, Sang-Hyuk Jung, Jae-Seung Yun, Vivek Sriram, Pankhuri Singhal, Marta Byrska-Bishop, Anurag Verma, Hyunjung Shin, Woong-Yang Park, Hong-Hee Won, Dokyoon Kim

Summary: Understanding comorbidity is crucial for disease prevention and treatment. In this study, we introduce the use of an inter-disease interactivity network to discover and prioritize comorbidities. By considering phenotype associations, we develop a comorbidity scoring algorithm and predict the priority of comorbid diseases. The findings highlight the importance of considering interaction when predicting comorbidity.

BIOINFORMATICS (2023)

Article Computer Science, Artificial Intelligence

Decomposing texture and semantic for out-of-distribution detection

Jeong-Hyeon Moon, Namhyuk Ahn, Kyung-Ah Sohn

Summary: This article discusses the impact of distribution mismatch on the reliability of AI systems and proposes a solution by decomposing the definition of in-distribution into texture and semantics. A new benchmark is introduced to measure the robustness and precision of OOD detection methods, and a divide-and-conquer approach is presented to achieve a balance between performance and robustness.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

暂无数据