4.7 Article Proceedings Paper

Gene2vec: distributed representation of genes based on co-expression

Journal

BMC GENOMICS
Volume 20

Publisher

BMC
DOI: 10.1186/s12864-018-5370-x

Keywords

Distributed representation; Gene2Vec; Gene co-expression; Embedding; Word2vec; Gene-gene interaction

Funding

  1. Cancer Prevention Research Institute of Texas (CPRIT) [RP160015]

Background: Existing functional descriptions of genes are categorical, discrete, and mostly produced through manual processes. In this work, we explore the idea of gene embedding, a distributed representation of genes, in the spirit of word embedding.

Results: In a purely data-driven fashion, we trained a 200-dimension vector representation of all human genes, using gene co-expression patterns in 984 data sets from the GEO databases. These vectors capture functional relatedness of genes in terms of recovering known pathways: the average inner product (similarity) of genes within a pathway is 1.52 times greater than that of random genes. Using t-SNE, we produced a gene co-expression map that shows local concentrations of tissue-specific genes. We also illustrated the usefulness of the embedded gene vectors, laden with rich information on gene co-expression patterns, in tasks such as gene-gene interaction prediction.

Conclusions: We proposed a machine learning method that utilizes transcriptome-wide gene co-expression to generate a distributed representation of genes. We further demonstrated the utility of our distributed representation by predicting gene-gene interaction based solely on gene names. The distributed representation of genes could be useful for more bioinformatics applications.
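The pathway comparison reported in the abstract (average inner product within a pathway versus random genes) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the gene names, the 8-dimension toy vectors, the module structure, and the helper names `mean_pairwise_similarity` and `pathway_to_random_ratio` are hypothetical and are not taken from the paper, which used 200-dimension vectors trained on GEO data.

```python
import itertools
import random


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def mean_pairwise_similarity(genes, embedding):
    """Average inner product over all unordered pairs of the given genes."""
    pairs = list(itertools.combinations(genes, 2))
    return sum(dot(embedding[a], embedding[b]) for a, b in pairs) / len(pairs)


def pathway_to_random_ratio(pathway, embedding, n_draws=200, seed=0):
    """Within-pathway similarity divided by the mean similarity of
    random gene sets of the same size (hypothetical evaluation helper)."""
    rng = random.Random(seed)
    all_genes = sorted(embedding)
    within = mean_pairwise_similarity(pathway, embedding)
    background = [
        mean_pairwise_similarity(rng.sample(all_genes, len(pathway)), embedding)
        for _ in range(n_draws)
    ]
    return within / (sum(background) / len(background))


# Toy data (assumption): 4 modules of 10 genes; each module shares a direction.
_rng = random.Random(42)
DIM = 8
toy_embedding = {}
for module in range(4):
    center = [0.5] * DIM
    center[2 * module] += 1.0
    center[2 * module + 1] += 1.0
    for i in range(10):
        name = f"G{module}_{i}"  # hypothetical gene identifier
        toy_embedding[name] = [c + _rng.gauss(0, 0.1) for c in center]

toy_pathway = [f"G0_{i}" for i in range(5)]  # genes from one module
ratio = pathway_to_random_ratio(toy_pathway, toy_embedding)
print(f"within-pathway / random similarity ratio: {ratio:.2f}")
```

A ratio above 1 means genes in the pathway are, on average, more similar to each other than randomly chosen genes are; the abstract reports 1.52 for the trained vectors, while the toy value printed here carries no biological meaning.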

Recommended

Review Immunology

Application of artificial intelligence and machine learning for HIV prevention interventions

Yang Xiang, Jingcheng Du, Kayo Fujimoto, Fang Li, John Schneider, Cui Tao

Summary: The goal to end the HIV epidemic presents various challenges, but artificial intelligence has shown great potential in developing effective prevention intervention strategies.

LANCET HIV (2022)

Article Immunology

Using a Machine Learning Approach to Monitor COVID-19 Vaccine Adverse Events (VAE) from Twitter Data

Andrew T. Lian, Jingcheng Du, Lu Tang

Summary: This study utilized machine learning and natural language processing to identify COVID-19 vaccine adverse events from Twitter data. The research found that the four most populous states in the US were the areas with the most discussions about adverse events on Twitter, and the most common adverse effects were sore to touch, fatigue, and headache. The findings demonstrate the feasibility of using social media data to monitor vaccine adverse events.

VACCINES (2022)

Article Biochemistry & Molecular Biology

Identifying candidate genes and drug targets for Alzheimer's disease by an integrative network approach using genetic and brain region-specific proteomic data

Andi Liu, Astrid M. Manuel, Yulin Dai, Brisa S. Fernandes, Nitesh Enduru, Peilin Jia, Zhongming Zhao

Summary: This study used large-scale proteomic datasets and GWAS data to investigate the molecular pathways and potential drug targets specific to different brain regions in Alzheimer's disease. By applying a network-based tool, the researchers identified specific module genes associated with AD and pinpointed three potential drug targets in the parahippocampal gyrus.

HUMAN MOLECULAR GENETICS (2022)

Article Engineering, Chemical

Immobilizing Fe0 nanoparticles on covalent organic framework towards enhancement of Cr(VI) removal by adsorption and reduction synergistic effect

Huizhen Shen, Li Chen, Cailong Zhou, Jingcheng Du, Chenyang Lu, Hao Yang, Luxi Tan, Xinjuan Zeng, Lichun Dong

Summary: In this study, nanoscale zero-valent iron (NZVI) was immobilized on porous TpPa-1 covalent organic frameworks (COFs) using dopamine as a connector, resulting in the formation of the Fe0/TpPa-1@DOPA composite. The composite exhibited excellent performance in Cr(VI) removal, making it a promising material for wastewater treatment.

SEPARATION AND PURIFICATION TECHNOLOGY (2022)

Article Genetics & Heredity

deCS: A Tool for Systematic Cell Type Annotations of Single-cell RNA Sequencing Data among Human Tissues

Guangsheng Pei, Fangfang Yan, Lukas M. Simon, Yulin Dai, Peilin Jia, Zhongming Zhao

Summary: Single-cell RNA sequencing (scRNA-seq) is revolutionizing the study of complex and dynamic cellular mechanisms. In this study, the researchers present deCS, an automatic cell type annotation method that enhances annotation accuracy using comprehensive human cell type expression profiles and marker genes. The results show that expanding the references is critical for improving annotation accuracy, and deCS significantly reduces computation time and increases accuracy compared to existing tools. The researchers also demonstrate the broad utility of deCS in identifying trait-cell type associations in human complex traits, providing deep insights into disease pathogenesis.

GENOMICS PROTEOMICS & BIOINFORMATICS (2023)

Article Oncology

CD73-Dependent Adenosine Signaling through Adora2b Drives Immunosuppression in Ductal Pancreatic Cancer

Erika Y. Faraoni, Kanchan Singh, Vidhi Chandra, Olivereen Le Roux, Yulin Dai, Ismet Sahin, Baylee J. O'Brien, Lincoln N. Strickland, Le Li, Emily Vucic, Amanda N. Warner, Melissa Pruski, Trent Clark, George Van Buren, Nirav C. Thosani, John S. Bynon, Curtis J. Wray, Dafna Bar-Sagi, Kyle L. Poulsen, Lana A. Vornik, Michelle I. Savage, Shizuko Sei, Altaf Mohammed, Zhongming Zhao, Powel H. Brown, Tingting Mills, Holger K. Eltzschig, Florencia McAllister, Jennifer M. Bailey-Lundberg

Summary: The microenvironment of PDAC is desmoplastic and immunosuppressive. CD73 is overexpressed in the tumor microenvironment and could be a target for immunotherapy.

CANCER RESEARCH (2023)

Article Biochemistry & Molecular Biology

Investigating cellular heterogeneity at the single-cell level by the flexible and mobile extrachromosomal circular DNA

Jiajinlong Kang, Yulin Dai, Jinze Li, Huihui Fan, Zhongming Zhao

Summary: This article reports the first single-cell level analysis of eccDNA in glioblastoma (GBM) samples. The study revealed the presence of potential mobile enhancers acting in a trans-regulation manner in GBM. This research provides insights into the novel features of eccDNA in the cellular context of brain tumors, highlighting the importance of investigating eccDNA at the single-cell level.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2023)

Article Genetics & Heredity

De novo mutations disturb early brain development more frequently than common variants in schizophrenia

Toshiyuki Itai, Peilin Jia, Yulin Dai, Jingchun Chen, Xiangning Chen, Zhongming Zhao

Summary: Investigating functional, temporal, and cell-type expression features of mutations is important for understanding schizophrenia. This study collected and analyzed mutations in schizophrenia patients and identified genes that are neurologically important and have specific expression patterns during prenatal development. Results suggest that gene expression patterns in specific cell types during early fetal stages may impact the risk of schizophrenia in adulthood.

AMERICAN JOURNAL OF MEDICAL GENETICS PART B-NEUROPSYCHIATRIC GENETICS (2023)

Article Materials Science, Multidisciplinary

Smart Solvent-Responsive Covalent Organic Framework Membranes with Self-regulating Pore Size

Ziye Song, Qian Sun, Jingcheng Du, Linghao Liu, Wen He, Yiming Xu, Jiangtao Liu

Summary: Covalent organic frameworks (COFs) are emerging as a promising class of porous materials with high porosity, low density, and excellent physicochemical stability. Smart COFs with special structures and functional groups have attracted considerable attention due to their ability to respond to external stimuli. However, fabricating smart COF membranes with adjustable pore sizes for gradient separation of organic pollutants remains a challenge.

ACS APPLIED POLYMER MATERIALS (2023)

Article Engineering, Chemical

Smart superwetting COF membrane for controllable oil/water separation

Qian Sun, Jingcheng Du, Linghao Wang, Ayan Yao, Ziye Song, Linghao Liu, Dong Cao, Ji Ma, Weiwang Lim, Wen He, Shabi Ul Hassan, Cailong Zhou, Jiangtao Liu

Summary: In this study, a COF membrane with unique super-wettability was prepared by assembling collected COF nanofibers through filtration. It showed controllable and switchable performance in oil/water separation, achieving high separation efficiency and permeance.

SEPARATION AND PURIFICATION TECHNOLOGY (2023)

Article Chemistry, Multidisciplinary

Nanoarchitectonics of carbon molecular sieve membranes with graphene oxide and polyimide for hydrogen purification

Wen He, Jingcheng Du, Linghao Liu, Qian Sun, Ziye Song, Ji Ma, Dong Cao, Weiwang Lim, Shabi Ul Hassan, Jiangtao Liu

Summary: Graphene oxide (GO) tuned polyimide carbon molecular sieve (CMS) membranes were prepared by carbonization, showing high permeability, selectivity, and stability. Gas sorption capability increased with carbonization temperature, as higher temperatures created more micropores under GO guidance. GO guidance and subsequent carbonization enhanced H2 permeability and selectivity, surpassing state-of-the-art materials. The CMS membranes transitioned from a polymeric structure to a denser graphite structure with increasing carbonization temperature, achieving ultrahigh selectivities for various gas pairs while maintaining moderate H2 gas permeabilities.

RSC ADVANCES (2023)

Proceedings Paper Computer Science, Artificial Intelligence

IMI-CDE: an interactive interface for collaborative mapping of study variables to common data elements

Shiqiang Tao, Wei-Chun Chou, Jianfu Li, Jingcheng Du, Pritham Ram, Rashmie Abeysinghe, Hua Xu, Xiaoqian Jiang, Peter W. Rose, Lucila Ohno-Machado, Guo-Qiang Zhang

Summary: The National Institutes of Health (NIH) has launched the RADx Radical research collaboratives (RADx-rad) to advance new, non-traditional approaches for COVID-19 testing. The authors developed the web application IMI-CDE to help researchers map study variables to common data elements (CDEs), increasing data interoperability. The application has been piloted with positive feedback from beta-testers.

2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

LANN: an Integrated Online Annotation Tool for Information Extraction

Jingqi Wang, Yaoyun Zhang, Bin Lin, Huy Anh Pham, Long He, Jingcheng Du, Frank Manion

Summary: The creation of high-quality annotated corpora is crucial for the development of machine and deep-learning models for Information Extraction and Natural Language Processing. This paper presents LANN, a text annotation tool that supports team-based annotation, quality controls, and machine learning assistance throughout the annotation project workflow.

2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022) (2022)

Article Mathematical & Computational Biology

Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations

Qingyu Chen, Alexis Allot, Robert Leaman, Rezarta Islamaj, Jingcheng Du, Li Fang, Kai Wang, Shuo Xu, Yuefu Zhang, Parsa Bagherzadeh, Sabine Bergler, Aakash Bhatnagar, Nidhir Bhavsar, Yung-Chun Chang, Sheng-Jie Lin, Wentai Tang, Hongtong Zhang, Ilija Tavchioski, Senja Pollak, Shubo Tian, Jinfeng Zhang, Yulia Otmakhova, Antonio Jimeno Yepes, Hang Dong, Honghan Wu, Richard Dufour, Yanis Labrak, Niladri Chatterjee, Kushagri Tandon, Frejus A. A. Laleye, Loic Rakotoson, Emmanuele Chersoni, Jinghang Gu, Annemarie Friedrich, Subhash Chandra Pujari, Mariia Chizhikova, Naveen Sivadasan, V. G. Saipradeep, Zhiyong Lu

Summary: The COVID-19 pandemic has had a severe impact on global society, leading to a rapid growth in related literature. To address the challenges of manual curation and interpretation, the BioCreative LitCovid track called for a community effort to automate topic annotation. Nineteen teams participated, achieving higher scores compared to existing methods.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2022)
