4.6 Article Proceedings Paper

Annotation of protein residues based on a literature analysis: cross-validation against UniProtKb

期刊

BMC BIOINFORMATICS
卷 10, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/1471-2105-10-S8-S4

关键词

-

向作者/读者索取更多资源

Background: A protein annotation database, such as the Universal Protein Resource knowledge base (UniProtKb), is a valuable resource for the validation and interpretation of predicted 3D structure patterns in proteins. Existing studies have focussed on point mutation extraction methods from biomedical literature which can be used to support the time consuming work of manual database curation. However, these methods were limited to point mutation extraction and do not extract features for the annotation of proteins at the residue level. Results: This work introduces a system that identifies protein residues in MEDLINE abstracts and annotates them with features extracted from the context written in the surrounding text. MEDLINE abstract texts have been processed to identify protein mentions in combination with taxonomic species and protein residues (F1-measure 0.52). The identified protein-species-residue triplets have been validated and benchmarked against reference data resources (UniProtKb, average F1-measure of 0.54). Then, contextual features were extracted through shallow and deep parsing and the features have been classified into predefined categories (F1-measure ranges from 0.15 to 0.67). Furthermore, the feature sets have been aligned with annotation types in UniProtKb to assess the relevance of the annotations for ongoing curation projects. Altogether, the annotations have been assessed automatically and manually against reference data resources. Conclusion: This work proposes a solution for the automatic extraction of functional annotation for protein residues from biomedical articles. The presented approach is an extension to other existing systems in that a wider range of residue entities are considered and that features of residues are extracted as annotations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Oncology

Stage-based Variation in the Effect of Primary Tumor Side on All Stages of Colorectal Cancer Recurrence and Survival

Margaret M. Lee, Andrew MacKinlay, Christine Semira, Christine Schieber, Antonio Jose Jimeno Yepes, Belinda Lee, Rachel Wong, Chathurika K. H. Hettiarachchige, Natalie Gunn, Jeanne Tie, Hui-Li Wong, Iain Skinner, Ian T. Jones, James Keck, Suzanne Kosmider, Ben Tran, Kathryn Field, Peter Gibbs

CLINICAL COLORECTAL CANCER (2018)

Article Geochemistry & Geophysics

Semantic Labeling Using a Low-Power Neuromorphic Platform

Jianbin Tang, Benjamin Scott Mashford, Antonio Jimeno Yepes

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS (2018)

Article Biochemical Research Methods

BioNorm: deep learning-based event normalization for the curation of reaction databases

Peiliang Lou, Antonio Jimeno Yepes, Zai Zhang, Qinghua Zheng, Xiangrong Zhang, Chen Li

BIOINFORMATICS (2020)

Article Public, Environmental & Occupational Health

Combining Social Media and FDA Adverse Event Reporting System to Detect Adverse Drug Reactions

Ying Li, Antonio Jimeno Yepes, Cao Xiao

DRUG SAFETY (2020)

Article Computer Science, Interdisciplinary Applications

Adverse drug event detection using reason assignments in FDA drug labels

Corey Sutphin, Kahyun Lee, Antonio Jimeno Yepes, Ozlem Uzuner, Bridget T. McInnes

JOURNAL OF BIOMEDICAL INFORMATICS (2020)

Article Health Care Sciences & Services

Automating Quality Assessment of Medical Evidence in Systematic Reviews: Model Development and Validation Study

Simon Suster, Timothy Baldwin, Jey Han Lau, Antonio Jimeno Yepes, David Martinez Iraola, Yulia Otmakhova, Karin Verspoor

Summary: The study proposes a quality assessment task that provides an overall quality rating for each body of evidence (BoE) and justification for different quality criteria. A machine learning system (EvidenceGRADEr) is developed to automate the quality assessment process using a new dataset. The results show that the system performs well for some quality criteria but struggles with others due to limited data availability. This technology has the potential to reduce reviewer workload and expedite evidence synthesis.

JOURNAL OF MEDICAL INTERNET RESEARCH (2023)

Article Mathematical & Computational Biology

Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations

Qingyu Chen, Alexis Allot, Robert Leaman, Rezarta Islamaj, Jingcheng Du, Li Fang, Kai Wang, Shuo Xu, Yuefu Zhang, Parsa Bagherzadeh, Sabine Bergler, Aakash Bhatnagar, Nidhir Bhavsar, Yung-Chun Chang, Sheng-Jie Lin, Wentai Tang, Hongtong Zhang, Ilija Tavchioski, Senja Pollak, Shubo Tian, Jinfeng Zhang, Yulia Otmakhova, Antonio Jimeno Yepes, Hang Dong, Honghan Wu, Richard Dufour, Yanis Labrak, Niladri Chatterjee, Kushagri Tandon, Frejus A. A. Laleye, Loic Rakotoson, Emmanuele Chersoni, Jinghang Gu, Annemarie Friedrich, Subhash Chandra Pujari, Mariia Chizhikova, Naveen Sivadasan, V. G. Saipradeep, Zhiyong Lu

Summary: The COVID-19 pandemic has had a severe impact on global society, leading to a rapid growth in related literature. To address the challenges of manual curation and interpretation, the BioCreative LitCovid track called for a community effort to automate topic annotation. Nineteen teams participated, achieving higher scores compared to existing methods.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Impact of detecting clinical trial elements in exploration of COVID-19 literature

Simon Suster, Karin Verspoor, Timothy Baldwin, Jey Han Lau, Antonio Jimeno Yepes, David Martinez Iraola, Yulia Otmakhova

Summary: The COVID-19 pandemic has increased demand for tools that efficiently explore biomedical literature. Filtering results using clinically-relevant concepts and their relations can improve precision and increase the likelihood of users being exposed to more relevant documents, as demonstrated in an analysis using the TREC-COVID dataset.

2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021) (2021)

Article Audiology & Speech-Language Pathology

Predictive models for cochlear implant outcomes: Performance, generalizability, and the impact of cohort size

Elaheh Shafieibavani, Benjamin Goudey, Isabell Kiral, Peter Zhong, Antonio Jimeno-Yepes, Annalisa Swan, Manoj Gambhir, Andreas Buechner, Eugen Kludt, Robert H. Eikelboom, Cathy Sucher, Rene H. Gifford, Riaan Rottier, Kerrie Plant, Hamideh Anjomshoa

Summary: While machine learning shows better predictive accuracy for cochlear implant outcomes compared to traditional statistical methods, there are still limitations in overall accuracy. The study conducted is the largest retrospective study on cochlear implant outcomes to date, highlighting the superior performance of machine learning models in predicting word recognition scores.

TRENDS IN HEARING (2021)

Proceedings Paper Computer Science, Artificial Intelligence

ICDAR 2021 Competition on Scientific Literature Parsing

Antonio Jimeno Yepes, Peter Zhong, Douglas Burdick

Summary: Scientific literature contains important information for cutting-edge innovations, and advancements in natural language processing have enabled automated information extraction, despite challenges such as unstructured PDF formats and non-natural language content. The ICDAR 2021 Scientific Literature Parsing Competition aims to drive document understanding advances and has shown impressive performance in tasks related to document layout and table recognition.

DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV (2021)

Article Audiology & Speech-Language Pathology

A MultiCenter Analysis of Factors Associated with Hearing Outcome for 2,735 Adults with Cochlear Implants

Benjamin Goudey, Kerrie Plant, Isabell Kiral, Antonio Jimeno-Yepes, Annalisa Swan, Manoj Gambhir, Andreas Buechner, Eugen Kludt, Robert H. Eikelboom, Cathy Sucher, Rene H. Gifford, Riaan Rottier, Hamideh Anjomshoa

Summary: This study investigates the association between 21 preoperative factors and speech recognition one year after cochlear implantation, providing evidence of 17 statistically significant associations. Despite the large sample size, the variance explained by the models remains modest. Additionally, a novel statistical interaction indicates that the duration of deafness in the implanted ear has a stronger impact on hearing outcome when considered relative to a candidate's age.

TRENDS IN HEARING (2021)

Article Biochemical Research Methods

A representation model for biological entities by fusing structured axioms with unstructured texts

Peiliang Lou, YuXin Dong, Antonio Jimeno Yepes, Chen Li

Summary: This study introduces a new bio-entity representation learning model ERBK, which encodes axioms and definitions using knowledge graph embedding method and deep convolutional neural network respectively. Experimental results show that ERBK outperforms existing methods in predicting protein-protein interactions and gene-disease associations, and maintains promising performance under zero-shot circumstances.

BIOINFORMATICS (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Global Locality in Biomedical Relation and Event Extraction

Elaheh ShafieiBavani, Antonio Jimeno Yepes, Xu Zhong, David Martinez Iraola

19TH SIGBIOMED WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2020) (2020)

Article Medicine, General & Internal

Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms

Thomas Schaffter, Diana S. M. Buist, Christoph Lee, Yaroslav Nikulin, Dezso Ribli, Yuanfang Guan, William Lotter, Zequn Jie, Hao Du, Sijia Wang, Jiashi Feng, Mengling Feng, Hyo-Eun Kim, Francisco Albiol, Alberto Albiol, Stephen Morrell, Zbigniew Wojna, Mehmet Eren Ahsen, Umar Asif, Antonio Jimeno Yepes, Shivanthan Yohanandan, Simona Rabinovici-Cohen, Darvin Yi, Bruce Hoff, Thomas Yu, Elias Chaibub Neto, Daniel L. Rubin, Peter Lindholm, Laurie R. Margolies, Russell Bailey McBride, Joseph H. Rothstein, Weiva Sieh, Rami Ben-Ari, Stefan Harrer, Andrew Trister, Stephen Friend, Thea Norman, Berkman Sahiner, Fredrik Strand, Justin Guinney, Gustavo Stolovitzky

JAMA NETWORK OPEN (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Findings of the WMT 2019 Biomedical Translation Shared Task: Evaluation for MEDLINE Abstracts and Biomedical Terminologies

Rachel Bawden, K. Bretonnel Cohen, Cristian Grozea, Antonio Jimeno Yepes, Madeleine Kittner, Martin Krallinger, Nancy Mah, Aurelie Neveol, Mariana Neves, Felipe Soares, Amy Siu, Karin Verspoor, Maika Vicente Navarro

FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), VOL 3: SHARED TASK PAPERS, DAY 2 (2019)

暂无数据