4.7 Article

Dataset-aware multi-task learning approaches for biomedical named entity recognition

Journal

BIOINFORMATICS
Volume 36, Issue 15, Pages 4331-4338

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btaa515

Keywords

-

Funding

  1. Natural Science Foundation of Shenzhen City [JCYJ20180306172131515]

Ask authors/readers for more resources

Motivation: Named entity recognition is a critical and fundamental task for biomedical text mining. Recently, researchers have focused on exploiting deep neural networks for biomedical named entity recognition (Bio-NER). The performance of deep neural networks on a single dataset mostly depends on data quality and quantity while high-quality data tends to be limited in size. To alleviate task-specific data limitation, some studies explored the multi-task learning (MTL) for Bio-NER and achieved state-of-the-art performance. However, these MTL methods did not make full use of information from various datasets of Bio-NER. The performance of state-of-the-art MTL method was significantly limited by the number of training datasets. Results: We propose two dataset-aware MTL approaches for Bio-NER which jointly train all models for numerous Bio-NER datasets, thus each of these models could discriminatively exploit information from all of related training datasets. Both of our two approaches achieve substantially better performance compared with the state-of-the-art MTL method on 14 out of 15 Bio-NER datasets. Furthermore, we implemented our approaches by incorporating Bio-NER and biomedical part-of-speech (POS) tagging datasets. The results verify Bio-NER and POS can significantly enhance one another.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemical Research Methods

Discovery of G-quadruplex-forming sequences in SARS-CoV-2

Danyang Ji, Mario Juhas, Chi Man Tsang, Chun Kit Kwok, Yongshu Li, Yang Zhang

Summary: Researchers have identified potential G-quadruplex-forming sequences in the SARS-CoV-2 RNA genome and other Coronaviridae family members, with some confirmed to form RNA G-quadruplex structures in vitro. These structures were found to interact with viral helicase (nsp13), suggesting a potential target for inhibiting the virus.

BRIEFINGS IN BIOINFORMATICS (2021)

Editorial Material Genetics & Heredity

Biosensing Detection of the SARS-CoV-2 D614G Mutation

Yang Zhang, Hui Xi, Mario Juhas

Summary: The emergence of a mutant strain of SARS-CoV-2 with an amino acid change at position 614 has become dominant in the pandemic, highlighting the importance of efficient detection using biosensing technologies for pandemic control.

TRENDS IN GENETICS (2021)

Article Chemistry, Analytical

Proteomic and Transcriptome Profiling of G-Quadruplex Aptamers Developed for Cell Internalization

Yang Zhang, Yang Wu, Hongjin Zheng, Hui Xi, Taoyu Ye, Chun-Yin Chan, Chun Kit Kwok

Summary: Nucleic acid medicine is emerging as a promising next-generation therapy, and the development of cell-penetrating aptamers can enhance the cellular delivery efficiency of therapeutic nucleic acids. Characteristic CD spectral analysis revealed the G-quadruplex structures of enriched aptamers.

ANALYTICAL CHEMISTRY (2021)

Article Biochemical Research Methods

A span-based joint model for extracting entities and relations of bacteria biotopes

Mei Zuo, Yang Zhang

Summary: Motivation: Information about bacteria biotopes (BB) is crucial for microbiological research and applications. The BB task at BioNLP-OST 2019 focuses on extracting microorganism locations and phenotypes from biomedical texts. Our span-based model, utilizing a pre-trained BERT model, achieves significantly better performance in entity and relation extraction tasks for BBs compared to previous methods, showing a reduction of 21.6% in slot error rate (SER). The model also shows effectiveness in recognizing nested entities and can be applied to other related tasks with state-of-the-art performance.

BIOINFORMATICS (2022)

Editorial Material Biochemistry & Molecular Biology

Deep Learning for Imaging and Detection of Microorganisms

Yang Zhang, Hao Jiang, Taoyu Ye, Mario Juhas

Summary: Despite the significant interest in deep learning in microbiology, its full potential is yet to be realized. Deep-learning-based systems are believed to play a crucial role in monitoring and investigating microorganisms in the future.

TRENDS IN MICROBIOLOGY (2021)

Article Biology

Multi-stage malaria parasite recognition by deep learning

Sen Li, Zeyu Du, Xiangjie Meng, Yang Zhang

Summary: This article introduces a novel deep learning approach using a deep transfer graph convolutional network (DTGCN) for the recognition of malaria parasites of various stages in blood smear images. The method has shown higher accuracy and effectiveness in publicly available microscopic images of multi-stage malaria parasites compared to a wide range of state-of-the-art approaches.

GIGASCIENCE (2021)

Article Biophysics

Fabricated Metal-Organic Frameworks (MOFs) as luminescent and electrochemical biosensors for cancer biomarkers detection

Brij Mohan, Sandeep Kumar, Hui Xi, Shixuan Ma, Zhiyu Tao, Tiantian Xing, Hengzhi You, Yang Zhang, Peng Ren

Summary: The study focuses on the application of sensors made of porous metal-organic frameworks (MOFs) in cancer biomarker detection, analyzing factors such as fabrication strategies and structural properties that influence sensing performance, and proposes an innovative technique for detecting cancer biomarkers using luminescence and electrochemical sensors.

BIOSENSORS & BIOELECTRONICS (2022)

Letter Biochemistry & Molecular Biology

DRBin: metagenomic binning based on deep representation learning

Gang Mao, Yulin Wu, Yang Zhang, Xuan Wang, Yan Zhu, Bo Liu, Yadong Wang, Junyi Li

JOURNAL OF GENETICS AND GENOMICS (2022)

Review Biotechnology & Applied Microbiology

Aptamers targeting SARS-COV-2: a promising tool to fight against COVID-19

Yang Zhang, Mario Juhas, Chun Kit Kwok

Summary: SARS-CoV-2, the cause of COVID-19, is a major contributor to global mortality. Existing antigen/antibody-based immunoassays and neutralizing antibodies are often ineffective against emerging SARS-CoV-2 variants, highlighting the urgent need for new approaches. Aptamers have been successfully used for detecting and inhibiting various viruses, and hold promise in the fight against COVID-19. This review discusses recent advances and future trends in the development of aptamer-based approaches for the diagnosis and treatment of SARS-CoV-2.

TRENDS IN BIOTECHNOLOGY (2023)

Article Chemistry, Medicinal

Clinical Translation of Aptamers for COVID-19

Yang Zhang, Yongen Li

Summary: SARS-CoV-2, the virus causing COVID-19, remains a leading cause of death globally. Despite the development of effective methods for diagnosing and treating COVID-19, there is still an urgent need for new approaches to tackle SARS-CoV-2 variants and long COVID. Aptamers have shown great potential as diagnostic and therapeutic agents for COVID-19, but their translation into clinical use has been slow, posing challenges that need to be overcome.

JOURNAL OF MEDICINAL CHEMISTRY (2023)

Article Biochemical Research Methods

A knowledge-integrated deep learning framework for cellular image analysis in parasite microbiology

Ruijun Feng, Sen Li, Yang Zhang

Summary: Cellular image analysis is a crucial method employed by microbiologists for the identification and study of microbes. The article presents a knowledge-integrated deep learning framework for cellular image analysis, focusing on classification, detection, and reconstruction tasks. It provides comprehensive information on various models, datasets, computing environment setup, knowledge representation, data pre-processing, and training and tuning, as well as evaluation and visualization techniques.

STAR PROTOCOLS (2023)

Article Chemistry, Analytical

Fluorescence detection of the human angiotensinogen protein by the G-quadruplex aptamer

Hui Xi, Hanlin Jiang, Mario Juhas, Yang Zhang

Summary: Noncanonical G-quadruplex nucleic acid structures have been used as probes in biosensors for accurate and efficient detection of metal ions, proteins, and nucleic acids. In this study, a reliable and efficient fluorescent biosensor platform for G-quadruplex based detection of the human AGT protein was constructed using the magnetic bead enrichment method. This biosensor provides high accuracy, speed, and low cost, and successfully detected AGT at the cellular level.

ANALYST (2022)

Article Biochemistry & Molecular Biology

Correction of out-of-focus microscopic images by deep learning

Chi Zhang, Hao Jiang, Weihuang Liu, Junyi Li, Shiming Tang, Mario Juhas, Yang Zhang

Summary: In this study, a model based on Cycle Generative Adversarial Network (CycleGAN) and a multi-component weighted loss function was developed to address the issue of out-of-focus microscopic images. The proposed model achieved state-of-the-art performance in deblurring and demonstrated excellent generalization capabilities.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2022)

Article Biochemistry & Molecular Biology

Genomic pan-cancer classification using image-based deep learning

Taoyu Ye, Sen Li, Yang Zhang

Summary: This study introduces a novel image-based deep learning strategy for cancer classification, achieving higher accuracy compared to existing methods. The approach is not only applicable to various types of cancer, but also helps identify top-ranked tumor-specific genes and pathways through heatmaps.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2021)

Article Biochemistry & Molecular Biology

Deep learning for COVID-19 chest CT (computed tomography) image analysis: A lesson from lung cancer

Hao Jiang, Shiming Tang, Weihuang Liu, Yang Zhang

Summary: To address the urgent need for COVID-19 diagnosis, AI-based methods for analyzing chest CT images have been proposed. By synthesizing a dataset and testing various deep learning models, accurate and efficient diagnostic testing for COVID-19 can be achieved.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2021)

No Data Available