Article

A deep learning-based framework for lung cancer survival analysis with biomarker interpretation

Journal

BMC BIOINFORMATICS
Volume 21, Issue 1

Publisher

BMC
DOI: 10.1186/s12859-020-3431-z

Keywords

Cell detection; Deep learning; Feature learning; Survival analysis

Funding

  1. National Key R&D Program of China [2017YFB1002504]
  2. National Natural Science Foundation of China [61701404]

Background: Lung cancer is the leading cause of cancer-related deaths in both men and women in the United States, and it has a much lower five-year survival rate than many other cancers. Accurate survival analysis is urgently needed for better disease diagnosis and treatment management.

Results: In this work, we propose a survival analysis system that takes advantage of recently emerging deep learning techniques. The proposed system consists of three major components.

  1. The first component is an end-to-end cellular feature learning module using a deep neural network with global average pooling. The learned cellular representations encode high-level biologically relevant information without requiring individual cell segmentation. These representations are aggregated into patient-level feature vectors by a locality-constrained linear coding (LLC)-based bag of words (BoW) encoding algorithm.
  2. The second component is a Cox proportional hazards model with an elastic net penalty for robust feature selection and survival analysis.
  3. The third component is a biomarker interpretation module that can help localize the image regions that contribute to the survival model's decision.

Extensive experiments show that the proposed survival model has excellent predictive power for a public (i.e., The Cancer Genome Atlas) lung cancer dataset in terms of two commonly used metrics: log-rank test (p-value) of the Kaplan-Meier estimate and concordance index (c-index).

Conclusions: In this work, we have proposed a segmentation-free survival analysis system that takes advantage of the recently emerging deep learning framework and well-studied survival analysis methods such as the Cox proportional hazards model. In addition, we provide an approach to visualize the discovered biomarkers, which can serve as concrete evidence supporting the survival model's decision.
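The abstract names two standard building blocks: a Cox proportional hazards model with an elastic net penalty, and the concordance index used to evaluate it. The sketch below illustrates both in plain Python. It is not the authors' implementation: the function names, the Breslow-style risk sets, and the penalty parameterization (`lam`, `alpha`) are assumptions chosen for illustration, and the toy inputs in the usage note are invented.

```python
import math


def concordance_index(times, events, risks):
    """Harrell's c-index: share of comparable pairs ranked correctly.

    A pair (i, j) is comparable when patient i had an observed event
    and a strictly shorter time than patient j. It is concordant when
    the model assigns patient i the higher risk score.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored patient cannot be the earlier member of a pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5  # tied risk scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


def penalized_neg_log_partial_likelihood(times, events, X, beta, lam, alpha):
    """Cox negative log partial likelihood plus an elastic net penalty.

    Assumed (illustrative) penalty form:
        lam * (alpha * ||beta||_1 + 0.5 * (1 - alpha) * ||beta||_2^2)
    """
    # Linear risk score for each patient: beta . x
    scores = [sum(b * x for b, x in zip(beta, row)) for row in X]
    nll = 0.0
    for i in range(len(times)):
        if not events[i]:
            continue  # only observed events contribute a likelihood term
        # Risk set: every patient still under observation at times[i]
        risk_set = [math.exp(scores[j])
                    for j in range(len(times)) if times[j] >= times[i]]
        nll -= scores[i] - math.log(sum(risk_set))
    l1 = sum(abs(b) for b in beta)
    l2 = sum(b * b for b in beta)
    return nll + lam * (alpha * l1 + 0.5 * (1.0 - alpha) * l2)
```

As a usage example on invented data, with `times = [2, 4, 6, 8]` and `events = [1, 1, 0, 1]`, calling `concordance_index(times, events, [0.9, 0.7, 0.4, 0.1])` scores a perfectly ordered ranking, while passing equal risk scores for every patient gives the chance-level value of 0.5. The L1 term in the penalty is what drives some coefficients to exactly zero, which is why an elastic net can act as a feature selector.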

Recommended

Article Computer Science, Artificial Intelligence

MKPM: Multi keyword-pair matching for natural language sentences

Xin Lu, Yao Deng, Ting Sun, Yi Gao, Jun Feng, Xia Sun, Richard Sutcliffe

Summary: The research proposes a sentence matching method based on multi keyword-pair matching to represent the semantic relationship between sentences and avoid the interference of redundancy and noise. Experimental results show that this method can achieve state-of-the-art performance in several tasks.

APPLIED INTELLIGENCE (2022)

Article Biochemical Research Methods

A Novel Encoding and Decoding Calibration Guiding Pathway for Pathological Image Analysis

Hansheng Li, Jianping Li, Yuxin Kang, Chunbao Wang, Feihong Liu, Wenli Hui, Qirong Bo, Lei Cui, Jun Feng, Lin Yang

Summary: Diagnostic pathology is crucial for identifying carcinomas, and accurate quantification of pathological images can provide objective clues. The Global Bank (GLB) pathway has been proposed to guide the extraction of more RoI features, significantly improving performance and increasing the accuracy of quantitative results.

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (2022)

Article Mathematical & Computational Biology

Do Gender or Major Influence the Performance in Programming Learning? Teaching Mode Decision Based on Exercise Series Analysis

Zhizezhang Gao, Yan Zhang, RuiPeng Zhang, Xia Sun, Jun Feng

Summary: Both traditional teaching and online teaching emphasize individualized education. However, the process of exploring improvements in instructional design is hindered by the challenging task of collecting data. Existing research primarily focuses on students' exam scores and overlooks their daily practice. In this study, we propose an experimental paradigm of programming performance analysis based on students' daily practice-exam records and collect a comprehensive time-series dataset, including students' individual attributes, learning behavior, and performance. We then analyze the time-series dataset using generalized estimating equations (GEE) to examine the impact of individual attributes and learning behavior on performance. This is the first application of GEE for ordinal multinomial responses in this research field, from which we conclude that gender and major do contribute to differences in programming learning. Longer answer times and shorter cost times are associated with better performance. Regardless of gender, students tend to cram for exams and perform slightly worse in daily exercises. Finally, we provide teaching mode decisions for universities based on two important individual attributes and recommend different teaching methods for students of different genders at different time points.

COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE (2022)

Article Computer Science, Information Systems

GeoSDVA: A Semi-Supervised Dirichlet Variational Autoencoder Model for Transportation Mode Identification

Xiaoxi Zhang, Yuan Gao, Xin Wang, Jun Feng, Yan Shi

Summary: This paper investigates the problem of transportation mode identification using GPS trajectories and geographic information, and proposes a geographic information-fused semi-supervised method. The proposed method can train an excellent transportation mode identification model with only a few labeled samples.

ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION (2022)

Article Computer Science, Artificial Intelligence

A New Amharic Speech Emotion Dataset and Classification Benchmark

Ephrem Afele Retta, Eiad Almekhlafi, Richard Sutcliffe, Mustafa Mhamed, Haider Ali, Jun Feng

Summary: This article introduces the Amharic Speech Emotion Dataset (ASED) which consists of four dialects and five emotions. It is the first dataset for Speech Emotion Recognition (SER) in Amharic. The dataset was created by 65 native Amharic speakers who recorded 2,474 sound samples. The resulting dataset is freely available for download.

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING (2023)

Article Engineering, Civil

A Dynamic Ridesplitting Method With Potential Pick-Up Probability Based on GPS Trajectories

Boting Qu, Xinyu Ren, Jun Feng, Xin Wang

Summary: Ridesplitting is a convenient and budget-friendly for-hire transportation service that arranges shared rides on the fly, with effective rider allocation being a crucial component. The DRPP method proposed in this paper utilizes a grid network and historical GPS trajectories to predict pick-up probabilities and travel times, using ILSAS and TKdS-tree to improve efficiency for matching drivers and riders, which outperformed other methods in service rate, share rate, and rider waiting time in experiments.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2022)

Article Engineering, Biomedical

Does perfect filtering really guarantee perfect phase correction for diffusion MRI data?

Feihong Liu, Junwei Yang, Mingyue Feng, Zhiming Cui, Xiaowei He, Luping Zhou, Jun Feng, Dinggang Shen

Summary: Phase correction is used to reconstruct real-valued diffusion MRI data by estimating the noise-free background phase. However, signal-loss and artifacts can still occur. In this paper, we propose a complex polar coordinate system (CPCS) to analyze the phase correction procedure and identify its limitations. Based on CPCS, we develop a quantitative criterion to better exploit the background phase and propose a phase calibration procedure to improve phase correction. Experimental results on synthetic and real diffusion MRI data demonstrate the effectiveness of our proposed method in reducing signal-loss and eliminating artifacts in FA maps.

COMPUTERIZED MEDICAL IMAGING AND GRAPHICS (2023)

Article Computer Science, Hardware & Architecture

FCRoute: A Fast FPGA Connection Router Using Soft Routing-Space Pruning Algorithm

Dekui Wang, Jun Feng, Wei Zhou, Xingxing Hao, Xiaodan Zhang

Summary: This article presents a fast FPGA connection router called FCRoute, which is based on a novel soft routing-space pruning algorithm. FCRoute classifies routing resource nodes into high-priority and low-priority ones and consists of a fast maze search and a backtracking process. By avoiding the exploration of the majority of low-priority nodes, FCRoute maintains runtime efficiency while ensuring global search ability.

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Wse-MF: A weighting-based student exercise matrix factorization model

Xia Sun, Bo Li, Richard Sutcliffe, Zhizezhang Gao, Wenying Kang, Jun Feng

Summary: Students can develop their skills by completing a series of tailored exercises, which is more effective than choosing exercises from online sources themselves. This paper presents a novel approach called Weighting-based Student Exercise Matrix Factorization (Wse-MF) that combines student learning ability and exercise difficulty. The research results demonstrate that Wse-MF outperforms other models in cognitive diagnosis and matrix factorization in terms of prediction quality and time complexity. There is also an optimal value of the latent factor K and hyperparameter c0 for each dataset. Overall, this paper contributes to the improvement of matrix factorization in educational data.

PATTERN RECOGNITION (2023)

Article Computer Science, Hardware & Architecture

A Fast FPGA Connection Router Using Prerouting-Based Parallel Local Routing Algorithm

Dekui Wang, Jun Feng, Ke Liu, Wei Zhou, Xingxing Hao, Xiaodan Zhang

Summary: This article introduces a fast FPGA connection router called PRoute, which implements a novel prerouting-based parallel local routing algorithm. PRoute precomputes potential routing solutions for various connection patterns on FPGAs, and achieves runtime efficiency and global search ability through parallel local search and A-star maze expansion. Experimental results show that PRoute achieves significant speedups without degrading the quality of results.

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS (2023)

Article Engineering, Multidisciplinary

Arabic sentiment analysis using GCL-based architectures and a customized regularization function

Mustafa Mhamed, Richard Sutcliffe, Xia Sun, Jun Feng, Ephrem Afele Retta

Summary: Sentiment analysis aims to extract emotions from textual data, and various challenges have emerged due to the proliferation of social media platforms and the flow of data in the Arabic language. This paper introduces Gated Convolution Long (GCL), an architecture designed for Arabic Sentiment Analysis, which overcomes difficulties with lengthy sequence training samples and improves performance for binary and multiple classifications. The proposed method achieves better results than baselines in various Arabic datasets, and includes a Custom Regularization Function (CRF) that enhances performance and optimizes validation loss. Furthermore, the paper explores the relationship between Modern Standard Arabic and five Arabic dialects through a cross-dialect training study.

ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH (2023)

Article Computer Science, Artificial Intelligence

Sentiment Analysis: Comprehensive Reviews, Recent Advances, and Open Challenges

Qiang Lu, Xia Sun, Yunfei Long, Zhizezhang Gao, Jun Feng, Tao Sun

Summary: Sentiment analysis (SA) has achieved significant breakthroughs in the past decade and there is a growing interest in multimodal SA (MSA). This article provides a comprehensive overview of SA advances, introduces a novel framework for SA tasks, and discusses the workflows and recent advances of single-modal SA. It also explores the research gaps and challenges in MSA, and proposes potential directions for future improvement.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Chemistry, Multidisciplinary

Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages

Ephrem Afele Retta, Richard Sutcliffe, Jabar Mahmood, Michael Abebe Berwo, Eiad Almekhlafi, Sajjad Ahmad Khan, Shehzad Ashraf Chaudhry, Mustafa Mhamed, Jun Feng

Summary: Cross-lingual and multilingual training can be an effective strategy for training an SER classifier when resources for a language are scarce. The difficulty of SER varies for different languages, and better results can be obtained by using two or three non-target languages for training.

APPLIED SCIENCES-BASEL (2023)

Article Engineering, Electrical & Electronic

Speech Emotion Recognition via Multi-Level Attention Network

Ke Liu, Dekui Wang, Dongya Wu, Yutao Liu, Jun Feng

Summary: The aim of this research is to improve the performance of human speech emotion recognition. The proposed multi-level attention network (MLAnet) extracts low-level emotion features from the popular mel-scale frequency cepstral coefficient (MFCC) and weights these features using a multi-unit attention module. Experimental results show that this method outperforms other state-of-the-art approaches.

IEEE SIGNAL PROCESSING LETTERS (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Time-Frequency Attention for Speech Emotion Recognition with Squeeze-and-Excitation Blocks

Ke Liu, Chen Wang, Jiayue Chen, Jun Feng

Summary: The study proposes a novel Time-Frequency Attention (TFA) method to better extract low-level features in speech emotion recognition and improve accuracy. By utilizing Squeeze-and-Excitation (SE) blocks to effectively integrate global information, the experimental results indicate that the proposed method outperforms existing methods with significant improvements in emotion recognition accuracy.

MULTIMEDIA MODELING (MMM 2022), PT I (2022)
