4.6 Article

Deep learning based early stage diabetic retinopathy detection using optical coherence tomography

Journal

NEUROCOMPUTING
Volume 369, Issue -, Pages 134-144

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2019.08.079

Keywords

Computer-aided diagnosis; Diabetic retinopathy; Optical coherence tomography; Deep learning

Funding

  1. National Natural Science Foundation of China [61672357, 61702337, U1713214]
  2. Science and Technology Project of Guangdong Province [2018A050501014]

Ask authors/readers for more resources

Diabetic retinopathy (DR) is one of the leading causes of preventable blindness globally. Performing retinal examinations on all diabetic patients is an unmet need, and detection at an early stage can provide better control of the disease. The objective of this study is to provide an optical coherence tomography (OCT) image based diagnostic technology for automated early DR diagnosis, including at both grades 0 and 1. This work can help ophthalmologists with evaluation and treatment, reducing the rate of vision loss, and enabling timely and accurate diagnosis. In this work, we developed and evaluated a novel deep network - OCTD_Net, for early-stage DR detection. While one of the networks extracted features from the original OCT image, the other extracted retinal layer information. The accuracy, sensitivity and specificity was 0.92, 0.90 and 0.95, respectively. Our analysis of retinal layers and the features learned by the proposed network suggests that grade 1 DR patients present with significant changes in the thickness and reflection of certain retinal layers. However, grade 0 DR patients do not have such significant changes. The heatmaps of the trained network also suggest that patients with early DR showed different textures around the myoid and ellipsoid zones, inner nuclear layers, and photoreceptor outer segments, which should all receive dedicated attention for early DR diagnosis. (C) 2019 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

TW-GAN: Topology and width aware GAN for retinal artery/vein classification

Wenting Chen, Shuang Yu, Kai Ma, Wei Ji, Cheng Bian, Chunyan Chu, Linlin Shen, Yefeng Zheng

Summary: This paper proposes a novel Topology and Width Aware Generative Adversarial Network (TW-GAN) that integrates topology connectivity and vessel width information into the deep learning framework for automatic artery/vein (A/V) classification. Experimental results demonstrate that the proposed framework significantly improves the topological connectivity of segmented A/V masks and achieves state-of-the-art A/V classification performance on public datasets.

MEDICAL IMAGE ANALYSIS (2022)

Article Computer Science, Artificial Intelligence

Tongue size and shape classification fusing segmentation features for traditional Chinese medicine diagnosis

Yating Huang, Xuechen Li, Siting Zheng, Zhongliang Li, Sihan Li, Linlin Shen, Changen Zhou, Zhihui Lai

Summary: The size and shape of the tongue can reflect different pathological changes of the human body in Traditional Chinese Medicine (TCM). In this work, an efficient deep network, TSCWNet, is proposed for tongue size and shape classification. Experimental results demonstrate that the network achieves better classification performance for tongue diagnosis.

NEURAL COMPUTING & APPLICATIONS (2023)

Article Engineering, Electrical & Electronic

Weakly Supervised Pedestrian Segmentation for Person Re-Identification

Ziqi Jin, Jinheng Xie, Bizhu Wu, Linlin Shen

Summary: In this paper, a weakly supervised pedestrian segmentation framework is proposed to directly generate the foreground mask from person re-identification datasets with only image-level subject ID labels. The Image Synthesis Augmentation (ISA) technique is also introduced to further enhance the dataset. Experimental results demonstrate that the proposed framework learns robust and discriminative features, achieving significant improvement in mAP compared to the baseline on widely used datasets including Market-1501, CUHK03, and MSMT17. The code will be made available soon.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Artificial Intelligence

CT-Based Automatic Spine Segmentation Using Patch-Based Deep Learning

Syed Furqan Qadri, Hongxiang Lin, Linlin Shen, Mubashir Ahmad, Salman Qadri, Salabat Khan, Maqbool Khan, Syeda Shamaila Zareen, Muhammad Azeem Akbar, Md Belal Bin Heyat, Saqib Qamar

Summary: This study proposes a patch-based deep learning approach for automatic CT vertebral segmentation. The method extracts discriminative features from unlabeled data using a stacked sparse autoencoder and achieves accurate segmentation of CT vertebrae.

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Learning from pseudo-lesion: a self-supervised framework for COVID-19 diagnosis

Zhongliang Li, Xuechen Li, Zhihao Jin, Linlin Shen

Summary: In this paper, a novel self-supervised pretraining method based on pseudo-lesion generation and restoration was proposed for COVID-19 diagnosis. The method trained an encoder-decoder architecture-based U-Net using pairs of pseudo-COVID-19 images and normal CT images for image restoration, and then fine-tuned the pretrained encoder using labeled data. Experimental results demonstrated that the proposed method extracted better feature representation for COVID-19 diagnosis, achieving higher accuracy compared to the supervised model pretrained on large-scale images by 6.57% and 3.03% on SARS-CoV-2 dataset and Jinan COVID-19 dataset, respectively.

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Enhanced robust spatial feature selection and correlation filter learning for UAV tracking

Jiajun Wen, Honglin Chu, Zhihui Lai, Tianyang Xu, Linlin Shen

Summary: This paper proposes an innovative method called EFSCF, which performs jointly sparse feature learning to handle the spatial boundary effect effectively while suppressing the influence of background pixels and noises. The proposed method achieves better tracking performance than the state-of-the-art trackers by exploring the structural sparsity in rows and columns of a learned filter simultaneously.

NEURAL NETWORKS (2023)

Article Neurosciences

Style attention based global-local aware GAN for personalized facial caricature generation

Xiuzhi Zhao, Wenting Chen, Weicheng Xie, Linlin Shen

Summary: This study proposes a Style Attention based Global-local Aware GAN to generate personalized caricatures. It integrates the facial characteristics of a subject through a landmark-based warp controller for personalized shape exaggeration and uses a style-attention module for appropriate fusion of facial features and caricature style. The results indicate that the proposed method can preserve the identity of input photos and generate caricatures close to those drawn by real artists.

FRONTIERS IN NEUROSCIENCE (2023)

Article Plant Sciences

Effect of Climate Change on the Potentially Suitable Distribution Pattern of Castanopsis hystrix Miq. in China

Linlin Shen, Haiyan Deng, Ganglong Zhang, Anqi Ma, Xiaoyong Mo

Summary: Climate warming poses a significant threat to global ecosystems, impacting the geographic distribution and suitable growth areas of species. This study focused on predicting the potential cultivation regions of Castanopsis hystrix Miq., a research object, based on the MaxEnt model and environmental variables. The key factors affecting the distribution area of C. hystrix Miq. were identified as the minimum temperature of the coldest month, precipitation of the driest month, and precipitation of the warmest quarter. The suitable cultivation regions were found in central and southern China, with a range of 18-34°N and 89-122°E, covering an area of 261.95 x 10(4) km(2). Under different climate scenarios, the spatial pattern of C. hystrix Miq. will migrate to different regions, with varying changes in suitable area and cultivation distribution.

PLANTS-BASEL (2023)

Article Computer Science, Artificial Intelligence

Adaptive weighted rain streaks model-driven deep network for single image deraining

Ya-Nan Zhang, Linlin Shen, Zhihui Lai

Summary: In the field of computer vision, removing rain streaks from images is an important task as it affects the performance and quality of subsequent tasks and outdoor images. Deep learning-based methods have been proposed to address this issue, but they lack interpretability and have limited performance in detail restoration and rain streak removal. This paper introduces a rain streaks model-driven deep network, MSANet, to overcome these limitations.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Automation & Control Systems

Robust Twin Bounded Support Vector Classifier With Manifold Regularization

Junhong Zhang, Zhihui Lai, Heng Kong, Linlin Shen

Summary: In this paper, a new robust manifold twin bounded SVM (RMTBSVM) algorithm is proposed, which considers both robustness and discriminability. By using the capped L-1-norm as the distance metric and adding robust manifold regularization, the robustness and classification performance are improved. The algorithm is extended for nonlinear classification using the kernel method.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Computer Science, Artificial Intelligence

Self-Supervised Learning of Person-Specific Facial Dynamics for Automatic Personality Recognition

Siyang Song, Shashank Jaiswal, Enrique Sanchez, Georgios Tzimiropoulos, Linlin Shen, Michel Valstar

Summary: This article addresses two important issues in automatic personality analysis systems: the use of short video segments or single frames for inferring personality traits, and the lack of methods for encoding person-specific facial dynamics. To tackle these issues, the paper proposes a novel Rank Loss for self-supervised learning of facial dynamics and a method to represent person-specific dynamics. The approach achieves promising results in personality estimation and shows the importance of the tasks performed by the subject in the video and the use of multi-scale dynamics.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Article Thermodynamics

Structural, elastic and thermodynamic properties of the binary precipitates γ-TiAl, DO22-Al3Ti and α2-Ti3Al

Lili Liu, Cai Chen, Linlin Shen, Gang Xu, Yufeng Wen, Xianshi Zing

Summary: The pressure dependence of lattice and elastic constants of g-TiAl, DO22-Al3Ti, and alpha(2)-Ti3Al binary precipitates was investigated using a first-principles approach. The calculated results at 0 GPa and 0 K were in good agreement with existing experimental and theoretical values. The temperature and pressure dependencies of bulk modulus, Gibbs free energy, thermal expansion coefficient, and heat capacity at constant pressure were systematically studied using density-functional perturbation theory (DFPT) under the quasiharmonic approximation (QHA) in the ranges of 0-1000 K and 0-30 GPa.

HIGH TEMPERATURES-HIGH PRESSURES (2023)

Article Computer Science, Artificial Intelligence

Automatic Learning Rate Adaption for Memristive Deep Learning Systems

Yang Zhang, Linlin Shen

Summary: In this study, an automatic learning rate tuning method for memristive deep learning systems is presented. The method utilizes memristors to adjust the adaptive learning rate in deep neural networks. The proposed method is robust to noisy gradients, various architectures, and different datasets and can address the issue of over-fitting. Moreover, a quantized neural network architecture is utilized in the presented system, leading to an increase in training efficiency without the loss of testing accuracy.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Taming Self-Supervised Learning for Presentation Attack Detection: De-Folding and De-Mixing

Zhe Kong, Wentian Zhang, Feng Liu, Wenhan Luo, Haozhe Liu, Linlin Shen, Raghavendra Ramachandra

Summary: This study empirically proves the importance of model initialization for generalization and proposes a self-supervised learning-based method to address the issue of unknown PAI.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Proceedings Paper Neuroimaging

Point Beyond Class: A Benchmark for Weakly Semi-supervised Abnormality Localization in Chest X-Rays

Haoqin Ji, Haozhe Liu, Yuexiang Li, Jinheng Xie, Nanjun He, Yawen Huang, Dong Wei, Xinrong Chen, Linlin Shen, Yefeng Zheng

Summary: Accurate abnormality localization in chest X-rays (CXR) is important for clinical diagnosis, but obtaining lesion-level annotations is difficult and time-consuming. We propose a weakly semi-supervised strategy called Point Beyond Class (PBC) that utilizes fully annotated CXRs with bounding boxes and weakly annotated samples by points. PBC learns a mapping from point annotations to bounding boxes and utilizes regularization and self-supervision to improve accuracy. Experimental results show that PBC outperforms the state-of-the-art method in abnormality localization.

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III (2022)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)