4.7 Article

Efficient Deep Network Architectures for Fast Chest X-Ray Tuberculosis Screening and Visualization

期刊

SCIENTIFIC REPORTS
卷 9, 期 -, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41598-019-42557-4

关键词

-

资金

  1. German Research Foundation (DFG)
  2. Technical University of Munich within the Open Access Publishing Funding Programme

向作者/读者索取更多资源

Automated diagnosis of tuberculosis (TB) from chest X-Rays (CXR) has been tackled with either hand-crafted algorithms or machine learning approaches such as support vector machines (SVMs) and convolutional neural networks (CNNs). Most deep neural network applied to the task of tuberculosis diagnosis have been adapted from natural image classification. These models have a large number of parameters as well as high hardware requirements, which makes them prone to overfitting and harder to deploy in mobile settings. We propose a simple convolutional neural network optimized for the problem which is faster and more efficient than previous models but preserves their accuracy. Moreover, the visualization capabilities of CNNs have not been fully investigated. We test saliency maps and grad-CAMs as tuberculosis visualization methods, and discuss them from a radiological perspective.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Optics

Deep learning in attosecond metrology

Christian Brunner, Andreas Duensing, Christian Schroeder, Michael Mittermair, Vladimir Golkov, Maximilian Pollanka, Daniel Cremers, Reinhard Kienberger

Summary: In this study, deep neural networks are applied to solve the challenge of information extraction from spectrograms recorded with the attosecond streak camera in time-resolved photoelectron spectroscopy. Extensive benchmarking on simulated data shows that the deep neural networks exhibit competitive retrieval quality and superior tolerance against noisy data conditions.

OPTICS EXPRESS (2022)

Article Robotics

DM-VIO: Delayed Marginalization Visual-Inertial Odometry

Lukas von Stumberg, Daniel Cremers

Summary: We present a monocular visual-inertial odometry system based on delayed marginalization and pose graph bundle adjustment. By delaying marginalization, we can obtain updated marginalization prior and new linearization points, and inject IMU information into marginalized states. Our system outperforms existing techniques in visual-inertial odometry.

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Article Computer Science, Artificial Intelligence

Lifting the Convex Conjugate in Lagrangian Relaxations: A Tractable Approach for Continuous Markov Random Fields

Hartmut Bauermeister, Emanuel Laude, Thomas Moellenhoff, Michael Moeller, Daniel Cremers

Summary: Dual decomposition approaches in nonconvex optimization often encounter duality gaps. This paper eliminates the duality gap by reformulating the nonconvex task in the space of measures and approximating the infinite-dimensional problem using a piecewise polynomial discretization in the dual. The approach successfully reduces the duality gap and demonstrates scalability in the stereo matching problem.

SIAM JOURNAL ON IMAGING SCIENCES (2022)

Article Computer Science, Artificial Intelligence

A Cutting-Plane Method for Sublabel-Accurate Relaxation of Problems with Product Label Spaces

Zhenzhang Ye, Bjoern Haefner, Yvain Queau, Thomas Moellenhoff, Daniel Cremers

Summary: This paper discusses the formulation of imaging and low-level vision problems as nonconvex variational problems and proposes convex relaxation methods to solve them. It extends a previous conference paper by introducing product-space relaxation and sublabel-accurate discretization, and demonstrates the use of a cutting-plane method to solve the resulting semi-infinite optimization problem. The journal version includes additional experiments, a more detailed algorithm outline, and a user-friendly introduction to functional lifting methods.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2023)

Article Computer Science, Artificial Intelligence

Learn to Predict Sets Using Feed-Forward Neural Networks

Hamid Rezatofighi, Tianyu Zhu, Roman Kaskman, Farbod T. Motlagh, Javen Qinfeng Shi, Anton Milan, Daniel Cremers, Laura Leal-Taixe, Ian Reid

Summary: This paper addresses the task of set prediction using deep feed-forward neural networks. It presents a novel approach for learning to predict sets with unknown permutation and cardinality using deep neural networks. The validity of the proposed approach is demonstrated on various vision problems.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

Learning vision based autonomous lateral vehicle control without supervision

Qadeer Khan, Idil Sueloe, Melis Oecal, Daniel Cremers

Summary: Supervised deep learning methods using image data have shown promise in vehicle control, but suffer from the need for labeled training data and poor performance on out-of-distribution scenarios. To address these issues, we propose a framework that leverages visual odometry to determine vehicle trajectory and uses this to infer steering labels. Additionally, synthesized images from deviated trajectories are included in the training distribution for improved neural network robustness.

APPLIED INTELLIGENCE (2023)

Article Robotics

E-NeRF: Neural Radiance Fields From a Moving Event Camera

Simon Klenk, Lukas Koestler, Davide Scaramuzza, Daniel Cremers

Summary: Estimating neural radiance fields (NeRFs) from ideal images has been extensively studied. However, most methods assume optimal illumination and camera motion, which are often violated in robotic applications. To address this, we propose E-NeRF, the first method that estimates NeRFs from a fast-moving event camera.

IEEE ROBOTICS AND AUTOMATION LETTERS (2023)

Proceedings Paper Computer Science, Artificial Intelligence

High-Quality RGB-D Reconstruction via Multi-View Uncalibrated Photometric Stereo and Gradient-SDF

Lu Sang, Bjoern Haefner, Xingxing Zuo, Daniel Cremers

Summary: This paper presents a novel multi-view RGB-D based reconstruction method that utilizes a gradient signed distance field (gradient-SDF) to handle camera pose, lighting, albedo, and surface normal estimation. The proposed method optimizes the surface's quantities using its volumetric representation and validates two physically-based image formation models. Experimental results show that this method can recover high-quality surface geometry more accurately.

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Neural Implicit Representations for Physical Parameter Inference from a Single Video

Florian Hofherr, Lukas Koestler, Florian Bernard, Daniel Cremers

Summary: We propose a method that combines neural implicit representations with neural ordinary differential equations to directly identify dynamic scene representations from visual observations. Our model requires less training data and has stronger generalization abilities than existing methods, and it can process high-resolution videos and synthesize photorealistic images. Additionally, our model can identify interpretable physical parameters and make long-term predictions.

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

VENTRILOQUIST-NET: LEVERAGING SPEECH CUES FOR EMOTIVE TALKING HEAD GENERATION

Deepan Das, Qadeer Khan, Daniel Cremers

Summary: Ventriloquist-Net is a novel model for generating talking head images using a speech segment and a single face image, emphasizing on emotive expressions. It can handle in-the-wild source images and demonstrates state-of-the-art performance on unseen input data.

2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP (2022)

Proceedings Paper Automation & Control Systems

DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment

Mariia Gladkova, Nikita Korobov, Nikolaus Demmel, Aljosa Osep, Laura Leal-Taixe, Daniel Cremers

Summary: This paper proposes DirectTracker, a framework that effectively combines direct image alignment and sliding-window photometric bundle adjustment for 3D multi-object tracking, showing competitive performance.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Joint Deep Multi-Graph Matching and 3D Geometry Learning from Inhomogeneous 2D Image Collections

Zhenzhang Ye, Tarun Yenamandra, Florian Bernard, Daniel Cremers

Summary: This paper proposes a trainable framework that uses graph neural networks to learn a deformable 3D geometry model from inhomogeneous image collections for graph matching tasks. The method outperforms recent learning-based approaches in terms of accuracy and cycle-consistency error, while also obtaining the underlying 3D geometry of the objects in the images.

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE (2022)

Proceedings Paper Computer Science, Artificial Intelligence

A Unified Framework for Implicit Sinkhorn Differentiation

Marvin Eisenberger, Aysim Toker, Laura Leal-Taixe, Florian Bernard, Daniel Cremers

Summary: The Sinkhorn operator has gained popularity in computer vision and related fields due to its easy integration into deep learning frameworks. This article proposes an algorithm for obtaining analytical gradients of a Sinkhorn layer through implicit differentiation, allowing for any type of loss function and joint differentiation of target capacities and cost matrices. Error bounds for approximate inputs are also constructed. The results demonstrate improved stability, accuracy, and computational efficiency compared to automatic differentiation, particularly in resource-constrained scenarios like GPU memory.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Biologically Inspired Neural Path Finding

Hang Li, Qadeer Khan, Volker Tresp, Daniel Cremers

Summary: This paper presents a computational framework inspired by the human brain to find the optimal low cost path between two nodes in a graph. The framework is able to handle unseen graphs and adapt to changes in node configurations during inference.

BRAIN INFORMATICS (BI 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Perceiver Hopfield Pooling for Dynamic Multi-modal and Multi-instance Fusion

Dominik Roessle, Daniel Cremers, Torsten Schoen

Summary: This paper introduces a novel dynamic multi-modal and multi-instance network architecture that can learn intrinsic data fusion. By using Perceiver and Hopfield pooling, the proposed architecture outperforms the late fusion baseline by more than 40% accuracy in multi-modal setups, particularly on noisy data.

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I (2022)

暂无数据