4.6 Article

Modeling eye movement patterns to characterize perceptual skill in image-based diagnostic reasoning processes

Journal

COMPUTER VISION AND IMAGE UNDERSTANDING
Volume 151, Issue -, Pages 138-152

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.cviu.2016.03.001

Keywords

Visual attention; Multi-modal data; Diagnostic reasoning; Probabilistic modeling; Nonparametric Bayesian method; Markov chain Monte Carlo; Combinatorial stochastic processes

Funding

  1. National Science Foundation [IIS-0941452 CDI]
  2. National Institutes of Health [1 R21 LM010039-01A1]

Ask authors/readers for more resources

Experts have a remarkable capability of locating, perceptually organizing, identifying, and categorizing objects in images specific to their domains of expertise. In this article, we present a hierarchical probabilistic framework to discover the stereotypical and idiosyncratic viewing behaviors exhibited with expertise specific groups. Through these patterned eye movement behaviors we are able to elicit the domain specific knowledge and perceptual skills from the subjects whose eye movements are recorded during diagnostic reasoning processes on medical images. Analyzing experts' eye movement patterns provides us insight into cognitive strategies exploited to solve complex perceptual reasoning tasks. An experiment was conducted to collect both eye movement and verbal narrative data from three groups of subjects with different levels or no medical training (eleven board-certified dermatologists, four dermatologists in training and thirteen undergraduates) while they were examining and describing 50 photographic dermatological images. We use a hidden Markov model to describe each subject's eye movement sequence combined with hierarchical stochastic processes to capture and differentiate the discovered eye movement patterns shared by multiple subjects within and among the three groups. Independent experts' annotations of diagnostic conceptual units of thought in the transcribed verbal narratives are time-aligned with discovered eye movement patterns to help interpret the patterns' meanings. By mapping eye movement patterns to thought units, we uncover the relationships between visual and linguistic elements of their reasoning and perceptual processes, and show the manner in which these subjects varied their behaviors while parsing the images. We also show that inferred eye movement patterns characterize groups of similar temporal and spatial properties, and specify a subset of distinctive eye movement patterns which are commonly exhibited across multiple images. Based on the combinations of the occurrences of these eye movement patterns, we are able to categorize the images from the perspective of experts' viewing strategies in a novel way. In each category, images share similar lesion distributions and configurations. Our results show that modeling with multi-modal data, representative of physicians' diagnostic viewing behaviors and thought processes, is feasible and informative to gain insights into physicians' cognitive strategies, as well as medical image understanding. (C) 2016 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Multidisciplinary Sciences

Gaze-in-wild: A dataset for studying eye and head coordination in everyday activities

Rakshit Kothari, Zhizhuo Yang, Christopher Kanan, Reynold Bailey, Jeff B. Pelz, Gabriel J. Diaz

SCIENTIFIC REPORTS (2020)

Article Ophthalmology

Computational framework for fusing eye movements and spoken narratives for image annotation

Preethi Vaidyanathan, Emily Prud'hommeaux, Cecilia O. Alm, Jeff B. Pelz

JOURNAL OF VISION (2020)

Article Health Care Sciences & Services

Quantitative Methods for Analyzing Intimate Partner Violence in Microblogs: Observational Study

Christopher Michael Homan, J. Nicolas Schrading, Raymond W. Ptucha, Catherine Cerulli, Cecilia Ovesdotter Alm

JOURNAL OF MEDICAL INTERNET RESEARCH (2020)

Article Computer Science, Information Systems

Evaluating Technology-Mediated Collaborative Workflows for Telehealth

Christopher Bondy, Linlin Chen, Pamela Grover, Vicki Hanson, Rui Li, Pengcheng Shi

Summary: This paper introduces a cross-disciplinary evaluation method - Collaborative Space - Analysis Framework (CS-AF), designed to evaluate technology-mediated collaborative workflows. Through a 5-step meta-process, the CS-AF systematically analyzes gains and gaps of collaborative workflows, providing critical data for continuous workflow improvement.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2021)

Article Biology

Slice imputation: Multiple intermediate slices interpolation for anisotropic 3D medical image segmentation

Zhaotao Wu, Jia Wei, Jiabing Wang, Rui Li

Summary: This study introduces a novel frame-interpolation-based method for slice imputation to improve segmentation accuracy for anisotropic 3D medical images. Unlike previous methods, this study focuses on improving the smoothness of the interpolated 3D medical volumes in all three directions: axial, sagittal, and coronal. The proposed method incorporates a smoothness loss function to evaluate the smoothness in the through-plane direction and improves the resolution and isotropy of the interpolated volumes, leading to better segmentation performance.

COMPUTERS IN BIOLOGY AND MEDICINE (2022)

Article Biology

Learning multi-modal brain tumor segmentation from privileged semi-paired MRI images with curriculum disentanglement learning

Zecheng Liu, Jia Wei, Rui Li, Jianlong Zhou

Summary: Brain cancer is highly dangerous due to the importance of the brain as the primary command center. Automatic segmentation of brain tumors from multi-modal images is crucial for diagnosis and treatment. This study proposes a novel curriculum disentanglement learning framework for unimodal segmentation using limited paired images, which outperforms competing models on unimodal segmentation and demonstrates improved multi-modal segmentation performance.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Understanding Differences in Human-Robot Teaming Dynamics between Deaf/Hard of Hearing and Hearing Individuals

A'di Dust, Carola Gonzalez-Lebron, Shannon Connell, Saurav Singh, Reynold Bailey, Cecillia Ovesdotter Alm, Jamison Heard

Summary: With the development of industry 4.0, research is focusing on understanding the differences in interaction between hearing and deaf and hard of hearing individuals when collaborating with cobots. This understanding can lead to inclusive designs and strategies for more effective human-cobot interaction.

COMPANION OF THE ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2023 (2023)

Proceedings Paper Computer Science, Theory & Methods

Remote Early Research Experiences for Undergraduate Students in Computing

Cecilia O. Alm, Reynold Bailey, Hannah Miller

Summary: This paper provides an experience report on a remote framework for early undergraduate research experiences focused on sensing humans computationally. The framework consists of three components: team-based research cycle, professional development activities, and cohort-networking programming. The authors discuss the challenges and opportunities of remote training and evaluate its effectiveness through reflections and interviews.

PROCEEDINGS OF THE 53RD ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION (SIGCSE 2022), VOL 1 (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Handling Extreme Class Imbalance in Technical Logbook Datasets

Farhad Akhbardeh, Cecilia Ovesdotter Alm, Marcos Zampieri, Travis Desell

Summary: Technical logbooks are challenging text types in automated event identification due to their short length, non-standard yet technical language, and class imbalance issues. This paper introduces a feedback strategy from computer vision to handle extreme class imbalance, which significantly improves technical issue classification across various domains and datasets. This generic feedback strategy could be applied to any learning problem with substantial class imbalances.

59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Visualizing NLP in Undergraduate Students' Learning about Natural Language

Cecilia O. Alm, Alex Hedges

Summary: This study explores the use of open-source NLP capabilities in a web interface to enhance undergraduate students' understanding of formal natural language structures, encouraging critical thinking and evaluation of AI systems. The research emphasizes inclusivity in education resources and the importance of making AI systems interpretable for multi-disciplinary interest.

THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE (2021)

Proceedings Paper Computer Science, Cybernetics

Gaze-guided Magnification for Individuals with Vision Impairments

Natalie Maus, Dalton Rutledge, Sedeeq Al-Khazraji, Reynold Bailey, Cecilia Ovesdotter Alm, Kristen Shinohara

CHI'20: EXTENDED ABSTRACTS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (2020)

Proceedings Paper Computer Science, Artificial Intelligence

WHERE AM I LOOKING: LOCALIZING GAZE IN RECONSTRUCTED 3D SPACE

Devarth Parikh, Yawen Lu, Yuan Xin, Di Wu, Jeff Pelz, Guoyu Lu

2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP) (2019)

Proceedings Paper Computer Science, Interdisciplinary Applications

SNAG: Spoken Narratives and Gaze Dataset

Preethi Vaidyanathan, Emily Prud'hommeaux, Jeff B. Pelz, Cecilia O. Alm

PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2 (2018)

Article Ophthalmology

Motion tracking of iris features to detect small eye movements

Aayush K. Chaudhary, Jeff B. Pelz

JOURNAL OF EYE MOVEMENT RESEARCH (2019)

Article Computer Science, Artificial Intelligence

Disentangled generation network for enlarged license plate recognition and a unified dataset

Chenglong Li, Xiaobin Yang, Guohao Wang, Aihua Zheng, Chang Tan, Jin Tang

Summary: License plate recognition is crucial in various practical applications, however, recognizing license plates of large vehicles is challenging due to low resolution, contamination, low illumination, and occlusion. To address this problem, a novel data generation framework based on the Disentangled Generation Network is proposed to ensure the generation diversity and integrity for robust enlarged license plate recognition.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

Distributed multi-target tracking and active perception with mobile camera networks

Sara Casao, Alvaro Serra-Gomez, Ana C. Murillo, Wendelin Bohmer, Javier Alonso-Mora, Eduardo Montijano

Summary: This paper presents a hybrid camera system that combines static and mobile cameras, exploiting the cooperation between tracking and control modules to achieve high-level scene understanding. The static camera network provides global awareness, while the mobile cameras enhance the information about the people on the scene.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

Assessing domain gap for continual domain adaptation in object detection

Anh-Dzung Doan, Bach Long Nguyen, Surabhi Gupta, Ian Reid, Markus Wagner, Tat-Jun Chin

Summary: To ensure reliable object detection in autonomous systems, the detector needs to adapt to changes in appearance caused by environmental factors. We propose a selective adaptation approach using domain gap as a criterion to improve the efficiency of the detector's operation.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

Wavelet-based network for high dynamic range imaging

Tianhong Dai, Wei Li, Xilei Cao, Jianzhuang Liu, Xu Jia, Ales Leonardis, Youliang Yan, Shanxin Yuan

Summary: This study proposes a novel frequency-guided deep neural network (FHDRNet) for high dynamic range (HDR) imaging from multiple low dynamic range (LDR) images, aiming to address ghosting artifacts. By conducting HDR fusion in the frequency domain, the network utilizes low-frequency signals to remove specific ghosting artifacts and high-frequency signals to preserve details. Extensive experiments demonstrate that this approach achieves state-of-the-art performance.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

Feature learning based on connectivity estimation for unbiased mammography mass classification

Guobin Li, Reyer Zwiggelaar

Summary: Breast cancer is the most commonly diagnosed female malignancy worldwide. Recent developments in deep convolutional neural networks have shown promising performance for breast cancer detection and classification. However, biased features can be learned due to variations in appearance and small datasets. To address this issue, a densely connected convolutional network (DenseNet) was trained using texture features representing different physical morphological representations as inputs. The use of connectivity estimation and nearest neighbors improved the network's unbiased prediction. The approach achieved higher diagnostic accuracy and provided visual explanations for model predictions.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

AdaNI: Adaptive Noise Injection to improve adversarial robustness

Yuezun Li, Cong Zhang, Honggang Qi, Siwei Lyu

Summary: Deep Neural Networks (DNNs) are vulnerable to adversarial perturbations, limiting their applicability in safe-critical scenarios. To address this, a new method called AdaNI is proposed to increase feature randomness through adaptive noise injection, improving adversarial robustness. Extensive experiments demonstrate the efficacy of AdaNI against various white-box and black-box attacks, as well as its applicability in DeepFake detection.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

Adversarial Neon Beam: A light-based physical attack to DNNs

Chengyin Hu, Weiwen Shi, Ling Tian, Wen Li

Summary: In this study, we introduce a pioneering black-box light-based physical attack called Adversarial Neon Beam (AdvNB). Our method excels in attack modeling, efficient attack simulation, and robust optimization, striking a balance between robustness and efficiency. Through rigorous evaluation, we achieve impressive attack success rates in both digital and real-world scenarios. AdvNB demonstrates its stealthiness through comparisons with baseline samples and consistently achieves high success rates when targeting advanced DNN models.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

MDC-Net: Multi-domain constrained kernel estimation network for blind image super resolution

Hang Wang, Zhenyu Ding, Cheng Cheng, Yuhai Li, Hongbin Sun

Summary: Learning-based super resolution has made remarkable progress in improving image quality, but the performance decreases when the degradation kernel changes. Blind SR networks can estimate the degradation kernel and adapt well in realistic scenarios, improving performance and runtime. This paper proposes a design that imposes constraints for the kernel estimation network in both the image domain and kernel domain, resulting in high-quality images and efficient runtime.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

MFMAM: Image inpainting via multi-scale feature module with attention module

Yuantao Chen, Runlong Xia, Kai Yang, Ke Zou

Summary: This paper proposes an improved image inpainting network using a multi-scale feature module and improved attention module. The network addresses issues in deep learning-based image inpainting algorithms, such as information loss in deep level features and the neglect of semantic features. The proposed network generates better inpainting results by reducing information loss and enhancing the ability to restore texture and semantic features.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)

Article Computer Science, Artificial Intelligence

Ensemble learning-based method for maritime background subtraction in open sea environments

Yi-Tung Chan

Summary: This study proposes a novel maritime background subtraction method based on ensemble learning theory to address the challenges posed by dynamic marine environments and noise, improving the detection accuracy and enhancing maritime transportation security for autonomous ships in open waters.

COMPUTER VISION AND IMAGE UNDERSTANDING (2024)