4.6 Article

Modeling asynchronous event sequences with RNNs

Journal

JOURNAL OF BIOMEDICAL INFORMATICS
Volume 83, Issue -, Pages 167-177

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2018.05.016

Keywords

Temporal data; Deep learning; Electronic health records; Asthma

Funding

  1. NIH [R21AI116839, R01LM011934]

Ask authors/readers for more resources

Sequences of events have often been modeled with computational techniques, but typical preprocessing steps and problem settings do not explicitly address the ramifications of timestamped events. Clinical data, such as is found in electronic health records (EHRs), typically comes with timestamp information. In this work, we define event sequences and their properties: synchronicity, evenness, and co-cardinality; we then show how asynchronous, uneven, and multi-cardinal problem settings can support explicit accountings of relative dine. Our evaluation uses the temporally sensitive clinical use case of pediatric asthma, which is a chronic disease with symptoms (and lack thereof) evolving over time. We show several approaches to explicitly incorporating relative time into a recurrent neural network (RNN) model that improve the overall classification of patients into those with no asthma, those with persistent asthma, those in long-term remission, and those who have experienced relapse. We also compare and contrast these results with those in an inpatient intensive care setting.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Information Systems

Automated extraction of sudden cardiac death risk factors in hypertrophic cardiomyopathy patients by natural language processing

Sungrim Moon, Sijia Liu, Christopher G. Scott, Sujith Samudrala, Mohamed M. Abidian, Jeffrey B. Geske, Peter A. Noseworthy, Jane L. Shellum, Rajeev Chaudhry, Steve R. Ommen, Rick A. Nishimura, Hongfang Liu, Adelaide M. Arruda-Olson

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS (2019)

Article Computer Science, Interdisciplinary Applications

HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology

Feichen Shen, Suyuan Peng, Yadan Fan, Andrew Wen, Sijia Liu, Yanshan Wang, Liwei Wang, Hongfang Liu

JOURNAL OF BIOMEDICAL INFORMATICS (2019)

Review Computer Science, Interdisciplinary Applications

Clinical concept extraction: A methodology review

Sunyang Fu, David Chen, Huan He, Sijia Liu, Sungrim Moon, Kevin J. Peterson, Feichen Shen, Liwei Wang, Yanshan Wang, Andrew Wen, Yiqing Zhao, Sunghwan Sohn, Hongfang Liu

JOURNAL OF BIOMEDICAL INFORMATICS (2020)

Article Medical Informatics

Implementation of a Cohort Retrieval System for Clinical Data Repositories Using the Observational Medical Outcomes Partnership Common Data Model: Proof-of-Concept System Validation

Sijia Liu, Yanshan Wang, Andrew Wen, Liwei Wang, Na Hong, Feichen Shen, Steven Bedrick, William Hersh, Hongfang Liu

JMIR MEDICAL INFORMATICS (2020)

Article Computer Science, Interdisciplinary Applications

An aberration detection-based approach for sentinel syndromic surveillance of COVID-19 and other novel influenza-like illnesses

Andrew Wen, Liwei Wang, Huan He, Sijia Liu, Sunyang Fu, Sunghwan Sohn, Jacob A. Kugel, Vinod C. Kaggal, Ming Huang, Yanshan Wang, Feichen Shen, Jungwei Fan, Hongfang Liu

Summary: After the outbreak of the pandemic, early detection and intervention are key to managing the situation. Syndromic surveillance could offer a timelier screening option, but existing solutions often struggle to distinguish outbreaks of diseases sharing similar symptoms, posing a challenge for monitoring COVID-19.

JOURNAL OF BIOMEDICAL INFORMATICS (2021)

Article Medical Informatics

Family History Extraction From Synthetic Clinical Narratives Using Natural Language Processing: Overview and Evaluation of a Challenge Data Set and Solutions for the 2019 National NLP Clinical Challenges (n2c2)/Open Health Natural Language Processing (OHNLP) Competition

Feichen Shen, Sijia Liu, Sunyang Fu, Yanshan Wang, Sam Henry, Ozlem Uzuner, Hongfang Liu

Summary: The n2c2/OHNLP FH extraction task aimed to standardize evaluation and system development on FH extraction, with 17 teams participating and top performance by Harbin Institute of Technology. Results indicate that relation extraction from FH is more challenging than entity identification task.

JMIR MEDICAL INFORMATICS (2021)

Article Biochemical Research Methods

MedTator: a serverless annotation tool for corpus development

Huan He, Sunyang Fu, Liwei Wang, Sijia Liu, Andrew Wen, Hongfang Liu

Summary: Building a high-quality annotation corpus is time-consuming and requires expertise, but existing annotation tools often have difficulties with installation, integration, and usability. This paper presents MedTator, a new serverless annotation tool with an intuitive and interactive user interface, focusing on the core steps of corpus annotation.

BIOINFORMATICS (2022)

Article Computer Science, Interdisciplinary Applications

Developing an ETL tool for converting the PCORnet CDM into the OMOP CDM to facilitate the COVID-19 data integration

Yue Yu, Nansu Zong, Andrew Wen, Sijia Liu, Daniel J. Stone, David Knaack, Alanna M. Chamberlain, Emily Pfaff, Davera Gabriel, Christopher G. Chute, Nilay Shah, Guoqian Jiang

Summary: This study designed, developed, and evaluated an ETL tool that transforms data from the PCORnet CDM format to the OMOP CDM format. The results showed that the tool successfully converted the data, with minimal information loss and high mapping accuracy. The tool was also able to be used for COVID-19 surveillance and met the data collection criteria for the MN EHR Consortium COVID-19 project.

JOURNAL OF BIOMEDICAL INFORMATICS (2022)

Review Oncology

Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing

Liwei Wang, Sunyang Fu, Andrew Wen, Xiaoyang Ruan, Huan He, Sijia Liu, Sungrim Moon, Michelle Mai, Irbaz B. Riaz, Nan Wang, Ping Yang, Hua Xu, Jeremy L. Warner, Hongfang Liu

Summary: This review assesses the use of natural language processing (NLP) in electronic health records (EHRs) for cancer research and patient care. The findings highlight the need for additional data elements beyond the Minimal Common Oncology Data Elements (mCODE) for comprehensive analysis and evaluation. The review also identifies challenges and barriers in the adoption of NLP methods for cancer research and patient care.

JCO CLINICAL CANCER INFORMATICS (2022)

Article Medical Informatics

Acquisition of a Lexicon for Family History Information: Bidirectional Encoder Representations From Transformers-Assisted Sublanguage Analysis

Liwei Wang, Huan He, Andrew Wen, Sungrim Moon, Sunyang Fu, Kevin J. Peterson, Xuguang Ai, Sijia Liu, Ramakanth Kavuluru, Hongfang Liu

Summary: Without a standardized method to capture family history (FH) information, FH information in electronic health records is difficult to use in data analytics or clinical decision support applications. This study aimed to construct an FH lexical resource for information extraction and normalization. Using a transformer-based method, a lexicon was developed and demonstrated through the development of rule-based and deep learning-based FH systems. The evaluation showed that the rule-based FH system performed well, and combining rule-based and deep learning-based systems improved FH information recall.

JMIR MEDICAL INFORMATICS (2023)

Article Computer Science, Information Systems

An open natural language processing (NLP) framework for EHR-based clinical research: a case demonstration using the National COVID Cohort Collaborative (N3C)

Sijia Liu, Andrew Wen, Liwei Wang, Huan He, Sunyang Fu, Robert Miller, Andrew Williams, Daniel Harris, Ramakanth Kavuluru, Mei Liu, Noor Abu-el-Rub, Dalton Schutte, Rui Zhang, Masoud Rouhizadeh, John D. Osborne, Yongqun He, Umit Topaloglu, Stephanie S. Hong, Joel H. Saltz, Thomas Schaffter, Emily Pfaff, Christopher G. Chute, Tim Duong, Melissa A. Haendel, Rafael Fuentes, Peter Szolovits, Hua Xu, Hongfang Liu

Summary: Despite recent advancements in clinical natural language processing (NLP), the adoption of clinical NLP models in translational research is hindered by process heterogeneity and human factor variations. Developing NLP models in multi-site settings is challenging, but essential for algorithm robustness and generalizability. This study reports on the development of an NLP solution for COVID-19 signs and symptom extraction using an open NLP framework, highlighting the benefits of multi-site data and the need for federated annotation and evaluation to overcome challenges.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2023)

Article Health Care Sciences & Services

Patient Portal Messaging for Asynchronous Virtual Care During the COVID-19 Pandemic: Retrospective Analysis

Ming Huang, Aditya Khurana, George Mastorakos, Andrew Wen, Huan He, Liwei Wang, Sijia Liu, Yanshan Wang, Nansu Zong, Julie Prigge, Brian Costello, Nilay Shah, Henry Ting, Jungwei Fan, Christi Patten, Hongfang Liu

Summary: This study analyzed patient portal messages during the COVID-19 pandemic to understand patient responses to the crisis. Most messages were related to COVID-19 symptom assessment and testing results. Trends in message usage correlated with national data on new cases and hospitalizations.

JMIR HUMAN FACTORS (2022)

Article Health Care Sciences & Services

A fast, resource efficient, and reliable rule-based system for COVID-19 symptom identification

Himanshu S. Sahoo, Greg M. Silverman, Nicholas E. Ingraham, Monica Lupei, Michael A. Puskarich, Raymond L. Finzel, John Sartori, Rui Zhang, Benjamin C. Knoll, Sijia Liu, Hongfang Liu, Genevieve B. Melton, Christopher J. Tignanelli, Serguei V. S. Pakhomov

Summary: The rule-based gazetteer developed in this study showed superior speed, resource utilization, and performance, providing an effective solution for real-time symptom identification and integration of unstructured data elements into clinical decision support systems. Fine-tuning lexical rules and running on multiple compute nodes were identified as opportunities to further enhance its performance.

JAMIA OPEN (2021)

Article Health Care Sciences & Services

Desiderata for delivering NLP to accelerate healthcare AI advancement and a Mayo Clinic NLP-as-a-service implementation

Andrew Wen, Sunyang Fu, Sungrim Moon, Mohamed El Wazir, Andrew Rosenbaum, Vinod C. Kaggal, Sijia Liu, Sunghwan Sohn, Hongfang Liu, Jungwei Fan

NPJ DIGITAL MEDICINE (2019)

Article Health Care Sciences & Services

Deep learning and alternative learning strategies for retrospective real-world clinical data

David Chen, Sijia Liu, Paul Kingsbury, Sunghwan Sohn, Curtis B. Storlie, Elizabeth B. Habermann, James M. Naessens, David W. Larson, Hongfang Liu

NPJ DIGITAL MEDICINE (2019)

No Data Available