Article

Unsupervised machine learning for the discovery of latent disease clusters and patient subgroups using electronic health records

Journal

JOURNAL OF BIOMEDICAL INFORMATICS
Volume 102

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2019.103364

Keywords

Unsupervised Machine learning; Artificial intelligence; Electronic health records; Epidemiology; Aging

Funding

  1. NIH [P01AG004875, R01GM102282, UL1TR002377, U01TR002062, R01LM011934]
  2. Mayo Clinic internal grants
  3. U.S. Public Health Service
  4. Rochester Epidemiology Project [R01AG034676]

Abstract

Machine learning has become a ubiquitous and key technology for mining electronic health records (EHRs) to facilitate clinical research and practice. Unsupervised machine learning, as opposed to supervised learning, has shown promise in identifying novel patterns and relations in EHRs without using human-created labels. In this paper, we investigate the application of unsupervised machine learning models to the discovery of latent disease clusters and patient subgroups based on EHRs. We used Latent Dirichlet Allocation (LDA), a generative probabilistic model, and proposed a novel model named the Poisson Dirichlet Model (PDM), which extends the LDA approach by using a Poisson distribution to model patients' disease diagnoses and alleviates the influence of age and sex by considering both observed and expected observations. In the empirical experiments, we evaluated LDA and PDM on three patient cohorts, namely the Osteoporosis, Delirium/Dementia, and Chronic Obstructive Pulmonary Disease (COPD)/Bronchiectasis cohorts, with EHR data retrieved from the Rochester Epidemiology Project (REP) medical records linkage system. We compared the effectiveness of LDA and PDM in identifying disease clusters by visualizing the disease representations. We tested the performance of LDA and PDM in differentiating patient subgroups through survival analysis, as well as statistical analysis of demographics and Elixhauser Comorbidity Index (ECI) scores in those subgroups. The experimental results show that the proposed PDM could effectively identify distinct disease clusters based on the latent patterns hidden in the EHR data by alleviating the impact of age and sex, and that LDA could stratify patients into more differentiable subgroups, with smaller p-values, than PDM. However, the subgroups identified by LDA are highly associated with patients' age and sex. The subgroups discovered by PDM might reflect underlying disease patterns of greater interest in epidemiology research because the effects of age and sex are alleviated. Both unsupervised machine learning approaches could be leveraged to discover patient subgroups using EHRs, but with different foci.
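
The idea of comparing observed diagnosis counts with the counts expected for a patient's age and sex can be illustrated with a small sketch. This is not the authors' PDM: the toy data, the stratum definition, and the helper names (`stratum_rates`, `oe_ratios`, `poisson_log_pmf`) are all assumptions made for illustration. The sketch only shows how a Poisson view of counts and an observed-to-expected ratio might discount what age and sex alone would predict.

```python
import math
from collections import defaultdict

# Toy, made-up data (not from the paper): each patient has an
# (age band, sex) stratum and per-disease diagnosis counts.
patients = {
    "p1": {"stratum": ("65-74", "F"), "counts": {"osteoporosis": 4, "copd": 0}},
    "p2": {"stratum": ("65-74", "F"), "counts": {"osteoporosis": 2, "copd": 0}},
    "p3": {"stratum": ("75-84", "M"), "counts": {"osteoporosis": 0, "copd": 6}},
    "p4": {"stratum": ("75-84", "M"), "counts": {"osteoporosis": 0, "copd": 2}},
}


def stratum_rates(patients):
    """Mean diagnosis count per disease within each (age band, sex) stratum."""
    totals = defaultdict(lambda: defaultdict(float))
    sizes = defaultdict(int)
    for rec in patients.values():
        sizes[rec["stratum"]] += 1
        for disease, n in rec["counts"].items():
            totals[rec["stratum"]][disease] += n
    return {s: {d: t / sizes[s] for d, t in ds.items()} for s, ds in totals.items()}


def poisson_log_pmf(k, lam):
    """log P(K = k) for K ~ Poisson(lam), with lam > 0."""
    return k * math.log(lam) - lam - math.lgamma(k + 1)


def oe_ratios(patients):
    """Observed / expected count per patient and disease.

    The expected count is the patient's stratum mean (an assumption of this
    sketch). The ratio is None when the expected count is zero.
    """
    rates = stratum_rates(patients)
    ratios = {}
    for pid, rec in patients.items():
        expected = rates[rec["stratum"]]
        ratios[pid] = {
            d: (n / expected[d] if expected[d] > 0 else None)
            for d, n in rec["counts"].items()
        }
    return ratios


ratios = oe_ratios(patients)
rates = stratum_rates(patients)
# Log-probability of p3's observed COPD count under the stratum expectation.
p3_loglik = poisson_log_pmf(6, rates[("75-84", "M")]["copd"])
```

A ratio above 1 marks a patient with more diagnoses of a disease than their age and sex stratum would suggest, which is the kind of signal an age- and sex-adjusted model would emphasize over raw counts.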

Recommended

Article Computer Science, Artificial Intelligence

ST-V-Net: incorporating shape prior into convolutional neural networks for proximal femur segmentation

Chen Zhao, Joyce H. Keyak, Jinshan Tang, Tadashi S. Kaneko, Sundeep Khosla, Shreyasee Amin, Elizabeth J. Atkinson, Lan-Juan Zhao, Michael J. Serou, Chaoyang Zhang, Hui Shen, Hong-Wen Deng, Weihua Zhou

Summary: In this study, we aimed to develop a deep-learning-based method for automatic proximal femur segmentation in quantitative computed tomography (QCT) images. We proposed a spatial transformation V-Net (ST-V-Net) that incorporates a shape prior into the segmentation network to improve model performance. Experimental results showed excellent performance of the proposed ST-V-Net for automatic proximal femur segmentation in QCT images.

COMPLEX & INTELLIGENT SYSTEMS (2023)

Article Psychology, Clinical

The importance of social activity to risk of major depression in older adults

Euijung Ryu, Gregory D. Jenkins, Yanshan Wang, Mark Olfson, Ardesheer Talati, Lauren Lepow, Brandon J. Coombes, Alexander W. Charney, Benjamin S. Glicksberg, J. John Mann, Myrna M. Weissman, Priya Wickramaratne, Jyotishman Pathak, Joanna M. Biernacka

Summary: This study aimed to identify the most relevant social determinants of health (SDoH) related to major depressive disorder (MDD) in older adults. The results showed that the perceived level of social activity was the most influential SDoH variable, with a lower level of social activity associated with a higher risk of MDD.

PSYCHOLOGICAL MEDICINE (2023)

Article Cell Biology

Targeted clearance of p21- but not p16-positive senescent cells prevents radiation-induced osteoporosis and increased marrow adiposity

Abhishek Chandra, Anthony B. Lagnado, Joshua N. Farr, Madison Doolittle, Tamara Tchkonia, James L. Kirkland, Nathan K. LeBrasseur, Paul D. Robbins, Laura J. Niedernhofer, Yuji Ikeno, Joao F. Passos, David G. Monroe, Robert J. Pignolo, Sundeep Khosla

Summary: This study demonstrates that cellular senescence-driven radiation-induced osteoporosis is primarily mediated by p21(Cip1) rather than p16(Ink4a), based on the clearance of senescent cells using genetic models. This approach may be used to investigate the contributions of these pathways in other senescence-associated conditions, including aging across tissues.

AGING CELL (2022)

Editorial Material Endocrinology & Metabolism

Osteoporosis in the USA: prevention and unmet needs

Sundeep Khosla, Nicole C. Wright, Ann L. Elderkin, Douglas P. Kiel

LANCET DIABETES & ENDOCRINOLOGY (2023)

Article Geriatrics & Gerontology

Drugs Targeting Mechanisms of Aging to Delay Age-Related Disease and Promote Healthspan: Proceedings of a National Institute on Aging Workshop

Sara E. Espinoza, Sundeep Khosla, Joseph A. Baur, Rafael de Cabo, Nicolas Musi

Summary: The geroscience hypothesis suggests that targeting key hallmarks of aging can improve healthspan and prevent age-related diseases. Several pharmacological interventions, including senolytics, NAD(+) boosters, and metformin, are being studied for their potential benefits. Preclinical studies show that senolytic drugs improve healthspan in rodents, while increasing NAD(+) through supplementation appears to extend healthspan in model organisms. Metformin, on the other hand, has pleiotropic effects and is being examined for its potential to improve healthspan and prevent frailty in clinical trials. However, further research is needed to determine their efficacy, safety, target populations, and long-term outcomes.

JOURNALS OF GERONTOLOGY SERIES A-BIOLOGICAL SCIENCES AND MEDICAL SCIENCES (2023)

Article Endocrinology & Metabolism

MicroRNA-19a-3p Decreases with Age in Mice and Humans and Inhibits Osteoblast Senescence

Japneet Kaur, Dominik Saul, Madison L. Doolittle, Joshua N. Farr, Sundeep Khosla, David G. Monroe

Summary: Aging is associated with an accumulation of senescent cells in various tissues, including bones. This study found that a specific miRNA, miR-19a-3p, decreases with age in mouse and human bones. Furthermore, inducing senescence in mouse bone marrow stromal cells also reduced the levels of miR-19a-3p. The study suggests that miR-19a-3p could be a potential therapeutic target for age-related bone loss.

JBMR PLUS (2023)

Article Endocrinology & Metabolism

Opportunistic Screening With CT: Comparison of Phantomless BMD Calibration Methods

Stefan Bartenschlager, Alexander Cavallaro, Tobias Pogarell, Oliver Chaudry, Michael Uder, Sundeep Khosla, Georg Schett, Klaus Engelke

Summary: Opportunistic screening is a promising technique for identifying individuals at high risk for osteoporotic fracture using CT scans. This study compared the performance of four existing phantomless calibration methods and found that precalibrated phantomless calibration methods performed well.

JOURNAL OF BONE AND MINERAL RESEARCH (2023)

Article Multidisciplinary Sciences

Multiparametric senescent cell phenotyping reveals targets of senolytic therapy in the aged murine skeleton

Madison L. Doolittle, Dominik Saul, Japneet Kaur, Jennifer L. Rowsey, Stephanie J. Vos, Kevin D. Pavelko, Joshua N. Farr, David G. Monroe, Sundeep Khosla

Summary: The article provides a detailed characterization of senescent skeletal cells in vivo, identifying a population of senescent cells associated with age and increased in late osteoblasts/osteocytes and CD24(high) osteolineage cells. The authors also establish CD24 as a marker for skeletal cells cleared by senolytics.

NATURE COMMUNICATIONS (2023)

Article Endocrinology & Metabolism

Single-Cell Integration of BMD GWAS Results Prioritize Candidate Genes Influencing Age-Related Bone Loss

Madison L. Doolittle, Sundeep Khosla, Dominik Saul

Summary: The regulation of bone mineral density (BMD) is influenced by genetics and age. Genome-wide association studies (GWAS) have identified many genes associated with BMD, but their specific mechanisms in different cell types and during aging are still unclear. By analyzing age-related transcriptomics and single-cell RNA-sequencing (scRNA-seq) datasets, this study investigated the cell-specific expression of GWAS candidate genes and identified enrichment in various cells related to bone metabolism. The findings provide potential therapeutic targets for osteoporosis treatment.

JBMR PLUS (2023)

Article Endocrinology & Metabolism

Modest Effects of Osteoclast-Specific ERα Deletion after Skeletal Maturity

Madison L. Doolittle, Brittany A. Eckhardt, Stephanie J. Vos, Sarah Grain, Jennifer L. Rowsey, Ming Ruan, Dominik Saul, Joshua N. Farr, Megan M. Weivoda, Sundeep Khosla, David G. Monroe

Summary: Estrogen plays a crucial role in regulating bone mass, primarily through its action on ERalpha. Recent studies have shown that estrogen action in osteocytes is more important than in osteoclasts, and the loss of ERalpha in specific cell types results in decreased bone volume and reduced bone formation rate.

JBMR PLUS (2023)

Article Endocrinology & Metabolism

Reference Intervals for Bone Impact Microindentation in Healthy Adults: A Multi-Centre International Study

Pamela Rufus-Membere, Kara L. Holloway-Kew, Adolfo Diez-Perez, Natasha M. Appelman-Dijkstra, Mary L. Bouxsein, Erik F. Eriksen, Joshua N. Farr, Sundeep Khosla, Mark A. Kotowicz, Xavier Nogues, Mishaela Rubin, Julie A. Pasco

Summary: Impact microindentation (IMI) is a novel technique for assessing bone material strength index (BMSi) in vivo. The aim of this study was to define the reference intervals for men and women by evaluating healthy adults from multiple countries. BMSi values ranged from 48 to 101, with mean values of 84.4 +/- 6.9 for men and 79.0 +/- 9.1 for women.

CALCIFIED TISSUE INTERNATIONAL (2023)

Article Computer Science, Interdisciplinary Applications

Fair patient model: Mitigating bias in the patient representation learned from the electronic health records

Sonish Sivarajkumar, Yufei Huang, Yanshan Wang

Summary: This study proposes a novel method to pre-train fair and unbiased patient representations from EHR data using a weighted loss function. The experimental results show that this method outperforms the baseline models in fairness metrics and achieves comparable predictive performance. The study also reveals that the method captures more information from clinical features.

JOURNAL OF BIOMEDICAL INFORMATICS (2023)

Review Health Care Sciences & Services

Adopting and expanding ethical principles for generative artificial intelligence from military to healthcare

David Oniani, Jordan Hilsman, Yifan Peng, Ronald K. Poropatich, Jeremy C. Pamplin, Gary L. Legault, Yanshan Wang

Summary: This article discusses the core similarities between the military and medical service and the ethical concerns posed by the application of generative AI in healthcare. It proposes a set of ethical principles called GREAT PLEA from the military perspective and introduces a framework for applying these principles. By addressing ethical dilemmas and challenges, the aim is to integrate generative AI into healthcare practice.

NPJ DIGITAL MEDICINE (2023)

Article Endocrinology & Metabolism

Guanylyl Cyclase-B Dependent Bone Formation in Mice is Associated with Youth, Increased Osteoblasts, and Decreased Osteoclasts

Brandon M. Wagner, Jerid W. Robinson, Timothy C. R. Prickett, Eric A. Espiner, Sundeep Khosla, Dana Gaddy, Larry J. Suva, Lincoln R. Potter

Summary: C-type natriuretic peptide (CNP) activates guanylyl cyclase-B (GC-B) to catalyze the synthesis of cGMP, which affects bone length and mass. GC-B mutant mice with increased CNP-dependent GC-B activity show increased bone length, mass, and strength. However, the mechanism by which GC-B increases bone mass remains unknown.

CALCIFIED TISSUE INTERNATIONAL (2022)

Meeting Abstract Endocrinology & Metabolism

Effects of miR-219a-5p expression on bone metabolism using a novel transgenic mouse model

Japneet Kaur, Jennifer L. Rowsey, Stephanie J. Vos, Joshua N. Farr, Sundeep Khosla, David G. Monroe

JOURNAL OF BONE AND MINERAL RESEARCH (2022)
