4.7 Article Proceedings Paper

Deep learning with multimodal representation for pancancer prognosis prediction

期刊

BIOINFORMATICS
卷 35, 期 14, 页码 I446-I454

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btz342

关键词

-

资金

  1. National Institute of Biomedical Imaging and Bioengineering of the National Institutes of Health [R01EB020527]
  2. National Institute of Dental and Craniofacial Research (NIDCR) [U01DE025188]
  3. National Cancer Institute (NCI) [U01CA199241, U01CA217851]

向作者/读者索取更多资源

Motivation Estimating the future course of patients with cancer lesions is invaluable to physicians; however, current clinical methods fail to effectively use the vast amount of multimodal data that is available for cancer patients. To tackle this problem, we constructed a multimodal neural network-based model to predict the survival of patients for 20 different cancer types using clinical data, mRNA expression data, microRNA expression data and histopathology whole slide images (WSIs). We developed an unsupervised encoder to compress these four data modalities into a single feature vector for each patient, handling missing data through a resilient, multimodal dropout method. Encoding methods were tailored to each data type-using deep highway networks to extract features from clinical and genomic data, and convolutional neural networks to extract features from WSIs. Results We used pancancer data to train these feature encodings and predict single cancer and pancancer overall survival, achieving a C-index of 0.78 overall. This work shows that it is possible to build a pancancer model for prognosis that also predicts prognosis in single cancer sites. Furthermore, our model handles multiple data modalities, efficiently analyzes WSIs and represents patient multimodal data flexibly into an unsupervised, informative representation. We thus present a powerful automated tool to accurately determine prognosis, a key step towards personalized treatment for cancer patients. Availability and implementation https://github.com/gevaertlab/MultimodalPrognosis

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Oncology

Peripheral blood DNA methylation profiles predict future development of B-cell Non-Hodgkin Lymphoma

Almudena Espin-Perez, Kevin Brennan, Asiri Saumya Ediriwickrema, Olivier Gevaert, Izidore S. Lossos, Andrew J. Gentles

Summary: The lack of accurate early detection methods for lymphoma limits the ability to cure patients. A DNA methylation-based prediction tool for NHL was developed, which showed high accuracy in identifying patients at risk of developing future NHL and detecting active NHL and healthy status.

NPJ PRECISION ONCOLOGY (2022)

Article Engineering, Biomedical

Finding the Spatial Co-Variation of Brain Deformation With Principal Component Analysis

Xianghao Zhan, Yuzhe Liu, Nicholas J. Cecchi, Olivier Gevaert, Michael M. Zeineh, Gerald A. Grant, David B. Camarillo

Summary: The study utilized principal component analysis (PCA) to analyze the spatial co-variation of injury metrics in four types of head impacts, aiding in the improvement of the machine learning head model (MLHM). PCA-MLHM reduced model parameters by 74% with comparable MPS estimation accuracy.

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING (2022)

Article Computer Science, Information Systems

Reliably Filter Drug-Induced Liver Injury Literature With Natural Language Processing and Conformal Prediction

Xianghao Zhan, Fanjin Wang, Olivier Gevaert

Summary: Drug-induced liver injury refers to the adverse effects of drugs on the liver, and it is important to assess new drug candidates. This study developed a model using natural language processing techniques to rapidly filter literature and find relevant information about liver injury induced by medications. The ensemble model and TF-IDF model achieved satisfactory classification results.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2022)

Article Multidisciplinary Sciences

Accurate detection of benign and malignant renal tumor subtypes with MethylBoostER: An epigenetic marker-driven learning framework

Sabrina H. Rossi, Izzy Newsham, Sara Pita, Kevin Brennan, Gahee Park, Christopher G. Smith, Radoslaw P. Lach, Thomas Mitchell, Junfan Huang, Anne Babbage, Anne Y. Warren, John T. Leppert, Grant D. Stewart, Olivier Gevaert, Charles E. Massie, Shamith A. Samarajiwa

Summary: Current diagnostic strategies are unable to differentiate between benign and malignant small renal masses accurately, leading to unnecessary surgery in 20% of patients. The MethylBoostER machine learning model, utilizing DNA methylation data, can classify pathological subtypes of renal tumors and provide a more confident presurgical diagnosis, potentially improving treatment decision-making.

SCIENCE ADVANCES (2022)

Article Biology

Machine intelligence for radiation science: summary of the Radiation Research Society 67th annual meeting symposium

Lydia J. Wilson, Frederico C. Kiffer, Daniel C. Berrios, Abigail Bryce-Atkinson, Sylvain V. Costes, Olivier Gevaert, Bruno F. E. Matarese, Jack Miller, Pritam Mukherjee, Kristen Peach, Paul N. Schofield, Luke T. Slater, Britta Langen

Summary: The era of high-throughput techniques has generated large amounts of data in the medical and research fields. Machine intelligence (MI) approaches are being used to overcome limitations in processing, analyzing, and interpreting these massive data sets. The 67th Annual Meeting of the Radiation Research Society featured a symposium on MI approaches, highlighting recent advancements in radiation sciences and their clinical applications. This article summarizes three presentations on metadata processing and ontological formalization, data mining for radiation outcomes in pediatric oncology, and imaging in lung cancer.

INTERNATIONAL JOURNAL OF RADIATION BIOLOGY (2023)

Article Oncology

Targeting KDM2A Enhances T-cell Infiltration in NSD1-Deficient Head and Neck Squamous Cell Carcinoma

Chen Chen, June Ho Shin, Zhuoqing Fang, Kevin Brennan, Nina B. Horowitz, Kathleen L. Pfaff, Emma L. Welsh, Scott J. Rodig, Olivier Gevaert, Or Gozani, Ravindra Uppaluri, John B. Sunwoo

Summary: In head and neck squamous cell carcinoma (HNSCC), inactivating mutations in the histone methyltransferase NSD1 disproportionately contribute to tumor development and immune exclusion. Understanding the NSD1-mediated mechanism and targeting the histone-modifying enzyme KDM2A could enhance T-cell infiltration and suppress tumor growth in HNSCC.

CANCER RESEARCH (2023)

Article Biochemistry & Molecular Biology

A deep-learning algorithm to classify skin lesions from mpox virus infection

Alexander H. H. Thieme, Yuanning Zheng, Gautam Machiraju, Chris Sadee, Mirja Mittermaier, Maximilian Gertler, Jorge L. Salinas, Krithika Srinivasan, Prashnna Gyawali, Francisco Carrillo-Perez, Angelo Capodici, Maximilian Uhlig, Daniel Habenicht, Anastassia Loeser, Maja Kohler, Maximilian Schuessler, David Kaul, Johannes Gollrad, Jackie Ma, Christoph Lippert, Kendall Billick, Isaac Bogoch, Tina Hernandez-Boussard, Pascal Geldsetzer, Olivier Gevaert

Summary: A deep-learning algorithm, MPXV-CNN, was developed to identify skin lesions caused by the mpox virus for early detection and mitigation. It demonstrated a sensitivity of 0.83-0.91 and a specificity of 0.965-0.898 across different datasets. The algorithm was robust in classifying lesions on various skin tones and body regions, and a web-based app was developed for patient guidance.

NATURE MEDICINE (2023)

Article Multidisciplinary Sciences

A Large-scale Synthetic Pathological Dataset for Deep Learning-enabled Segmentation of Breast Cancer

Kexin Ding, Mu Zhou, He Wang, Olivier Gevaert, Dimitris Metaxas, Shaoting Zhang

Summary: To enhance computational pathology, we introduce a large-scale synthetic pathological image dataset paired with nucleus annotations, called SNOW. By applying off-the-shelf image generator and nuclei annotator, SNOW offers a cost-effective means to improve model performance. Results show that models trained on synthetic data are competitive and expand the use of synthetic images for data-driven clinical tasks.

SCIENTIFIC DATA (2023)

Article Computer Science, Artificial Intelligence

Multimodal data fusion for cancer biomarker discovery with deep learning

Sandra Steyaert, Marija Pizurica, Divya Nagaraj, Priya Khandelwal, Tina Hernandez-Boussard, Andrew J. Gentles, Olivier Gevaert

Summary: Cancer diagnosis and treatment decisions often focus on a single data source. However, there is a need for effective multimodal fusion approaches to integrate complementary data types. The current technological advances and introduction of deep learning have the potential to address the challenges of data integration in cancer research.

NATURE MACHINE INTELLIGENCE (2023)

Article Dermatology

Best Practices for Clinical Skin Image Acquisition in Translational Artificial Intelligence Research

Michelle Phung, Vijaytha Muralidharan, Veronica Rotemberg, Roberto Andres Novoa, Albert Sean Chiou, Christoph Y. Sadee, Bailie Rapaport, Kiana Yekrang, Jared Bitz, Olivier Gevaert, Justin Meng Ko, Roxana Daneshjou

Summary: Recent developments in artificial intelligence research have led to the increased use of algorithms for detecting malignancies in clinical and dermoscopic images of skin diseases. Gathering training and testing data is crucial for these methods. This paper explores the best practices and challenges in collecting skin images and data for translational artificial intelligence research, including ethics, image acquisition, labeling, curation, and storage. The aim is to enhance malignancy detection using artificial intelligence by facilitating intentional data collection and collaboration between dermatologists and data scientists.

JOURNAL OF INVESTIGATIVE DERMATOLOGY (2023)

Article Multidisciplinary Sciences

Spatial cellular architecture predicts prognosis in glioblastoma

Yuanning Zheng, Francisco Carrillo-Perez, Marija Pizurica, Dieter Henrik Heiland, Olivier Gevaert

Summary: In this study, two deep learning models were used to predict the transcriptional subtypes and prognosis of glioblastoma (GBM) cells from histology images. The results showed consistent associations between spatial cellular organization and patient prognosis. The study also confirmed that transcriptional heterogeneity and cell-state plasticity are key factors in the development of therapeutic resistance in GBM.

NATURE COMMUNICATIONS (2023)

Article Biochemical Research Methods

EpiMix is an integrative tool for epigenomic subtyping using DNA methylation

Yuanning Zheng, John Jun, Kevin Brennan, Olivier Gevaert

Summary: DNA methylation is an important epigenetic factor that affects gene expression, and alterations in it can lead to cancer and immunological and cardiovascular diseases. Recent advancements in technology have made it possible to analyze DNA methylation on a genome-wide scale in large human cohorts. This study presents an analytical framework called EpiMix, which provides higher sensitivity in detecting abnormal DNA methylation patterns in small subsets of patients compared to existing methods. The researchers also used EpiMix to analyze cis-regulatory elements, enhancers, and genes encoding microRNAs and long non-coding RNAs, and discovered epigenetic mechanisms underlying childhood food allergy and survival-associated, methylation-driven ncRNAs in non-small cell lung cancer.

CELL REPORTS METHODS (2023)

Article Medicine, Research & Experimental

Multimodal deep learning to predict prognosis in adult and pediatric brain tumors

Sandra Steyaert, Yeping Lina Qiu, Yuanning Zheng, Pritam Mukherjee, Hannes Vogel, Olivier Gevaert

Summary: Steyaert, Qiu et al. developed a deep learning framework for multimodal data fusion in brain tumors. Combining histopathology imaging and gene expression data, the multimodal data models outperformed single data models in predicting prognosis.

COMMUNICATIONS MEDICINE (2023)

Article Biochemical Research Methods

Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM

Shaimaa Bakr, Kevin Brennan, Pritam Mukherjee, Josepmaria Argemi, Mikel Hernaez, Olivier Gevaert

Summary: In this study, we propose SparseGMM, a statistical approach that uses latent variable modeling with sparsity constraints to learn Gaussian mixtures from multiomic data, aiming to improve our understanding of diseases with genetic underpinnings. By combining coexpression patterns with a Bayesian framework, SparseGMM quantitatively measures confidence in regulators and uncertainty in target gene assignment by computing gene entropy. We apply SparseGMM to liver cancer and normal liver tissue data and identify regulators of angiogenesis, immune response, and blood coagulation in cancer. Furthermore, we show that high-entropy genes in cancer include key multifunctional components shared by critical pathways.

CELL REPORTS METHODS (2023)

Meeting Abstract Oncology

RADIOMICS-BASED MULTI-MODAL PREDICTION OF TREATMENT RESPONSE TO PD-1/PD-L1 IMMUNE CHECKPOINT INHIBITOR (ICI) THERAPY IN STAGE IV NON-SMALL CELL LUNG CARCINOMA (MNSCLC)

Ravi Parikh, Petr Jordan, Rita Ciaravino, Ryan Beasley, Arpan Patel, Dwight Owen, Arya Amini, Brendan Curti, Ray Page, Aurelie Swalduz, Jean-Paul Beregi, Jan Chrusciel, Eric Snyder, Pritam Mukherjee, Heather Selby, Soohee Lee, Roshanthi Weerasinghe, Shwetha Pindikuri, Jakob Weiss, Andrew Wentland, Anish Kirpalani, An Liu, Olivier Gevaert, George Simon, Hugo Aerts

JOURNAL FOR IMMUNOTHERAPY OF CANCER (2022)

暂无数据