4.7 Article

Cancer subtype identification using somatic mutation data

Journal

BRITISH JOURNAL OF CANCER
Volume 118, Issue 11, Pages 1492-1501

Publisher

SPRINGERNATURE
DOI: 10.1038/s41416-018-0109-7

Keywords

-

Categories

Funding

  1. NVIDIA foundation [2014-133322 (3953)]
  2. Postdoctoral Fellowship Program from the Charles A. King Trust Fund
  3. Sara Elizabeth O'Brien Trust
  4. Bank of America, N.A.

Ask authors/readers for more resources

BACKGROUND: With the onset of next-generation sequencing technologies, we have made great progress in identifying recurrent mutational drivers of cancer. As cancer tissues are now frequently screened for specific sets of mutations, a large amount of samples has become available for analysis. Classification of patients with similar mutation profiles may help identifying subgroups of patients who might benefit from specific types of treatment. However, classification based on somatic mutations is challenging due to the sparseness and heterogeneity of the data. METHODS: Here we describe a new method to de-sparsify somatic mutation data using biological pathways. We applied this method to 23 cancer types from The Cancer Genome Atlas, including samples from 5805 primary tumours. RESULTS: We show that, for most cancer types, de-sparsified mutation data associate with phenotypic data. We identify poor prognostic subtypes in three cancer types, which are associated with mutations in signal transduction pathways for which targeted treatment options are available. We identify subtype-drug associations for 14 additional subtypes. Finally, we perform a pan-cancer subtyping analysis and identify nine pan-cancer subtypes, which associate with mutations in four overarching sets of biological pathways. CONCLUSIONS: This study is an important step toward understanding mutational patterns in cancer.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Genetics & Heredity

Identification of Key Tissue-Specific, Biological Processes by Integrating Enhancer Information in Maize Gene Regulatory Networks

Maud Fagny, Marieke Lydia Kuijjer, Maike Stam, Johann Joets, Olivier Turc, Julien Roziere, Stephanie Pateyron, Anthony Venon, Clementine Vitte

Summary: Enhancers play a key role in coordinating gene expression during important developmental processes. This study investigated the enhancer-driven regulatory network in maize leaves and husks at different growth stages, revealing tissue-specific regulatory modules and potential new TF binding sites. The findings shed light on the complexity of enhancer-mediated gene regulation in plants.

FRONTIERS IN GENETICS (2021)

Article Multidisciplinary Sciences

Discovery of novel candidates for anti-liposarcoma therapies by medium-scale high-throughput drug screening

Iwona Grad, Robert Hanes, Pilar Ayuda-Duran, Marieke Lydia Kuijjer, Jorrit M. Enserink, Leonardo A. Meza-Zepeda, Ola Myklebost

Summary: Drug testing on liposarcoma cells has identified six potential anti-cancer drugs that target different mechanisms and have low toxicity to normal cells. These drugs show promise for the treatment of liposarcoma.

PLOS ONE (2021)

Article Computer Science, Information Systems

Mitigating Class-Boundary Label Uncertainty to Reduce Both Model Bias and Variance

Matthew Almeida, Yong Zhuang, Wei Ding, Scott E. Crouter, Ping Chen

Summary: The study addresses the issue of bias and variance in models when faced with inaccurate and uncertain training data labels. By estimating the uncertainty of labels and adjusting sample weights accordingly, the method reduces both bias and variance, improving model performance in a real-world case study.

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2021)

Review Genetics & Heredity

Community Detection in Large-Scale Bipartite Biological Networks

Genis Calderer, Marieke L. Kuijjer

Summary: Networks are useful tools in biology for representing and analyzing interactions, with many biological networks being bipartite. Community structure detection can help identify clusters of nodes in these networks, aiding in understanding specific biological processes.

FRONTIERS IN GENETICS (2021)

Article Biochemical Research Methods

rPanglaoDB: an R package to download and merge labeled single-cell RNA-seq data from the PanglaoDB database

Daniel Osorio, Marieke L. Kuijjer, James J. Cai

Summary: This study introduces rPanglaoDB, an R package for downloading and merging uniformly processed scRNA-seq data to collect rare cell types by integrating multiple public datasets. By characterizing a set of 157 fibrocytes, the study demonstrates the potential and utility of rPanglaoDB in enabling the collection of rare cell types for transcriptomic profiling.

BIOINFORMATICS (2022)

Article Oncology

Regulatory Network of PD1 Signaling Is Associated with Prognosis in Glioblastoma Multiforme

Camila M. Lopes-Ramos, Tatiana Belova, Tess H. Brunner, Marouen Ben Guebila, Daniel Osorio, John Quackenbush, Marieke L. Kuijjer

Summary: Analysis identified seven pathways associated with survival in glioblastoma patients, with dysregulation of PD1 signaling correlating with poor prognosis. This suggests a new approach to predict patient survival and highlights potential therapeutic interventions based on gene regulatory network analysis.

CANCER RESEARCH (2021)

Article Biochemistry & Molecular Biology

GRAND: a database of gene regulatory network models across human conditions

Marouen Ben Guebila, Camila M. Lopes-Ramos, Deborah Weighill, Abhijeet Rajendra Sonawane, Rebekka Burkholz, Behrouz Shamsaei, John Platig, Kimberly Glass, Marieke L. Kuijjer, John Quackenbush

Summary: Gene regulation networks play a crucial role in tissue identity, disease development, and therapeutic response. The GRAND database provides computationally-inferred gene regulatory network models and targeting scores for predicting drug effects on network structures and matching potential therapeutic drugs to disease states.

NUCLEIC ACIDS RESEARCH (2022)

Letter Biochemical Research Methods

An online notebook resource for reproducible inference, analysis and publication of gene regulatory networks

Marouen Ben Guebila, Deborah Weighill, Camila M. Lopes-Ramos, Rebekka Burkholz, Romana T. Pop, Kalyan Palepu, Mia Shapoval, Maud Fagny, Daniel Schlauch, Kimberly Glass, Michael Altenbuchinger, Marieke L. Kuijjer, John Platig, John Quackenbush

NATURE METHODS (2022)

Editorial Material Genetics & Heredity

Editorial: Applications and Methods in Genomic Networks

Maud Fagny, Kimberly Glass, Marieke L. Kuijjer

FRONTIERS IN GENETICS (2022)

Article Oncology

Mutually exclusive genetic interactions and gene essentiality shape the genomic landscape of primary melanoma

Sofia Birkealv, Mark Harland, Larissa Satiko Alcantara Sekimoto Matsuyama, Mamun Rashid, Ishan Mehta, Jonathan P. Laye, Kerstin Haase, Tracey Mell, Vivek Iyer, Carla Daniela Robles-Espinoza, Ultan McDermott, Peter van Loo, Marieke L. Kuijjer, Patricia A. Possik, Silvya Stuchi Maria Engler, D. Timothy Bishop, Julia Newton-Bishop, David J. Adams

Summary: This study conducted sequence profiling of 524 American Joint Committee on Cancer Stage I-III primary tumors, revealing recurrent driver mutations, mutually exclusive genetic interactions, and an absence of co-occurring genetic events. By intersecting copy number calls with CRISPR screening data, the transcription factor IRF4 was identified as a melanoma-associated dependency.

JOURNAL OF PATHOLOGY (2023)

Article Biotechnology & Applied Microbiology

The Network Zoo: a multilingual package for the inference and analysis of gene regulatory networks

Marouen Ben Guebila, Tian Wang, Camila M. M. Lopes-Ramos, Viola Fanfani, Des Weighill, Rebekka Burkholz, Daniel Schlauch, Joseph N. N. Paulson, Michael Altenbuchinger, Katherine H. H. Shutta, Abhijeet R. R. Sonawane, James Lim, Genis Calderer, David G. P. van IJzendoorn, Daniel Morgan, Alessandro Marin, Cho-Yi Chen, Qi Song, Enakshi Saha, Dawn L. L. DeMeo, Megha Padi, John Platig, Marieke L. L. Kuijjer, Kimberly Glass, John Quackenbush

Summary: This article introduces an open-source software called NetZoo, which is used for inference and analysis of gene regulatory networks. NetZoo integrates multi-omic data from various sources and harmonizes the implementations of network methods in different computing languages and between methods to allow better integration of these tools into analytical pipelines.

GENOME BIOLOGY (2023)

Article Health Care Sciences & Services

Exploring approaches for predictive cancer patient digital twins: Opportunities for collaboration and innovation

Eric A. A. Stahlberg, Mohamed Abdel-Rahman, Boris Aguilar, Alireza Asadpoure, Robert A. A. Beckman, Lynn L. L. Borkon, Jeffrey N. N. Bryan, Colleen M. M. Cebulla, Young Hwan Chang, Ansu Chatterjee, Jun Deng, Sepideh Dolatshahi, Olivier Gevaert, Emily J. J. Greenspan, Wenrui Hao, Tina Hernandez-Boussard, Pamela R. R. Jackson, Marieke Kuijjer, Adrian Lee, Paul Macklin, Subha Madhavan, Matthew D. D. McCoy, Navid Mohammad Mirzaei, Talayeh Razzaghi, Heber L. L. Rocha, Leili Shahriyari, Ilya Shmulevich, Daniel G. G. Stover, Yi Sun, Tanveer Syeda-Mahmood, Jinhua Wang, Qi Wang, Ioannis Zervantonakis

Summary: We are approaching a future where cancer patient digital twins can accurately predict and assist in the prevention, diagnosis, and treatment of cancer. This is made possible through advancements in high performance computing, computational modeling, and the availability of diverse observational data. The US National Cancer Institute and the US Department of Energy have launched collaborative projects to develop and implement predictive Cancer Patient Digital Twins, aiming to explore different approaches, such as deep phenotyping, personalized treatment planning, and treatment monitoring. These efforts have provided valuable insights into the opportunities and challenges of cancer patient digital twin approaches and shaped the future research direction.

FRONTIERS IN DIGITAL HEALTH (2022)

Article Biochemical Research Methods

Reference-based comparison of adaptive immune receptor repertoires

Cedric R. Weber, Teresa Rubio, Longlong Wang, Wei Zhang, Philippe A. Robert, Rahmad Akbar, Igor Snapkov, Jinghua Wu, Marieke L. Kuijjer, Sonia Tarazona, Ana Conesa, Geir K. Sandve, Xiao Liu, Sai T. Reddy, Victor Greiff

Summary: The similarity of immune repertoires can represent an individual's immune history. However, current understanding might be incorrect as certain immune states show highly similar immune repertoires in both healthy and diseased individuals, suggesting that immune perturbations do not cause significant changes in repertoires.

CELL REPORTS METHODS (2022)

Article Genetics & Heredity

gpuZoo: Cost-effective estimation of gene regulatory networks using the Graphics Processing Unit

Marouen Ben Guebila, Daniel C. Morgan, Kimberly Glass, Marieke L. Kuijjer, Dawn L. DeMeo, John Quackenbush

Summary: Gene regulatory network inference allows for modeling genome-scale regulatory processes. Researchers have developed a collection of tools to model various regulatory processes and improve their performance through GPU-accelerated calculations.

NAR GENOMICS AND BIOINFORMATICS (2022)

Article Computer Science, Artificial Intelligence

The immuneML ecosystem for machine learning analysis of adaptive immune receptor repertoires

Milena Pavlovic, Lonneke Scheffer, Keshav Motwani, Chakravarthi Kanduri, Radmila Kompova, Nikolay Vazov, Knut Waagan, Fabian L. M. Bernal, Alexandre Almeida Costa, Brian Corrie, Rahmad Akbar, Ghadi S. Al Hajj, Gabriel Balaban, Todd M. Brusko, Maria Chernigovskaya, Scott Christley, Lindsay G. Cowell, Robert Frank, Ivar Grytten, Sveinung Gundersen, Ingrid Hobaek Haff, Eivind Hovig, Ping-Han Hsieh, Gunter Klambauer, Marieke L. Kuijjer, Christin Lund-Andersen, Antonio Martini, Thomas Minotto, Johan Pensar, Knut Rand, Enrico Riccardi, Philippe A. Robert, Artur Rocha, Andrei Slabodkin, Igor Snapkov, Ludvig M. Sollid, Dmytro Titov, Cedric R. Weber, Michael Widrich, Gur Yaari, Victor Greiff, Geir Kjetil Sandve

Summary: immuneML is an AIRR-based machine learning tool that addresses reproducibility, transparency, and interoperability issues in the field of AIRR ML through an extensible, open-source software ecosystem. Users can utilize immuneML through a command-line tool or a Galaxy web interface, with extensive workflow documentation provided to facilitate widespread adoption.

NATURE MACHINE INTELLIGENCE (2021)

No Data Available