4.6 Article

The center for expanded data annotation and retrieval

Journal

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/jamia/ocv048

Keywords

datasets as topic; data curation; data collection; standards; biological ontologies

Funding

  1. National Institute of Allergy and Infectious Diseases through trans-NIH Big Data to Knowledge (BD2K) initiative [U54 AI117925]
  2. BBSRC [BB/L024101/1, BB/J020265/1, BB/I025840/1, BB/I000771/1, BB/E025080/1] Funding Source: UKRI
  3. Biotechnology and Biological Sciences Research Council [BB/E025080/1, BB/I000771/1, BB/J020265/1, BB/I025840/1, BB/L024101/1] Funding Source: researchfish

Ask authors/readers for more resources

The Center for Expanded Data Annotation and Retrieval is studying the creation of comprehensive and expressive metadata for biomedical datasets to facilitate data discovery, data interpretation, and data reuse. We take advantage of emerging community-based standard templates for describing different kinds of biomedical datasets, and we investigate the use of computational techniques to help investigators to assemble templates and to fill in their values. We are creating a repository of metadata from which we plan to identify metadata patterns that will drive predictive data entry when filling in metadata templates. The metadata repository not only will capture annotations specified when experimental datasets are initially created, but also will incorporate links to the published literature, including secondary analyses and possible refinements or retractions of experimental interpretations. By working initially with the Human Immunology Project Consortium and the developers of the ImmPort data repository, we are developing and evaluating an end-to-end solution to the problems of metadata authoring and management that will generalize to other data-management environments.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Rheumatology

Characterising the autoantibody repertoire in systemic sclerosis following myeloablative haematopoietic stem cell transplantation

Burcu Ayoglu, Michele Donato, Daniel E. Furst, Leslie J. Crofford, Ellen Goldmuntz, Lynette Keyes-Elstein, Judith James, Susan Macwana, Maureen D. Mayes, Peter McSweeney, Richard A. Nash, Keith M. Sullivan, Beverly Welch, Ashley Pinckney, Rong Mao, Lorinda Chung, Purvesh Khatri, Paul J. Utz

Summary: Results from the SCOT clinical trial showed that HSCT had significant benefits over CTX in patients with systemic sclerosis. The objective of this study was to test the hypothesis that transplantation stabilizes the autoantibody repertoire in patients with favorable clinical outcomes. Analysis of autoantibody profiles revealed significant differences between HSCT and CTX-treated patients, suggesting that HSCT alters the autoantibody repertoire while CTX treatment does not.

ANNALS OF THE RHEUMATIC DISEASES (2023)

Article Oncology

LAMC2 Regulates Key Transcriptional and Targetable Effectors to Support Pancreatic Cancer Growth

Oihane Erice, Shruthi Narayanan, Iker Feliu, Rodrigo Entrialgo-Cadierno, Antonia Malinova, Caterina Vicentini, Elizabeth Guruceaga, Pietro Del fi No, Marija Trajkovic-Arsic, Haritz Moreno, Karmele Valencia, Ester Blanco, Irati Macaya, Daniel Ohlund, Purvesh Khatri, Fernando Lecanda, Aldo Scarpa, Jens T. Siveke, Vincenzo Corbo, Mariano Ponz-Sarvise, Silve Vicent

Summary: In this study, the expression of LAMC2 in PDAC and its regulated network were investigated, with the aim of identifying potential therapies. It was found that LAMC2 was consistently upregulated in PDAC and associated with tumor grade and survival. The LAMC2-regulated network contained downstream effectors shared by the KRAS signaling pathway, and the inhibition of LAMC2 or AXL showed a synergistic antiproliferative effect in combination with MEK1/2 inhibitors in PDAC models.

CLINICAL CANCER RESEARCH (2023)

Article Medicine, General & Internal

Multisite validation of a host response signature for predicting likelihood of bacterial and viral infections in patients with suspected influenza

Maryam Shojaei, Uan- Chen, Uros Midic, Simone Thair, Sally Teoh, Anthony McLean, Timothy E. E. Sweeney, Matthew Thompson, Oliver Liesenfeld, Purvesh Khatri, Benjamin Tang

Summary: IMX-BVN-1, a 29-host mRNA classifier, accurately differentiates bacterial and viral infections in patients with suspected influenza, providing potential clinical utility in acute healthcare settings.

EUROPEAN JOURNAL OF CLINICAL INVESTIGATION (2023)

Article Biology

Machine intelligence for radiation science: summary of the Radiation Research Society 67th annual meeting symposium

Lydia J. Wilson, Frederico C. Kiffer, Daniel C. Berrios, Abigail Bryce-Atkinson, Sylvain V. Costes, Olivier Gevaert, Bruno F. E. Matarese, Jack Miller, Pritam Mukherjee, Kristen Peach, Paul N. Schofield, Luke T. Slater, Britta Langen

Summary: The era of high-throughput techniques has generated large amounts of data in the medical and research fields. Machine intelligence (MI) approaches are being used to overcome limitations in processing, analyzing, and interpreting these massive data sets. The 67th Annual Meeting of the Radiation Research Society featured a symposium on MI approaches, highlighting recent advancements in radiation sciences and their clinical applications. This article summarizes three presentations on metadata processing and ontological formalization, data mining for radiation outcomes in pediatric oncology, and imaging in lung cancer.

INTERNATIONAL JOURNAL OF RADIATION BIOLOGY (2023)

Article Gastroenterology & Hepatology

Mass-Cytometry-Based Quantification of Global Histone Post-Translational Modifications at Single-Cell Resolution Across Peripheral Immune Cells in IBD

Lawrence Bai, Denis Dermadi, Laurynas Kalesinskas, Mai Dvorak, Sarah E. Chang, Ananthakrishnan Ganesan, Samuel J. S. Rubin, Alex Kuo, Peggie Cheung, Michele Donato, Paul J. Utz, Aida Habtezion, Purvesh Khatri

Summary: This study quantified histone modifications at a single-cell resolution in patients with inflammatory bowel disease (IBD) and discovered substantial heterogeneity across immune cell types. Specific histone modifications associated with gene transcription were identified, and a subset of NK cells was found to be associated with higher levels of C-reactive protein. These findings open new avenues for investigating the association between histone modifications and IBD pathology using other epigenomic tools.

JOURNAL OF CROHNS & COLITIS (2023)

Review Biology

Skin Lesion Analysis and Cancer Detection Based on Machine/Deep Learning Techniques: A Comprehensive Survey

Mehwish Zafar, Muhammad Imran Sharif, Muhammad Irfan Sharif, Seifedine Kadry, Syed Ahmad Chan Bukhari, Hafiz Tayyab Rauf

Summary: The skin is the largest organ in the human body and skin cancer is one of the most dangerous types of cancer. Abnormal cell growth in human skin cells can be caused by various pathological variations and genetic disorders. Early diagnosis is crucial due to the slow development and high mortality rate of skin cancer. Numerous computer-aided diagnosis systems utilizing deep learning, machine learning, and computer vision approaches have been proposed for the recognition of skin cancer. This research provides a comprehensive review of the methodologies, techniques, and approaches used for examining skin lesions, with a focus on the challenges and obstacles in analyzing complex and rare features.

LIFE-BASEL (2023)

Article Oncology

Targeting KDM2A Enhances T-cell Infiltration in NSD1-Deficient Head and Neck Squamous Cell Carcinoma

Chen Chen, June Ho Shin, Zhuoqing Fang, Kevin Brennan, Nina B. Horowitz, Kathleen L. Pfaff, Emma L. Welsh, Scott J. Rodig, Olivier Gevaert, Or Gozani, Ravindra Uppaluri, John B. Sunwoo

Summary: In head and neck squamous cell carcinoma (HNSCC), inactivating mutations in the histone methyltransferase NSD1 disproportionately contribute to tumor development and immune exclusion. Understanding the NSD1-mediated mechanism and targeting the histone-modifying enzyme KDM2A could enhance T-cell infiltration and suppress tumor growth in HNSCC.

CANCER RESEARCH (2023)

Correction Multidisciplinary Sciences

Systems vaccinology of the BNT162b2 mRNA vaccine in humans (vol 596, pg 410, 2021)

Prabhu S. Arunachalam, Madeleine K. D. Scott, Thomas Hagan, Chunfeng Li, Yupeng Feng, Florian Wimmers, Lilit Grigoryan, Meera Trisal, Venkata Viswanadh Edara, Lilin Lai, Sarah Esther Chang, Allan Feng, Shaurya Dhingra, Mihir Shah, Alexandra S. Lee, Sharon Chinthrajah, Sayantani B. Sindher, Vamsee Mallajosyula, Fei Gao, Natalia Sigal, Sangeeta Kowli, Sheena Gupta, Kathryn Pellegrini, Gregory Tharp, Sofia Maysel-Auslender, Sydney Hamilton, Hadj Aoued, Kevin Hrusovsky, Mark Roskey, Steven E. Bosinger, Holden T. Maecker, Scott D. Boyd, Mark M. Davis, Paul J. Utz, Mehul S. Suthar, Purvesh Khatri, Kari C. Nadeau, Bali Pulendran

NATURE (2023)

Article Biochemistry & Molecular Biology

A deep-learning algorithm to classify skin lesions from mpox virus infection

Alexander H. H. Thieme, Yuanning Zheng, Gautam Machiraju, Chris Sadee, Mirja Mittermaier, Maximilian Gertler, Jorge L. Salinas, Krithika Srinivasan, Prashnna Gyawali, Francisco Carrillo-Perez, Angelo Capodici, Maximilian Uhlig, Daniel Habenicht, Anastassia Loeser, Maja Kohler, Maximilian Schuessler, David Kaul, Johannes Gollrad, Jackie Ma, Christoph Lippert, Kendall Billick, Isaac Bogoch, Tina Hernandez-Boussard, Pascal Geldsetzer, Olivier Gevaert

Summary: A deep-learning algorithm, MPXV-CNN, was developed to identify skin lesions caused by the mpox virus for early detection and mitigation. It demonstrated a sensitivity of 0.83-0.91 and a specificity of 0.965-0.898 across different datasets. The algorithm was robust in classifying lesions on various skin tones and body regions, and a web-based app was developed for patient guidance.

NATURE MEDICINE (2023)

Article Multidisciplinary Sciences

A Large-scale Synthetic Pathological Dataset for Deep Learning-enabled Segmentation of Breast Cancer

Kexin Ding, Mu Zhou, He Wang, Olivier Gevaert, Dimitris Metaxas, Shaoting Zhang

Summary: To enhance computational pathology, we introduce a large-scale synthetic pathological image dataset paired with nucleus annotations, called SNOW. By applying off-the-shelf image generator and nuclei annotator, SNOW offers a cost-effective means to improve model performance. Results show that models trained on synthetic data are competitive and expand the use of synthetic images for data-driven clinical tasks.

SCIENTIFIC DATA (2023)

Article Computer Science, Artificial Intelligence

Multimodal data fusion for cancer biomarker discovery with deep learning

Sandra Steyaert, Marija Pizurica, Divya Nagaraj, Priya Khandelwal, Tina Hernandez-Boussard, Andrew J. Gentles, Olivier Gevaert

Summary: Cancer diagnosis and treatment decisions often focus on a single data source. However, there is a need for effective multimodal fusion approaches to integrate complementary data types. The current technological advances and introduction of deep learning have the potential to address the challenges of data integration in cancer research.

NATURE MACHINE INTELLIGENCE (2023)

Article Dermatology

Best Practices for Clinical Skin Image Acquisition in Translational Artificial Intelligence Research

Michelle Phung, Vijaytha Muralidharan, Veronica Rotemberg, Roberto Andres Novoa, Albert Sean Chiou, Christoph Y. Sadee, Bailie Rapaport, Kiana Yekrang, Jared Bitz, Olivier Gevaert, Justin Meng Ko, Roxana Daneshjou

Summary: Recent developments in artificial intelligence research have led to the increased use of algorithms for detecting malignancies in clinical and dermoscopic images of skin diseases. Gathering training and testing data is crucial for these methods. This paper explores the best practices and challenges in collecting skin images and data for translational artificial intelligence research, including ethics, image acquisition, labeling, curation, and storage. The aim is to enhance malignancy detection using artificial intelligence by facilitating intentional data collection and collaboration between dermatologists and data scientists.

JOURNAL OF INVESTIGATIVE DERMATOLOGY (2023)

Article Multidisciplinary Sciences

Spatial cellular architecture predicts prognosis in glioblastoma

Yuanning Zheng, Francisco Carrillo-Perez, Marija Pizurica, Dieter Henrik Heiland, Olivier Gevaert

Summary: In this study, two deep learning models were used to predict the transcriptional subtypes and prognosis of glioblastoma (GBM) cells from histology images. The results showed consistent associations between spatial cellular organization and patient prognosis. The study also confirmed that transcriptional heterogeneity and cell-state plasticity are key factors in the development of therapeutic resistance in GBM.

NATURE COMMUNICATIONS (2023)

Article Multidisciplinary Sciences

Signature-driven repurposing of Midostaurin for combination with MEK1/2 and KRASG12C inhibitors in lung cancer

Irati Macaya, Marta Roman, Connor Welch, Rodrigo Entrialgo-Cadierno, Marina Salmon, Alba Santos, Iker Feliu, Joanna Kovalski, Ines Lopez, Maria Rodriguez-Remirez, Sara Palomino-Echeverria, Shane M. Lonfgren, Macarena Ferrero, Silvia Calabuig, Iziar A. Ludwig, David Lara-Astiaso, Eloisa Jantus-Lewintre, Elizabeth Guruceaga, Shruthi Narayanan, Mariano Ponz-Sarvise, Antonio Pineda-Lucena, Fernando Lecanda, Davide Ruggero, Purvesh Khatri, Enrique Santamaria, Joaquin Fernandez-Irigoyen, Irene Ferrer, Luis Paz-Ares, Matthias Drosten, Mariano Barbacid, Ignacio Gil-Bazo, Silve Vicent

Summary: Drug combinations are essential for overcoming resistance to targeted therapies in cancer treatment. By utilizing a gene signature-driven drug repurposing approach and a pairwise pharmacological screen, the authors identified a synergistic drug combination for KRAS-mutated lung adenocarcinoma. The combination treatment shows cytotoxic response and involves inhibition of the PKC inhibitor target AURKB.

NATURE COMMUNICATIONS (2023)

Article Genetics & Heredity

A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples

Rushika Pandya, Yudong D. He, Timothy E. Sweeney, Yehudit Hasin-Brumshtein, Purvesh Khatri

Summary: This study identified a conserved host response in nasal samples that can be used for diagnosis and ruling out viral infection in symptomatic patients. By analyzing 1555 samples, a classifier based on 33 genes was developed, which can accurately distinguish viral acute respiratory illnesses from healthy controls and non-viral ARIs.

GENOME MEDICINE (2023)

No Data Available