4.7 Article

Community Approaches for Integrating Environmental Exposures into Human Models of Disease

Journal

ENVIRONMENTAL HEALTH PERSPECTIVES
Volume 128, Issue 12, Pages -

Publisher

US DEPT HEALTH HUMAN SCIENCES PUBLIC HEALTH SCIENCE
DOI: 10.1289/EHP7215

Keywords

-

Funding

  1. National Institutes of Health [5U13CA221044-03]

Ask authors/readers for more resources

BACKGROUND: A critical challenge in genomic medicine is identifying the genetic and environmental risk factors for disease. Currently, the available data links a majority of known coding human genes to phenotypes, but the environmental component of human disease is extremely underrepresented in these linked data sets. Without environmental exposure information, our ability to realize precision health is limited, even with the promise of modern genomics. Achieving integration of gene, phenotype, and environment will require extensive translation of data into a standard, computable form and the extension of the existing gene/phenotype data model. The data standards and models needed to achieve this integration do not currently exist. OBJECTIVES: Our objective is to foster development of community-driven data-reporting standards and a computational model that will facilitate the inclusion of exposure data in computational analysis of human disease. To this end, we present a preliminary semantic data model and use cases and competency questions for further community-driven model development and refinement. DISCUSSION: There is a real desire by the exposure science, epidemiology, and toxicology communities to use informatics approaches to improve their research workflow, gain new insights, and increase data reuse. Critical to success is the development of a community-driven data model for describing environmental exposures and linking them to existing models of human disease.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Toxicology

A novel approach to calculating the kinetically derived maximum dose

Lyle D. Burgoon, Claudio Fuentes, Christopher J. Borgert

Summary: This paper introduces a new method for calculating the kinetically derived maximal dose (KMD) based on Bayesian methods and the Kneedle algorithm. The new approach converts toxicokinetic data to the Michaelis-Menten curve using Bayesian methods and uses the Kneedle algorithm to find the point at which the curve begins to taper off. This reshapes the KMD methodology and places it within the Michaelis-Menten framework, leveraging existing biochemical and pharmacological concepts.

ARCHIVES OF TOXICOLOGY (2022)

Article Virology

NSAID use and clinical outcomes in COVID-19 patients: a 38-center retrospective cohort study

Justin T. Reese, Ben Coleman, Lauren Chan, Hannah Blau, Tiffany J. Callahan, Luca Cappelletti, Tommaso Fontana, Katie Rebecca Bradwell, Nomi L. Harris, Elena Casiraghi, Giorgio Valentini, Guy Karlebach, Rachel Deer, Julie A. McMurry, Melissa A. Haendel, Christopher G. Chute, Emily Pfaff, Richard Moffitt, Heidi Spratt, Jasvinder Singh, Christopher J. Mungall, Andrew E. Williams, Peter N. Robinson

Summary: This study found that the use of non-steroidal anti-inflammatory drugs (NSAIDs) is not associated with increased severity or other adverse outcomes in COVID-19 inpatients. The results confirm and extend the findings of previous observational studies and provide evidence against the initial concerns raised about the use of NSAIDs in COVID-19 patients.

VIROLOGY JOURNAL (2022)

Letter Psychiatry

Risk of new-onset psychiatric sequelae of COVID-19 in the early and late post-acute phase

Ben Coleman, Elena Casiraghi, Hannah Blau, Lauren Chan, Melissa A. Haendel, Bryan Laraway, Tiffany J. Callahan, Rachel R. Deer, Kenneth J. Wilkins, Justin Reese, Peter N. Robinson

WORLD PSYCHIATRY (2022)

Letter Biotechnology & Applied Microbiology

The GA4GH Phenopacket schema defines a computable representation of clinical data

Julius O. B. Jacobsen, Michael Baudis, Gareth S. Baynam, Jacques S. Beckmann, Sergi Beltran, Orion J. Buske, Tiffany J. Callahan, Christopher G. Chute, Melanie Courtot, Daniel Danis, Olivier Elemento, Andrea Essenwanger, Robert R. Freimuth, Michael A. Gargano, Tudor Groza, Ada Hamosh, Nomi L. Harris, Rajaram Kaliyaperumal, Kevin C. Kent Lloyd, Aly Khalifa, Peter M. Krawitz, Sebastian Koeler, Brian J. Laraway, Heikki Lehvaslaiho, Leslie Matalonga, Julie A. McMurry, Alejandro Metke-Jimenez, Christopher J. Mungall, Monica C. Munoz-Torres, Soichi Ogishima, Anastasios Papakonstantinou, Davide Piscia, Nikolas Pontikos, Nuria Queralt-Rosinach, Marco Roos, Julian Sass, Paul N. Schofield, Dominik Seelow, Anastasios Siapos, Damian Smedley, Lindsay D. Smith, Robin Steinhaus, Jagadish Chandrabose Sundaramurthi, Emilia M. Swietlik, Sylvia Thun, Nicole A. Vasilevsky, Alex H. Wagner, Jeremy L. Warner, Claus Weiland, Melissa A. Haendel, Peter N. Robinson

NATURE BIOTECHNOLOGY (2022)

Article Computer Science, Interdisciplinary Applications

A method for comparing multiple imputation techniques: A case study on the US national COVID cohort collaborative

Elena Casiraghi, Rachel Wong, Margaret Hall, Ben Coleman, Marco Notaro, Michael D. Evans, Jena S. Tronieri, Hannah Blau, Bryan Laraway, Tiffany J. Callahan, Lauren E. Chan, Carolyn T. Bramante, John B. Buse, Richard A. Moffitt, Til Sturmer, Steven G. Johnson, Yu Raymond Shao, Justin Reese, Peter N. Robinson, Alberto Paccanaro, Giorgio Valentini, Jared D. Huling, Kenneth J. Wilkins

Summary: Healthcare datasets from Electronic Health Records are valuable for assessing associations between patients' predictors and outcomes. However, missing values are common in these datasets, and removing them may introduce bias. Multiple imputation algorithms have been proposed to recover missing information, but there is no consensus on which algorithm works best. Choosing algorithm parameters and data-related modeling choices is also challenging.

JOURNAL OF BIOMEDICAL INFORMATICS (2023)

Article Multidisciplinary Sciences

Unifying the identification of biomedical entities with the Bioregistry

Charles Tapley Hoyt, Meghan Balk, Tiffany J. Callahan, Daniel Domingo-Fernandez, Melissa A. Haendel, Harshad B. Hegde, Daniel S. Himmelstein, Klas Karis, John Kunze, Tiago Lubiana, Nicolas Matentzoglu, Julie McMurry, Sierra Moxon, Christopher J. Mungall, Adriano Rutz, Deepak R. Unni, Egon Willighagen, Donald Winston, Benjamin M. Gyori

Summary: The standardized identification of biomedical entities is important for interoperability and data integration in the life sciences. The Bioregistry is an integrative and open metaregistry that expands upon existing registries to address the evolving needs of researchers. By leveraging public infrastructure and automation, and employing an open code and open data governance model, the Bioregistry promotes interoperability and reuse of data and scientific literature.

SCIENTIFIC DATA (2022)

Article Computer Science, Interdisciplinary Applications

Developing a Knowledge Graph for Pharmacokinetic Natural Product-Drug Interactions

Sanya B. Taneja, Tiffany J. Callahan, Mary F. Paine, Sandra L. Kane-Gill, Halil Kilicoglu, Marcin P. Joachimiak, Richard D. Boyce

Summary: The study constructed a knowledge graph (KG) for pharmacokinetic natural product-drug interactions (NPDIs), which can be used to discover plausible mechanistic explanations and guide scientific research.

JOURNAL OF BIOMEDICAL INFORMATICS (2023)

Article Computer Science, Interdisciplinary Applications

Causal feature selection using a knowledge graph combining structured knowledge from the biomedical literature and ontologies: A use case studying depression as a risk factor for Alzheimer's disease

Scott A. Malec, Sanya B. Taneja, Steven M. Albert, C. Elizabeth Shaaban, Helmet T. Karim, Arthur S. Levine, Paul Munro, Tiffany J. Callahan, Richard D. Boyce

Summary: Traditional methods of identifying confounders rely on content-matter expertise and literature review, but these methods have limitations. To overcome these challenges, researchers propose a novel method based on knowledge graph, which combines computable literature-derived knowledge with biomedical ontologies for better causal feature selection. The application of this method identifies potential confounders and highlights the need for standardized databases of causal variables.

JOURNAL OF BIOMEDICAL INFORMATICS (2023)

Article Health Care Sciences & Services

Ontologizing health systems data at scale: making translational discovery a reality

Tiffany J. Callahan, Adrianne L. Stefanski, Jordan M. Wyrwa, Chenjie Zeng, Anna Ostropolets, Juan M. Banda, William A. Baumgartner, Richard D. Boyce, Elena Casiraghi, Ben D. Coleman, Janine H. Collins, Sara J. Deakyne Davies, James A. Feinstein, Asiyah Y. Lin, Blake Martin, Nicolas A. Matentzoglu, Daniella Meeker, Justin Reese, Jessica Sinclair, Sanya B. Taneja, Katy E. Trinkley, Nicole A. Vasilevsky, Andrew E. Williams, Xingmin A. Zhang, Joshua C. Denny, Patrick B. Ryan, George Hripcsak, Tellen D. Bennett, Melissa A. Haendel, Peter N. Robinson, Lawrence E. Hunter, Michael G. Kahn

Summary: Common data models address standardization challenges in EHR data, but fail to integrate all resources for deep phenotyping. OBO ontologies provide computable representations of biological knowledge, but mapping EHR data to OBO ontologies requires manual curation. OMOP2OBO is an algorithm that maps OMOP vocabularies to OBO ontologies, enabling deep phenotyping and identification of undiagnosed patients.

NPJ DIGITAL MEDICINE (2023)

Article Health Care Sciences & Services

A metadata framework for computational phenotypes

Matthew Spotnitz, Nripendra Acharya, James J. Cimino, Shawn Murphy, Bahram Namjou, Nancy Crimmins, Theresa Walunas, Cong Liu, David Crosslin, Barbara Benoit, Elisabeth Rosenthal, Jennifer A. Pacheco, Anna Ostropolets, Harry Reyes Nieva, Jason S. Patterson, Lauren R. Richter, Tiffany J. Callahan, Ahmed Elhussein, Chao Pang, Krzysztof Kiryluk, Jordan Nestor, Atlas Khan, Sumit Mohan, Evan Minty, Wendy Chung, Wei-Qi Wei, Karthik Natarajan, Chunhua Weng

Summary: This study develops a comprehensive metadata framework for defining computational clinical phenotypes. The framework was positively evaluated by more than 90% of the survey respondents in terms of phenotype definition and validation methods and metrics. The strengths of the framework include explicit descriptions, compliance with data standards, and comprehensive validation methods.

JAMIA OPEN (2023)

Article Computer Science, Interdisciplinary Applications

GRAPE for fast and scalable graph processing and random-walk-based embedding

Luca Cappelletti, Tommaso Fontana, Elena Casiraghi, Vida Ravanmehr, Tiffany J. J. Callahan, Carlos Cano, Marcin P. P. Joachimiak, Christopher J. J. Mungall, Peter N. N. Robinson, Justin Reese, Giorgio Valentini

Summary: GRAPE is a software resource for graph processing and embedding that can scale with big graphs, showing substantial improvements in space and time complexity compared to existing resources. It offers efficient graph-processing utilities, node embedding methods, and inference models, making it a valuable tool for graph representation learning. GRAPE is capable of handling millions of nodes and billions of edges, enabling large-graph analysis in various real-world applications.

NATURE COMPUTATIONAL SCIENCE (2023)

Article Medicine, General & Internal

Generalisable long COVID subtypes: Findings from the NIH N3C and RECOVER programmes

Justin T. Reese, Hannah Blau, Elena Casiraghi, Timothy Bergquist, Johanna J. Loomba, Tiffany J. Callahan, Bryan Laraway, Corneliu Antonescu, Ben Coleman, Michael Gargano, Kenneth J. Wilkins, Luca Cappelletti, Tommaso Fontana, Nariman Ammar, Blessy Antony, T. M. Murali, J. Harry Caufield, Guy Karlebach, Julie A. McMurry, Andrew Williams, Richard Moffitt, Jineta Banerjee, Anthony E. Solomonides, Hannah Davis, Kristin Kostka, Giorgio Valentini, David Sahner, Christopher G. Chute, Charisse Madlock-Brown, Melissa A. Haendel, Peter N. Robinson

Summary: By computationally modelling PASC phenotype data and assessing semantic similarity, we identified six distinct clusters of PASC patients with different clinical features and severity, including diverse manifestations. This semantic phenotypic clustering approach provides a foundation for stratifying and studying PASC patients for natural history or therapy studies.

EBIOMEDICINE (2023)

Article Mathematical & Computational Biology

A Simple Standard for Sharing Ontological Mappings (SSSOM)

Nicolas Matentzoglu, James P. Balhoff, Susan M. Bello, Chris Bizon, Matthew Brush, Tiffany J. Callahan, Christopher G. Chute, William D. Duncan, Chris T. Evelo, Davera Gabriel, John Graybeal, Alasdair Gray, Benjamin M. Gyori, Melissa Haendel, Henriette Harmse, Nomi L. Harris, Ian Harrow, Harshad B. Hegde, Amelia L. Hoyt, Charles T. Hoyt, Dazhi Jiao, Ernesto Jimenez-Ruiz, Simon Jupp, Hyeongsik Kim, Sebastian Koehler, Thomas Liener, Qinqin Long, James Malone, James A. McLaughlin, Julie A. McMurry, Sierra Moxon, Monica C. Munoz-Torres, David Osumi-Sutherland, James A. Overton, Bjoern Peters, Tim Putman, Nuria Queralt-Rosinach, Kent Shefchek, Harold Solbrig, Anne Thessen, Tania Tudorache, Nicole Vasilevsky, Alex H. Wagner, Christopher J. Mungall

Summary: This paper introduces the "Simple Standard for Sharing Ontological Mappings (SSSOM)" standard, which addresses issues in mapping different representations of objects in different databases by enhancing readability of metadata, defining an easy-to-use table-based format, and implementing open collaborative workflows. The standard provides reference tools and software libraries for handling mappings.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2022)

Article Mathematical & Computational Biology

Conference report: Biocuration 2021 Virtual Conference

Federica Quaglia, Rama Balakrishnan, Susan M. Bello, Nicole Vasilevsky

Summary: The International Society for Biocuration (ISB) aims to promote the field of biocuration through annual international conferences and provide a platform for information exchange and networking. Due to the ongoing pandemic, this year's ISB conference was held virtually, covering topics such as the future of biocuration, career paths, and equity and inclusion.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2022)

No Data Available