4.4 Article

KERNEL-PENALIZED REGRESSION FOR ANALYSIS OF MICROBIOME DATA

Journal

ANNALS OF APPLIED STATISTICS
Volume 12, Issue 1, Pages 540-566

Publisher

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/17-AOAS1102

Keywords

Compositional data; distance-based analysis; kernel methods; microbial community data; penalized regression

Funding

  1. National Institutes of Health [P01-CA168530, U01-CA162077, R01-GM114029]
  2. National Science Foundation [DMS-1161565, DMS-1561814]

Ask authors/readers for more resources

The analysis of human microbiome data is often based on dimensionreduced graphical displays and clusterings derived from vectors of microbial abundances in each sample. Common to these ordination methods is the use of biologically motivated definitions of similarity. Principal coordinate analysis, in particular, is often performed using ecologically defined distances, allowing analyses to incorporate context-dependent, non-Euclidean structure. In this paper, we go beyond dimension-reduced ordination methods and describe a framework of high-dimensional regression models that extends these distance-based methods. In particular, we use kernel-based methods to show how to incorporate a variety of extrinsic information, such as phylogeny, into penalized regression models that estimate taxon-specific associations with a phenotype or clinical outcome. Further, we show how this regression framework can be used to address the compositional nature of multivariate predictors comprised of relative abundances; that is, vectors whose entries sum to a constant. We illustrate this approach with several simulations using data from two recent studies on gut and vaginal microbiomes. We conclude with an application to our own data, where we also incorporate a significance test for the estimated coefficients that represent associations between microbial abundance and a percent fat.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Nutrition & Dietetics

Plasma metabolomics profiles suggest beneficial effects of a low-glycemic load dietary pattern on inflammation and energy metabolism

Sandi L. Navarro, Aliasghar Tarkhan, Ali Shojaie, Timothy W. Randolph, Haiwei Gu, Danijel Djukovic, Katie J. Osterbauer, Meredith A. Hullar, Mario Kratz, Marian L. Neuhouser, Paul D. Lampe, Daniel Raftery, Johanna W. Lampe

AMERICAN JOURNAL OF CLINICAL NUTRITION (2019)

Article Nutrition & Dietetics

Associations of plasma trimethylamine N-oxide, choline, carnitine, and betaine with inflammatory and cardiometabolic risk biomarkers and the fecal microbiome in the Multiethnic Cohort Adiposity Phenotype Study

Benjamin C. Fu, Meredith A. J. Hullar, Timothy W. Randolph, Adrian A. Franke, Kristine R. Monroe, Iona Cheng, Lynne R. Wilkens, John A. Shepherd, Margaret M. Madeleine, Loic Le Marchand, Unhee Lim, Johanna W. Lampe

AMERICAN JOURNAL OF CLINICAL NUTRITION (2020)

Article Biochemical Research Methods

netgsa: Fast computation and interactive visualization for topology-based pathway enrichment analysis

Michael Hellstern, Jing Ma, Kun Yue, Ali Shojaie

Summary: This study focused on improving the existing topology-based pathway enrichment method, NetGSA, through three key enhancements: reducing computation time, integrating pathway databases, and providing interactive visualization. The improved NetGSA outperforms in efficiency and statistical power compared to previous versions and other similar methods.

PLOS COMPUTATIONAL BIOLOGY (2021)

Article Nutrition & Dietetics

Personalized Nutrition Using Microbial Metabolite Phenotype to Stratify Participants and Non-Invasive Host Exfoliomics Reveal the Effects of Flaxseed Lignan Supplementation in a Placebo-Controlled Crossover Trial

Destiny A. Mullens, Ivan Ivanov, Meredith A. J. Hullar, Timothy W. Randolph, Johanna W. Lampe, Robert S. Chapkin

Summary: This study investigated the impact of the metabolic phenotype of flaxseed lignan on host gene expression. The findings suggest that a higher conversion of flaxseed lignan to enterolactone (ENL) is associated with a suppressed inflammatory status.

NUTRIENTS (2022)

Article Medicine, Research & Experimental

Lipidomics of cyclophosphamide 4-hydroxylation in patients receiving post-transplant cyclophosphamide

Sandi L. Navarro, Zihan Zheng, Timothy W. Randolph, Ryotaro Nakamura, Brenda M. Sandmaier, David Hockenbery, Jeannine S. McCune

Summary: Biomarker-guided dosing could potentially improve the effectiveness and safety of cyclophosphamide (CY), but evaluating its association with CY plasma concentration-time curve (AUC) is time-consuming. This study aimed to identify lipidomic biomarkers associated with 4-hydroxycyclophosphamide (4HCY) formation clearance in hematopoietic cell transplant patients receiving CY. The results showed the feasibility of lipidomics but further studies are needed to optimize CY dosing in larger samples.

CTS-CLINICAL AND TRANSLATIONAL SCIENCE (2022)

Article Pharmacology & Pharmacy

Prediction of Busulfan Clearance by Predose Plasma Metabolomic Profiling

Jeannine S. McCune, Sandi L. Navarro, K. Scott Baker, Linda J. Risler, Brian R. Phillips, Timothy W. Randolph, Laura Shireman, H. Gary Schoch, H. Joachim Deeg, Yuzheng Zhang, Alex Men, Loes Maton, Alwin D. R. Huitema

Summary: A linear regression model of 13 endogenous metabolomic compounds (EMCs) can be used to predict an individual's busulfan clearance (BuCL) before administration. This pharmacometabolomics method is more effective than using a busulfan test dose or pharmacogenomics to guide dosing.

CLINICAL PHARMACOLOGY & THERAPEUTICS (2023)

Article Cell Biology

Posttranslational modifications induce autoantibodies with risk prediction capability in patients with small cell lung cancer

Kristin J. Lastwika, Andrew Kunihiro, Joell L. Solan, Yuzheng Zhang, Lydia R. Taverne, David Shelley, Jung-Hyun Rho, Timothy W. Randolph, Christopher I. Li, Eric L. Grogan, Pierre P. Massion, Annette L. Fitzpatrick, David MacPherson, A. McGarry Houghton, Paul D. Lampe

Summary: Small cell lung cancer (SCLC) triggers the generation of autoantibodies, causing unique paraneoplastic neurological syndromes. We developed a technique to detect autoantibodies directly from patient plasma and found that SCLC patients have significantly higher disease-specific autoantibody signals compared to patients with other cancers. We identified previously unknown autoantibodies produced in response to both intracellular and extracellular tumor antigens in multiple SCLC cohorts and discovered disease-specific posttranslational modifications within targeted extracellular proteins. These findings have implications for the early detection and clinical utility of SCLC.

SCIENCE TRANSLATIONAL MEDICINE (2023)

Article Medicine, Research & Experimental

Annexin A2/TLR2/MYD88 pathway induces arginase 1 expression in tumor-associated neutrophils

Huajia Zhang, Xiaodong Zhu, Travis J. Friesen, Jeff W. Kwak, Tatyana Pisarenko, Surapat Mekvanich, Mark A. Velasco, Timothy W. Randolph, Julia Kargl, A. McGarry Houghton

Summary: This study reveals the expression of ARG1 in neutrophil lineage cells in non-small cell lung cancer and the active transcription of ARG7 mRNA in tumor-associated neutrophils (TANs). ANXA2 is identified as the major driver of ARG7 mRNA expression in TANs through signaling via the TLR2/MYD88 axis. This study uncovers a novel mechanism in regulating ARG7 mRNA expression in neutrophils and emphasizes the crucial role of neutrophil lineage cells in suppressing tumor-infiltrating lymphocytes.

JOURNAL OF CLINICAL INVESTIGATION (2022)

Article Gastroenterology & Hepatology

Associations of the gut microbiome with hepatic adiposity in the Multiethnic Cohort Adiposity Phenotype Study

Meredith A. J. Hullar, Isaac C. Jenkins, Timothy W. Randolph, Keith R. Curtis, Kristine R. Monroe, Thomas Ernst, John A. Shepherd, Daniel O. Stram, Iona Cheng, Bruce S. Kristal, Lynne R. Wilkens, Adrian Franke, Loic Le Marchand, Unhee Lim, Johanna W. Lampe

Summary: This study investigated the association of gut microbiome with hepatic adiposity among different ethnicities. The research found that NAFLD patients from various ethnic groups exhibited differences in bacterial composition and metabolism, but shared similar bacterial metabolic pathways.

GUT MICROBES (2021)

Article Endocrinology & Metabolism

Urinary enterolactone is associated with plasma proteins related to immunity and cancer development in healthy participants on controlled diets

Fayth L. Miles, Sandi L. Navarro, Carly B. Garrison, Timothy W. Randolph, Yuzheng Zhang, Ali Shojaie, Mario Kratz, Meredith A. J. Hullar, Daniel Raftery, Marian L. Neuhouser, Paul D. Lampe, Johanna W. Lampe

Summary: Urinary excretion of the microbial metabolite ENL of dietary lignans is associated with plasma protein abundance, potentially linking to cancer prevention. Over-representation analysis indicates associations of ENL excretion with estrogen and TNF signaling pathways.

HUMAN NUTRITION & METABOLISM (2021)

Review Statistics & Probability

Differential network analysis: A statistical perspective

Ali Shojaie

Summary: Network analysis is crucial in various scientific disciplines, especially in biology and medicine where it can predict complex diseases and provide insights into disease mechanisms. Recent statistical machine learning methods have been developed for inferring networks and identifying changes in their structures.

WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS (2021)

Article Nutrition & Dietetics

Impact of the Analytical Approach on the Reliability of MRI-Based Assessment of Hepatic Fat Content

Maggie S. Burhans, Niranjan Balu, Kelsey A. Schmidt, Gail Cromer, Kristina M. Utzschneider, Ellen A. Schur, Sarah E. Holte, Timothy W. Randolph, Mario Kratz

CURRENT DEVELOPMENTS IN NUTRITION (2020)

Article Automation & Control Systems

The Reduced PC-Algorithm: Improved Causal Structure Learning in Large Random Networks

Arjun Sondhi, Ali Shojaie

JOURNAL OF MACHINE LEARNING RESEARCH (2019)

No Data Available