4.8 Article

Complex Principal Component and Correlation Structure of 16 Yeast Genomic Variables

期刊

MOLECULAR BIOLOGY AND EVOLUTION
卷 28, 期 9, 页码 2501-2512

出版社

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msr077

关键词

molecular evolution; genome analysis; proteomics; principal component; analysis

资金

  1. German Academic Exchange Service
  2. Helmholtz-center Munich
  3. Helmholtz Association within Helmholtz Alliance on Systems Biology

向作者/读者索取更多资源

A quickly growing number of characteristics reflecting various aspects of gene function and evolution can be either measured experimentally or computed from DNA and protein sequences. The study of pairwise correlations between such quantitative genomic variables as well as collective analysis of their interrelations by multidimensional methods have delivered crucial insights into the processes of molecular evolution. Here, we present a principal component analysis (PCA) of 16 genomic variables from Saccharomyces cerevisiae, the largest data set analyzed so far. Because many missing values and potential outliers hinder the direct calculation of principal components, we introduce the application of Bayesian PCA. We confirm some of the previously established correlations, such as evolutionary rate versus protein expression, and reveal new correlations such as those between translational efficiency, phosphorylation density, and protein age. Although the first principal component primarily contrasts genomic change and protein expression, the second component separates variables related to gene existence and expressed protein functions. Enrichment analysis on genes affecting variable correlations unveils classes of influential genes. For example, although ribosomal and nuclear transport genes make important contributions to the correlation between protein isoelectric point and molecular weight, protein synthesis and amino acid metabolism genes help cause the lack of significant correlation between propensity for gene loss and protein age. We present the novel Quagmire database (Quantitative Genomics Resource) which allows exploring relationships between more genomic variables in three model organisms-Escherichia coli, S. cerevisiae, and Homo sapiens (http://webclu.bio.wzw.tum.de:18080/quagmire).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biotechnology & Applied Microbiology

Mapping single-cell data to reference atlases by transfer learning

Mohammad Lotfollahi, Mohsen Naghipourfar, Malte D. Luecken, Matin Khajavi, Maren Buettner, Marco Wagenstetter, Ziga Avsec, Adam Gayoso, Nir Yosef, Marta Interlandi, Sergei Rybakov, Alexander Misharin, Fabian J. Theis

Summary: scArches is a deep learning strategy for mapping query datasets on top of a reference, allowing efficient and decentralized reference construction while preserving biological state information and removing batch effects. It generalizes to multimodal reference mapping and can impute missing modalities.

NATURE BIOTECHNOLOGY (2022)

Review Biotechnology & Applied Microbiology

Spatial components of molecular tissue biology

Giovanni Palla, David S. Fischer, Aviv Regev, Fabian J. Theis

Summary: Methods for profiling RNA and protein expression in a spatially resolved manner have rapidly advanced, but clear articulation of key biological questions and development of computational tools are crucial. Decisions on molecular features and inclusion of cell shape in analysis need to be made by developers. Optimal ways to compare tissue samples at different length scales are still being sought.

NATURE BIOTECHNOLOGY (2022)

Review Endocrinology & Metabolism

Toward modeling metabolic state from single-cell transcriptomics

Karin Hrovatin, David S. Fischer, Fabian J. Theis

Summary: Single-cell metabolic modeling provides a new perspective for understanding cellular functions. The presented modeling approaches vary in terms of input requirements, assumptions, scalability, modeled metabolic layers, and newly gained insights. We believe that the use of prior metabolic knowledge will lead to more robust predictions and will pave the way for mechanistic and interpretable machine-learning models.

MOLECULAR METABOLISM (2022)

Article Genetics & Heredity

Origin and function of activated fibroblast states during zebrafish heart regeneration

Bo Hu, Sara Lelek, Bastiaan Spanjaard, Hadil El-Sammak, Mariana Guedes Simoes, Janita Mintcheva, Hananeh Aliee, Ronny Schaefer, Alexander M. Meyer, Fabian Theis, Didier Y. R. Stainier, Daniela Panakova, Jan Philipp Junker

Summary: This study identifies specialized activated fibroblast cell states in the regenerating zebrafish heart using single-cell transcriptomics and spatiotemporal analysis, and reveals the origin of these cell states and the regulatory mechanism of endocardial fibroblast response.

NATURE GENETICS (2022)

Article Biochemistry & Molecular Biology

Probing cell identity hierarchies by fate titration and collision during direct reprogramming

Bob A. Hersbach, David S. Fischer, Giacomo Masserdotti, Deeksha, Karolina Mojzisova, Thomas Waltzhoeni, Diego Rodriguez-Terrones, Matthias Heinig, Fabian J. Theis, Magdalena Goetz, Stefan H. Stricker

Summary: Collide-seq, a single-cell protocol, has shed light on the basic principles of fate erasure and cell identity conflict resolution in direct reprogramming. It revealed the lack of a common mechanism for the loss of fibroblast-specific gene expression and showed that the abrupt transcriptional changes in converting cells occur when critical levels of reprogramming factors are reached. The study also demonstrated that reprogramming factors can disturb cell identity programs independent of their ability to bind their target genes.

MOLECULAR SYSTEMS BIOLOGY (2022)

Article Biotechnology & Applied Microbiology

Modeling intercellular communication in tissues using spatial graphs of cells

David S. Fischer, Anna C. Schaar, Fabian J. Theis

Summary: A graph neural network is used to model how cells communicate in tissues. Existing models of intercellular communication only consider receptor-ligand signaling and ignore spatial proximity. This study presents a node-centric expression modeling method that estimates the impact of niche composition on gene expression from spatial molecular profiling data. The method successfully recovers signatures of molecular processes involved in cell communication.

NATURE BIOTECHNOLOGY (2023)

Article Cell Biology

Biologically informed deep learning to query gene programs in single-cell atlases

Mohammad Lotfollahi, Sergei Rybakov, Karin Hrovatin, Soroor Hediyeh-zadeh, Carlos Talavera-Lopez, Alexander V. Misharin, Fabian J. Theis

Summary: Lotfollahi et al. propose ExpiMap, a biologically informed deep-learning model for interpretable reference mapping of RNA sequencing data. ExpiMap maps cells into biologically understandable components representing known 'gene programs', allowing for detailed analysis and interpretation of single-cell data.

NATURE CELL BIOLOGY (2023)

Article Neurosciences

TDP-43 condensates and lipid droplets regulate the reactivity of microglia and regeneration after traumatic brain injury

Alessandro Zambusi, Klara Tereza Novoselc, Saskia Hutten, Sofia Kalpazidou, Christina Koupourtidou, Rico Schieweck, Sven Aschenbroich, Lara Silva, Ayse Seda Yazgili, Frauke van Bebber, Bettina Schmid, Gabriel Moeller, Clara Tritscher, Christian Stigloher, Claire Delbridge, Swetlana Sirko, Zeynep Irem Gunes, Sabine Liebscher, Jurgen Schlegel, Hananeh Aliee, Fabian Theis, Silke Meiners, Michael Kiebler, Dorothee Dormann, Jovica Ninkovic

Summary: The study reveals that clearing lipid droplets and TDP-43(+) condensates is crucial for restoring microglial cells to their nonactivated state and achieving scarless regeneration in zebrafish. The accumulation of these cellular components is also observed in postmortem brain tissues of patients with traumatic brain injury.

NATURE NEUROSCIENCE (2022)

Article Endocrinology & Metabolism

A transcriptional cross species map of pancreatic islet cells

Sophie Tritschler, Moritz Thomas, Anika Boettcher, Barbara Ludwig, Janine Schmid, Undine Schubert, Elisabeth Kemter, Eckhard Wolf, Heiko Lickert, Fabian J. Theis

Summary: This study provides a high-resolution transcriptional map of healthy human islet cells and their murine and porcine counterparts. The findings suggest both commonalities and differences in transcriptional profiles across different species, with important identity and functional markers shared between species. The pig data better recapitulated the heterogeneity and functional gene expression of human islet cells compared to the mouse data.

MOLECULAR METABOLISM (2022)

Article Biochemical Research Methods

Learning consistent subcellular landmarks to quantify changes in multiplexed protein maps

Hannah Spitzer, Scott Berry, Mark Donoghoe, Lucas Pelkmans, Fabian J. Theis

Summary: CAMPA is a deep learning framework that learns representations of molecular pixel profiles from multiplexed images. It clusters these representations to quantify subcellular landmarks and captures interpretable cellular phenotypes. Using this framework, the study reveals the changes in subcellular organization upon perturbation of RNA synthesis, RNA processing, or cell size, and uncovers the links between the molecular composition of membraneless organelles and cell-to-cell variability in bulk RNA synthesis rates.

NATURE METHODS (2023)

Review Oncology

Single-cell profiling to explore pancreatic cancer heterogeneity, plasticity and response to therapy

Stefanie Baerthel, Chiara Falcomata, Roland Rad, Fabian J. Theis, Dieter Saur

Summary: Pancreatic ductal adenocarcinoma (PDAC) is a highly lethal cancer with a heterogeneous genetic landscape and an immunosuppressive tumor microenvironment. Recent advances in single-cell sequencing and spatial transcriptomics have provided insights into the diversity and plasticity of PDAC, both in its malignant cells and the surrounding tissue. This review highlights the importance of single-cell analysis in understanding PDAC and discusses the potential of multimodal approaches to study its biology and response to therapy.

NATURE CANCER (2023)

Article Genetics & Heredity

Single-cell reference mapping to construct and extend cell-type hierarchies

Lieke Michielsen, Mohammad Lotfollahi, Daniel Strobl, Lisa Sikkema, Marcel J. T. Reinders, Fabian J. Theis, Ahmed Mahfouz

Summary: Single-cell genomics is generating a large amount of data, which can be integrated to create comprehensive reference atlases of tissue. However, there is a lack of systematic approach to harmonize cell type annotation terminology and depth across different datasets.

NAR GENOMICS AND BIOINFORMATICS (2023)

Meeting Abstract Respiratory System

Single cell transcriptomic dissection of virus induced immunopathology in interferon gamma receptor null mice

L. Yang, I. Angelidis, L. Heumos, M. Ansari, S. Zhou, C. Mayr, L. Simon, M. Strunz, F. Theis, H. Adler, H. Schiller

EUROPEAN RESPIRATORY JOURNAL (2022)

Meeting Abstract Respiratory System

Ex vivo modeling of human lung fibrogenesis and drug mode of action screens using single cell RNA-seq in precision-cut lung slices

N. J. Lang, M. Ansari, L. Yang, S. Zhou, D. Porras-Gonzalez, A. Agami, C. H. Mayr, B. Hooshiar Kashani, L. Heumos, M. Gerckens, Y. Chen, J. Schniering, M. Stoleriu, R. Hatz, J. Behr, F. J. Theis, G. Burgstaller, H. B. Schiller

EUROPEAN RESPIRATORY JOURNAL (2022)

Meeting Abstract Immunology

Revisiting the innate immune response - a temporal resolution of whole blood stimulation

N. Reusch, S. Mueller, K. Bassler, L. Bonaguro, J. Schulte-Schrepping, L. Balsevicius, K. Dahm, V Isakzai, T. Kapellos, S. Warnat-Herresthal, C. Kroeger, S. Agrawal, N. Balzer, T. Pecht, M. Becker, P. Guenther, C. Osei-Sarpong, W. Fujii, A. Horne, M. Lotfollahi, Y. Ji, E. Dudkin, H. J. C. Ferreira, E. Hinkley, M. Beyer, K. Haendler, J. Hasenauer, R. J. Argueello, F. Theis, M. Schmolz, A. Aschenbrenner, T. Ulas, J. L. Schultze

EUROPEAN JOURNAL OF IMMUNOLOGY (2022)

暂无数据