Editorial Material

ARCHAIC HUMANS: Four makes a party

Journal

NATURE
Volume 505, Issue 7481, Pages 32-34

Publisher

NATURE PUBLISHING GROUP
DOI: 10.1038/nature12847

Recommended

Article Biochemistry & Molecular Biology

AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models

Mihaly Varadi, Stephen Anyango, Mandar Deshpande, Sreenath Nair, Cindy Natassia, Galabina Yordanova, David Yuan, Oana Stroe, Gemma Wood, Agata Laydon, Augustin Zidek, Tim Green, Kathryn Tunyasuvunakool, Stig Petersen, John Jumper, Ellen Clancy, Richard Green, Ankur Vora, Mira Lutfi, Michael Figurnov, Andrew Cowie, Nicole Hobbs, Pushmeet Kohli, Gerard Kleywegt, Ewan Birney, Demis Hassabis, Sameer Velankar

Summary: AlphaFold DB is an openly accessible database with high-accuracy protein-structure predictions, powered by DeepMind's AlphaFold v2.0. It provides programmatic access to a vast number of predicted structures and is expanding to cover more sequences.

NUCLEIC ACIDS RESEARCH (2022)

Article Biochemistry & Molecular Biology

The European Bioinformatics Institute (EMBL-EBI) in 2021

Gaia Cantelli, Alex Bateman, Cath Brooksbank, Anton Petrov, Rahuman S. Malik-Sheriff, Michele Ide-Smith, Henning Hermjakob, Paul Flicek, Rolf Apweiler, Ewan Birney, Johanna McEntyre

Summary: The European Bioinformatics Institute (EMBL-EBI) offers a wide range of freely available molecular data resources, including new resources like the PGS Catalog and AlphaFold DB. They have also been involved in developing community-driven data standards, such as the Recommended Metadata for Biological Images and the BioModels Reproducibility Scorecard. Training is a core mission of EMBL-EBI, with improvements to their online training offerings being part of this year's update.

NUCLEIC ACIDS RESEARCH (2022)

Editorial Material Medicine, Research & Experimental

Mendelian Randomization

Ewan Birney

Summary: Mendelian randomization borrows statistical techniques from economics to analyze the effects of various factors on human biology and disease. By using genetic variation as instrumental variables, it can disentangle the effects of these factors on different outcomes.

COLD SPRING HARBOR PERSPECTIVES IN MEDICINE (2022)

Article Genetics & Heredity

The Gene Curation Coalition: A global effort to harmonize gene-disease evidence resources

Marina T. DiStefano, Scott Goehringer, Lawrence Babb, Fowzan S. Alkuraya, Joanna Amberger, Mutaz Amin, Christina Austin-Tse, Marie Balzotti, Jonathan S. Berg, Ewan Birney, Carol Bocchini, Elspeth A. Bruford, Alison J. Coffey, Heather Collins, Fiona Cunningham, Louise C. Daugherty, Yaron Einhorn, Helen Firth, David R. Fitzpatrick, Rebecca E. Foulger, Jennifer Goldstein, Ada Hamosh, Matthew R. Hurles, Sarah E. Leigh, Ivone U. S. Leong, Sateesh Maddirevula, Christa L. Martin, Ellen M. McDonagh, Annie Olry, Arina Puzriakova, Kelly Radtke, Erin M. Ramos, Ana Rath, Erin Rooney Riggs, Angharad M. Roberts, Charlotte Rodwell, Catherine Snow, Zornitza Stark, Jackie Tahiliani, Susan Tweedie, James S. Ware, Phillip Weller, Eleanor Williams, Caroline F. Wright, Thabo Michael Yates, Heidi L. Rehm

Summary: This study addresses the lack of universal standards and terminologies in defining gene-disease relationships. The Gene Curation Coalition (GenCC) was formed to establish harmonized definitions and develop a unified database. The results show that conflicts in gene-disease validity assertions exist, highlighting the importance of standardization and collaboration in genetic testing and variant interpretation.

GENETICS IN MEDICINE (2022)

Article Multidisciplinary Sciences

A joint NCBI and EMBL-EBI transcript set for clinical genomics and research

Joannella Morales, Shashikant Pujar, Jane E. Loveland, Alex Astashyn, Ruth Bennett, Andrew Berry, Eric Cox, Claire Davidson, Olga Ermolaeva, Catherine M. Farrell, Reham Fatima, Laurent Gil, Tamara Goldfarb, Jose M. Gonzalez, Diana Haddad, Matthew Hardy, Toby Hunt, John Jackson, Vinita S. Joardar, Michael Kay, Vamsi K. Kodali, Kelly M. McGarvey, Aoife McMahon, Jonathan M. Mudge, Daniel N. Murphy, Michael R. Murphy, Bhanu Rajput, Sanjida H. Rangwala, Lillian D. Riddick, Francoise Thibaud-Nissen, Glen Threadgold, Anjana R. Vatsan, Craig Wallin, David Webb, Paul Flicek, Ewan Birney, Kim D. Pruitt, Adam Frankish, Fiona Cunningham, Terence D. Murphy

Summary: Comprehensive genome annotation is crucial for understanding clinically relevant variants, but the lack of standardized reporting and browser display complicates interpretation and reporting. To address this, Ensembl/GENCODE and RefSeq launched the Matched Annotation from NCBI and EMBL-EBI (MANE) collaboration to define universal standards for variant reporting and browser display. The MANE transcript sets provide representative transcripts for each human protein-coding gene, improving consistency and facilitating clinical interpretation.

NATURE (2022)

Article Biochemistry & Molecular Biology

Nanopore ReCappable sequencing maps SARS-CoV-2 5′ capping sites and provides new insights into the structure of sgRNAs

Camilla Ugolini, Logan Mulroney, Adrien Leger, Matteo Castelli, Elena Criscuolo, Maia Kavanagh Williamson, Andrew D. Davidson, Abdulaziz Almuqrin, Roberto Giambruno, Miten Jain, Gianmaria Frige, Hugh Olsen, George Tzertzinis, Ira Schildkraut, Madalee G. Wulf, Ivan R. Correa, Laurence Ettwiller, Nicola Clementi, Massimo Clementi, Nicasio Mancini, Ewan Birney, Mark Akeson, Francesco Nicassio, David A. Matthews, Tommaso Leonardi

Summary: The researchers used a new technique called NRCeq to identify the complete subgenomic RNAs of the SARS-CoV-2 virus and annotate the capping sites in the viral genome. They obtained robust estimates of subgenomic RNA expression in cell lines and viral isolates, and discovered novel subgenomic RNA variants.

NUCLEIC ACIDS RESEARCH (2022)

Article Multidisciplinary Sciences

Selective clonal persistence of human retroviruses in vivo: Radial chromatin organization, integration site, and host transcription

Anat Melamed, Tomas W. Fitzgerald, Yuchuan Wang, Jian Ma, Ewan Birney, Charles R. M. Bangham

Summary: Human retroviruses HTLV-1 and HIV-1 persist in the body as a reservoir of latently infected T cell clones. The study found that the position of the provirus in the nucleus, its distance from the centromere, and the intensity of local host genome transcription are important factors determining clonal survival. Similar factors were also found to explain clonal persistence of HIV-1. This research highlights the importance of the intranuclear and intrachromosomal location of the provirus and host transcription intensity in the persistence of human retroviruses.

SCIENCE ADVANCES (2022)

Correction Multidisciplinary Sciences

Genomic reconstruction of the SARS-CoV-2 epidemic in England (vol 600, pg 506, 2021)

Harald S. Vohringer, Theo Sanderson, Matthew Sinnott, Nicola De Maio, Thuy Nguyen, Richard Goater, Frank Schwach, Ian Harrison, Joel Hellewell, Cristina V. Ariani, Sonia Goncalves, David K. Jackson, Ian Johnston, Alexander W. Jung, Callum Saint, John Sillitoe, Maria Suciu, Nick Goldman

NATURE (2022)

Article Multidisciplinary Sciences

The contribution of common regulatory and protein-coding TYR variants to the genetic architecture of albinism

Vincent Michaud, Eulalie Lasseaux, David J. Green, Dave T. Gerrard, Claudio Plaisant, Tomas Fitzgerald, Ewan Birney, Benoit Arveiler, Graeme C. Black, Panagiotis Sergouniotis

Summary: By studying a large cohort of individuals with albinism, researchers identified common and rare gene variants associated with the disorder, indicating a complex genetic architecture.

NATURE COMMUNICATIONS (2022)

Article Biotechnology & Applied Microbiology

Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design

Lukas Weilguny, Nicola De Maio, Rory Munro, Charlotte Manser, Ewan Birney, Matthew Loose, Nick Goldman

Summary: BOSS-RUNS is an algorithmic framework and software that dynamically updates decision strategies based on real-time updates of uncertainty at each genome position. It optimizes information gain by deciding whether to fully sequence each DNA fragment, leading to improved variant calling in microbial communities.

NATURE BIOTECHNOLOGY (2023)

Article Multidisciplinary Sciences

Using machine learning to model older adult inpatient trajectories from electronic health records data

Maria Herrero-Zazo, Tomas Fitzgerald, Vince Taylor, Helen Street, Afzal N. Chaudhry, John R. Bradley, Ewan Birney, Victoria L. Keevil

Summary: Electronic Health Records (EHR) data can provide valuable insights into inpatient trajectories. By representing blood tests and vital signs as multivariate time-series (MVTS), unsupervised Hidden Markov Models (HMM) can be trained to classify each day of hospital admission as one of 17 states. Clinical interpretation of these HMM states revealed their associations with inpatient mortality and specific diagnoses. Machine learning models trained with MVTS data showed promising performance in predicting inpatient mortality, indicating the potential for developing decision-support tools for EHR systems.

ISCIENCE (2023)

Article Genetics & Heredity

Sub-cellular level resolution of common genetic variation in the photoreceptor layer identifies continuum between rare disease and common variation

Hannah Currant, Tomas W. Fitzgerald, Praveen J. Patel, Anthony P. Khawaja, Andrew R. Webster, Omar A. Mahroo, Ewan Birney

Summary: We conducted the largest genome-wide association study of photoreceptor cell (PRC) morphology to date using optical coherence tomography (OCT). We identified 111 loci associated with PRC thickness, many of which had prior associations with ocular phenotypes and pathologies. We also discovered 10 genes associated with PRC thickness through gene burden testing using exome data. These findings provide evidence for a relationship between common and rare genetic variation in retinal biology.

PLOS GENETICS (2023)

Article Biochemistry & Molecular Biology

AlphaFold Protein Structure Database in 2024: providing structure coverage for over 214 million protein sequences

Mihaly Varadi, Damian Bertoni, Paulyna Magana, Urmila Paramval, Ivanna Pidruchna, Malarvizhi Radhakrishnan, Maxim Tsenkov, Sreenath Nair, Milot Mirdita, Jingi Yeo, Oleg Kovalevskiy, Kathryn Tunyasuvunakool, Agata Laydon, Augustin Zidek, Hamish Tomlinson, Dhavanthi Hariharan, Josh Abrahamson, Tim Green, John Jumper, Ewan Birney, Martin Steinegger, Demis Hassabis, Sameer Velankar

Summary: The AlphaFold Protein Structure Database (AlphaFold DB) has expanded significantly since its initial release in 2021, now containing over 214 million predicted protein structures. Powered by the AlphaFold2 artificial intelligence (AI) system, the database has integrated its predictions into primary data resources such as PDB, UniProt, Ensembl, InterPro, and MobiDB. This manuscript details the enhancements made to data archiving, including the addition of model organisms, global health proteomes, Swiss-Prot integration, and curated protein datasets. The access mechanisms of AlphaFold DB, from direct file access to advanced queries using Google Cloud Public Datasets, are also discussed, along with improvements and added services since its release, such as enhancements to the Predicted Aligned Error viewer and the 3D viewer customization options.

NUCLEIC ACIDS RESEARCH (2023)

Article Medical Informatics

COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records

Johan H. Thygesen, Christopher Tomlinson, Sam Hollings, Mehrdad A. Mizani, Alex Handy, Ashley Akbari, Amitava Banerjee, Jennifer Cooper, Alvina G. Lai, Kezhi Li, Bilal A. Mateen, Naveed Sattar, Reecha Sofat, Ana Torralbo, Honghan Wu, Angela Wood, Jonathan A. C. Sterne, Christina Pagel, William N. Whiteley, Cathie Sudlow, Harry Hemingway, Spiros Denaxas

Summary: This study used nationwide linked electronic health records to define and validate ten COVID-19 phenotypes, providing insights into the different stages and transitions of the disease. The results showed infection rates, hospitalization rates, intensive care unit usage, and mortality rates of COVID-19. Longer patient trajectories were observed in the second wave compared to the first wave.

LANCET DIGITAL HEALTH (2022)

Article Biotechnology & Applied Microbiology

Genomic variations and epigenomic landscape of the Medaka Inbred Kiyosu-Karlsruhe (MIKK) panel

Adrien Leger, Ian Brettell, Jack Monahan, Carl Barton, Nadeshda Wolf, Natalja Kusminski, Cathrin Herder, Narendar Aadepu, Clara Becker, Jakob Gierten, Omar T. Hammouda, Eva Hasel, Colin Lischik, Katharina Lust, Natalia Sokolova, Risa Suzuki, Tinatini Tavhelidse, Thomas Thumberger, Erika Tsingos, Philip Watson, Bettina Welz, Kiyoshi Naruse, Felix Loosli, Joachim Wittbrodt, Ewan Birney, Tomas Fitzgerald

Summary: In this study, the researchers used long read data from Oxford Nanopore Technologies to analyze the genomes of medaka, creating a specific pan-genome reference dataset for the Medaka Inbred Kiyosu-Karlsruhe panel. This dataset allows for the investigation of novel variation types that would be difficult to detect using standard approaches.

GENOME BIOLOGY (2022)
