Guy Cochrane

ORCID: 0000-0001-7954-7057
Research Areas
  • Genomics and Phylogenetic Studies
  • Microbial Community Ecology and Physiology
  • Environmental DNA in Biodiversity Studies
  • RNA and protein synthesis mechanisms
  • Research Data Management Practices
  • Scientific Computing and Data Management
  • Bacteriophages and microbial interactions
  • Species Distribution and Climate Change
  • Genetics, Bioinformatics, and Biomedical Research
  • Gene expression and cancer classification
  • Protist diversity and phylogeny
  • Biomedical Text Mining and Ontologies
  • RNA modifications and cancer
  • Cancer Genomics and Diagnostics
  • CRISPR and Genetic Engineering
  • Molecular Biology Techniques and Applications
  • Rangeland and Wildlife Management
  • Coral and Marine Ecosystems Studies
  • Bioinformatics and Genomic Networks
  • Marine and fisheries research
  • Gut microbiota and health
  • Invertebrate Taxonomy and Ecology
  • SARS-CoV-2 and COVID-19 Research
  • Cancer-related molecular mechanisms research
  • Algorithms and Data Compression

European Bioinformatics Institute
2016-2025

Wellcome Trust
2009-2023

Bulgarian Academy of Sciences
2023

Institute of Biodiversity and Ecosystem Research
2023

Pensoft Publishers (Bulgaria)
2023

University of Tartu Natural History Museum and Botanical Garden
2023

Centre for Genomic Regulation
2022

SIB Swiss Institute of Bioinformatics
2022

University of Lausanne
2022

University of Newcastle Australia
2018

We present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of the Minimum Information about Any (x) Sequence (MIxS). The standards are the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a Metagenome-Assembled Genome (MIMAG), including, but not limited to, assembly quality, and estimates of genome completeness and contamination. These standards can be used in combination with other GSC checklists, including the Minimum Information about a Genome Sequence (MIGS), Minimum Information about a Metagenomic Sequence (MIMS), and Minimum Information about a Marker Gene Sequence (MIMARKS). Community-wide...

10.1038/nbt.3893 article EN cc-by Nature Biotechnology 2017-08-01

Reconstruction of target genomes from sequence data produced by instruments that are agnostic as to the species-of-origin may be confounded by contaminant DNA. Whether introduced during sample processing or through co-extraction alongside the target DNA, if insufficient care is taken during the assembly process, the final assembled genome may be a mixture of several species. Such assemblies can confound sequence-based biological inference and, when deposited in public databases, may be included in downstream analyses by users unaware...

10.1534/g3.119.400908 article EN cc-by G3 Genes Genomes Genetics 2020-02-19

The members of the International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org) set out to capture, preserve and present globally comprehensive public domain nucleotide sequence information. The work of the long-standing collaboration includes the provision of data formats, annotation conventions and routine global data exchange. Among the many developments to INSDC resources in 2011 are the newly launched BioProject database and improved handling of assembly information. In this article, we outline INSDC services and update the reader on developments in 2011.

10.1093/nar/gkr1006 article EN cc-by-nc Nucleic Acids Research 2011-11-12
Pelin Yilmaz Renzo Kottmann Dawn Field Rob Knight James R. Cole Linda Amaral‐Zettler Jack A. Gilbert Ilene Karsch‐Mizrachi Anjanette Johnston Guy Cochrane Robert Vaughan Chris Hunter Joonhong Park Norman Morrison Philippe Rocca‐Serra Peter Sterk Manimozhiyan Arumugam Mark Bailey Laura K. Baumgartner Bruce W. Birren Martin J. Blaser Vivien Bonazzi Tim Booth Peer Bork Frederic D. Bushman Pier Luigi Buttigieg Patrick Chain Emily S. Charlson Elizabeth K. Costello Heather Huot-Creasy Peter Dawyndt Todd Z. DeSantis Noah Fierer Jed A. Fuhrman Rachel E. Gallery Dirk Gevers Richard A. Gibbs Inigo San Gil Antonio González Jeffrey I. Gordon Robert M. Guralnick Wolfgang Hankeln Sarah K. Highlander Philip Hugenholtz Janet Jansson Andrew L. Kau Scott T. Kelley Jerry Kennedy Dan Knights Omry Koren Justin Kuczynski Nikos C. Kyrpides Robert D. Larsen Christian L. Lauber Teresa Legg Ruth E. Ley Catherine Lozupone Wolfgang Ludwig Donna Lyons Eamonn Maguire Barbara A. Methé Folker Meyer Brian D. Muegge Sara Nakielny William Nelson Diana R. Nemergut Josh D. Neufeld Lindsay K. Newbold Anna Oliver Norman R. Pace Giri Prakash Jörg Peplies Joseph F. Petrosino Lita M. Proctor Elmar Pruesse Christian Quast Jeroen Raes Sujeevan Ratnasingham Jacques Ravel David A. Relman Susanna‐Assunta Sansone Patrick D. Schloss Lynn M. Schriml Rohini Sinha Michelle I. Smith Erica Sodergren Aymé Spor Jesse Stombaugh James M. Tiedje Doyle V. Ward George M. Weinstock Doug Wendel Owen White Andrew S. Whiteley Andreas Wilke Jennifer R. Wortman Tanya Yatsunenko Frank Oliver Glöckner

10.1038/nbt.1823 article EN Nature Biotechnology 2011-05-01

The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena) is Europe's primary nucleotide-sequence repository. The ENA consists of three main databases: the Sequence Read Archive (SRA), the Trace Archive and EMBL-Bank. The objective of ENA is to support and promote the use of nucleotide sequencing as an experimental research platform by providing data submission, archive, search and download services. In this article, we outline these services and describe major changes and improvements introduced during 2010. These include extended EMBL-Bank...

10.1093/nar/gkq967 article EN Nucleic Acids Research 2010-10-23

The methodologies used to generate genome and metagenome annotations are diverse and vary between groups and laboratories. Descriptions of the annotation process are helpful in interpreting the data. Some groups have produced Standard Operating Procedures (SOPs) that describe the annotation process, but standards are lacking for the structure and content of these descriptions. In addition, there is no central repository to store and disseminate procedures and protocols for annotation. We highlight the importance of SOPs and endorse an online repository of SOPs.

10.1089/omi.2008.0017 article EN OMICS A Journal of Integrative Biology 2008-04-16

This paper presents standards and best practices for reporting genome sequences of uncultivated viruses. We present an extension of the Minimum Information about any (x) Sequence (MIxS) standard for uncultivated virus genomes. The Minimum Information about an Uncultivated Virus Genome (MIUViG) standards were developed within the Genomic Standards Consortium framework and include origin, quality, annotation, taxonomic classification, biogeographic distribution and in silico host prediction. Community-wide adoption of the MIUViG standards, which complement the Minimum Information about a Single...

10.1038/nbt.4306 article EN cc-by Nature Biotechnology 2018-12-17

MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach has been unveiled (version 5.0), replacing the previous single pipeline with multiple pipelines tailored...

10.1093/nar/gkz1035 article EN cc-by Nucleic Acids Research 2019-10-23

Transects of the submersible Alvin across rock outcrops in the Oregon subduction zone have furnished information on the structural and stratigraphic framework of this accretionary complex. Communities of clams and tube worms, and authigenic carbonate mineral precipitates, are associated with venting sites of cool fluids located on a fault-bend anticline at a water depth of 2036 meters. The distribution of animals and carbonates suggests up-dip migration of fluids from both shallow and deep sources along permeable strata or fault zones within...

10.1126/science.231.4738.561 article EN Science 1986-02-07

Data storage costs have become an appreciable proportion of total cost in the creation and analysis of DNA sequence data. Of particular concern is that the rate of increase in DNA sequencing is significantly outstripping the rate of increase in disk storage capacity. In this paper we present a new reference-based compression method that efficiently compresses DNA sequences for storage. Our approach works for resequencing experiments that target well-studied genomes. We align sequences to a reference genome and then encode the differences between them and the reference. Our method is most efficient when we allow...

10.1101/gr.114819.110 article EN cc-by-nc Genome Research 2011-01-18
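
The reference-based idea described in the abstract above (align reads to a reference genome, then store only the differences) can be illustrated with a toy sketch. This is not the published method or its file format: it assumes ungapped alignments whose positions are already known, handles substitutions only, and the names `encode_read` and `decode_read` are hypothetical.

```python
from typing import List, Tuple

# Hypothetical record layout (an assumption for illustration):
# (start position on the reference, read length, [(offset, base), ...])
Encoded = Tuple[int, int, List[Tuple[int, str]]]


def encode_read(reference: str, read: str, pos: int) -> Encoded:
    """Store a read as its alignment position plus its mismatching bases only."""
    if pos < 0 or pos + len(read) > len(reference):
        raise ValueError("read does not fit on the reference at this position")
    # Keep only the offsets where the read disagrees with the reference.
    diffs = [(i, base) for i, base in enumerate(read) if reference[pos + i] != base]
    return (pos, len(read), diffs)


def decode_read(reference: str, encoded: Encoded) -> str:
    """Rebuild the original read from the reference and the stored differences."""
    pos, length, diffs = encoded
    bases = list(reference[pos:pos + length])
    for offset, base in diffs:
        bases[offset] = base
    return "".join(bases)


reference = "ACGTACGTACGT"
read = "TACCTA"  # aligns at position 3, where the reference reads "TACGTA"
encoded = encode_read(reference, read, 3)
restored = decode_read(reference, encoded)
```

In this sketch a read that matches the reference closely costs little more than a position and a length, which is why the approach suits resequencing of well-studied genomes. Insertions, deletions, quality scores and unaligned reads would all need additional handling.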

The ocean is home to myriad small planktonic organisms that underpin the functioning of marine ecosystems. However, their spatial patterns of diversity and the underlying drivers remain poorly known, precluding projections of their responses to global changes. Here we investigate the latitudinal gradients and predictors of plankton diversity across archaea, bacteria, eukaryotes, and major virus clades using both molecular and imaging data from Tara Oceans. We show a decline of diversity for most groups toward the poles, mainly driven by decreasing...

10.1016/j.cell.2019.10.008 article EN cc-by-nc-nd Cell 2019-11-01

Ocean microbial communities strongly influence the biogeochemistry, food webs, and climate of our planet. Despite recent advances in understanding their taxonomic and genomic compositions, little is known about how their transcriptomes vary globally. Here, we present a dataset of 187 metatranscriptomes and 370 metagenomes from 126 globally distributed sampling stations and establish a resource of 47 million genes to study community-level transcriptomes across depth layers from pole to pole. We examine gene expression changes and community...

10.1016/j.cell.2019.10.014 article EN cc-by-nc-nd Cell 2019-11-01

RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences, collating information on ncRNA sequences of all types from a broad range of organisms. We have recently added a new genome mapping pipeline that identifies genomic locations for ncRNA sequences in 296 species. We have also added several new functional annotations, such as tRNA secondary structures, Gene Ontology annotations and miRNA-target interactions. A quality control mechanism based on Rfam family assignments identifies potential contamination, incomplete sequences and more. The database has become...

10.1093/nar/gky1034 article EN cc-by Nucleic Acids Research 2018-10-16

A vast and rich body of information has grown up as a result of the world's enthusiasm for 'omics technologies. Finding ways to describe and make available this information that maximise its usefulness has become a major effort across the 'omics world. At the heart of this effort is the Genomic Standards Consortium (GSC), an open-membership organization that drives community-based standardization activities. Here we provide a short history of the GSC, an overview of its range of current activities, and a call for the scientific community to join forces to improve the quality and quantity of contextual information about our public...

10.1371/journal.pbio.1001088 article EN cc-by PLoS Biology 2011-06-21

RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating resources to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique sequences within the context of a single species. The website has been subject to continuous improvements focusing on...

10.1093/nar/gkw1008 article EN cc-by Nucleic Acids Research 2016-10-18

The International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org/) has been the core infrastructure for collecting and providing nucleotide sequence data and metadata for >30 years. Three partner organizations, the DNA Data Bank of Japan (DDBJ) at the National Institute of Genetics in Mishima, Japan; the European Nucleotide Archive (ENA) at the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) in Hinxton, UK; and GenBank at the National Center for Biotechnology Information (NCBI), National Library of Medicine, National Institutes...

10.1093/nar/gkaa967 article EN cc-by Nucleic Acids Research 2020-10-09

EBI metagenomics (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the analysis and archiving of sequence data derived from the microbial populations found in a particular environment. Over the past two years, EBI metagenomics has increased the number of datasets analysed 10-fold. In addition to increased throughput, the underlying analysis pipeline has been overhauled to include both new or updated tools and reference databases. Of note is the workflow for taxonomic assignments, which has been extended to be based on both large and small subunit RNA marker genes and to encompass...

10.1093/nar/gkx967 article EN cc-by Nucleic Acids Research 2017-10-12

The MGnify platform (https://www.ebi.ac.uk/metagenomics) facilitates the assembly, analysis and archiving of microbiome-derived nucleic acid sequences. MGnify provides access to taxonomic assignments and functional annotations for nearly half a million analyses covering metabarcoding, metatranscriptomic, and metagenomic datasets, which are derived from a wide range of different environments. Over the past 3 years, MGnify has not only grown in terms of the number of datasets contained but also increased the breadth of analyses provided,...

10.1093/nar/gkac1080 article EN cc-by Nucleic Acids Research 2022-12-07

Whereas DNA viruses are known to be abundant, diverse, and commonly key ecosystem players, RNA viruses are insufficiently studied outside disease settings. In this study, we analyzed ≈28 terabases of Global Ocean RNA sequences to expand Earth's RNA virus catalogs and their taxonomy, investigate their evolutionary origins, and assess their marine biogeography from pole to pole. Using new approaches to optimize discovery and classification, we identified RNA viruses that necessitate substantive revisions of taxonomy (doubling phyla and adding >50% classes)...

10.1126/science.abm5847 article EN Science 2022-04-07

The International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org) comprises three global partners committed to capturing, preserving and providing comprehensive public-domain nucleotide sequence information. The INSDC establishes standards, formats and protocols for data and metadata to make it easier for individuals and organisations to submit their data reliably to public archives. This work enables the continuous, global exchange of information about living things. Here we present an update of the INSDC in 2015,...

10.1093/nar/gkv1323 article EN cc-by Nucleic Acids Research 2015-12-10

For more than 30 years, the International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org/) has been committed to capturing, preserving and providing access to comprehensive public domain nucleotide sequence and associated metadata, which enables discovery in biomedicine, biodiversity and biological sciences. Since 1987, the DNA Data Bank of Japan (DDBJ) at the National Institute for Genetics in Mishima, Japan; the European Nucleotide Archive (ENA) at the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI)...

10.1093/nar/gkx1097 article EN cc-by-nc Nucleic Acids Research 2017-10-26
Adriana Alberti Julie Poulain Stéfan Engelen Karine Labadie Sarah Romac Isabel Ferrera Guillaume Albini Jean‐Marc Aury Caroline Belser Alexis Bertrand Corinne Cruaud Corinne Da Silva Carole Dossat Frédérick Gavory Shahinaz Gas Julie Guy Maud Haquelle E'krame Jacoby Olivier Jaillon Arnaud Lemainque Éric Pelletier Gaëlle Samson Mark Wessner Pascal Bazire Odette Beluche Laurie Bertrand Marielle Besnard‐Gonnet Isabelle Bordelais Magali Boutard Maria Dubois Corinne Dumont Evelyne Ettedgui Patricia Carina Fernández E.S. Garcia Nathalie Aiach Thomas Guérin Chadia Hamon Élodie Brun Sandrine Lebled Patricia Lenoble Claudine Louesse Eric Mahieu Barbara Mairey Nathalie Martins Catherine Megret Claire Milani Jacqueline Muanga Céline Orvain Emilie Payen Peggy Perroud Emmanuelle Petit Dominique Robert Murielle Ronsin Benoît Vacherie Silvia G. Acinas Marta Royo‐Llonch Francisco M. Cornejo‐Castillo Ramiro Logares Beatriz Fernández-Gómez Chris Bowler Guy Cochrane Clara Amid Petra ten Hoopen Colomban de Vargas Nigel Grimsley Élodie Desgranges Stefanie Kandels‐Lewis Hiroyuki Ogata Nicole Poulton Michael E. Sieracki Ramūnas Stepanauskas Matthew B. Sullivan Jennifer R. Brum Melissa B. Duhaime Bonnie T. Poulos Bonnie L. Hurwitz Silvia G. Acinas Peer Bork Emmanuel Boss Chris Bowler Colomban De Vargas Michael Follows Gabriel Gorsky Nigel Grimsley Pascal Hingamp Daniele Iudicone Olivier Jaillon Stefanie Kandels‐Lewis Lee Karp-Boss Eric Karsenti Fabrice Not Hiroyuki Ogata Stéphane Pesant Jeroen Raes Christian Sardet Michael E. Sieracki Sabrina Speich Lars Stemmann Matthew B. Sullivan Shinichi Sunagawa

A unique collection of oceanic samples was gathered by the Tara Oceans expeditions (2009–2013), targeting plankton organisms ranging from viruses to metazoans, and providing rich environmental context measurements. Thanks to recent advances in the field of genomics, extensive sequencing has been performed for a deep genomic analysis of this huge collection of samples. A strategy based on different approaches, such as metabarcoding, metagenomics, single-cell genomics and metatranscriptomics, has been chosen for analysis of size-fractionated...

10.1038/sdata.2017.93 article EN cc-by Scientific Data 2017-08-01