NFDI4DS | UHH-SEMS - Publication Details

Guy Cochrane

ORCID: 0000-0001-7954-7057

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5072348463

Research Areas

Genomics and Phylogenetic Studies
Microbial Community Ecology and Physiology
Environmental DNA in Biodiversity Studies
RNA and protein synthesis mechanisms
Research Data Management Practices
Scientific Computing and Data Management
Bacteriophages and microbial interactions
Species Distribution and Climate Change
Genetics, Bioinformatics, and Biomedical Research
Gene expression and cancer classification
Protist diversity and phylogeny
Biomedical Text Mining and Ontologies
RNA modifications and cancer
Cancer Genomics and Diagnostics
CRISPR and Genetic Engineering
Molecular Biology Techniques and Applications
Rangeland and Wildlife Management
Coral and Marine Ecosystems Studies
Bioinformatics and Genomic Networks
Marine and fisheries research
Gut microbiota and health
Invertebrate Taxonomy and Ecology
SARS-CoV-2 and COVID-19 Research
Cancer-related molecular mechanisms research
Algorithms and Data Compression

European Bioinformatics Institute
2016-2025

Wellcome Trust
2009-2023

Bulgarian Academy of Sciences
2023

Institute of Biodiversity and Ecosystem Research
2023

Pensoft Publishers (Bulgaria)
2023

University of Tartu Natural History Museum and Botanical Garden
2023

Centre for Genomic Regulation
2022

SIB Swiss Institute of Bioinformatics
2022

University of Lausanne
2022

University of Newcastle Australia
2018

Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea

OPENALEX - Publications

Robert M. Bowers Nikos C. Kyrpides Ramūnas Stepanauskas Miranda Harmon-Smith Devin F. R. Doud and 49 more

We present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of Minimum Information about Any (x) Sequence (MIxS). The a Single Amplified Genome (MISAG) Metagenome-Assembled (MIMAG), including, but not limited to, assembly quality, estimates completeness contamination. These can be used in combination with other GSC checklists, including (MIGS), Metagenomic (MIMS), Marker Gene (MIMARKS). Community-wide...

10.1038/nbt.3893 article EN cc-by Nature Biotechnology 2017-08-01

BlobToolKit – Interactive Quality Assessment of Genome Assemblies

OPENALEX - Publications

Richard Challis E. G. Richards Jeena Rajan Guy Cochrane Mark Blaxter

Reconstruction of target genomes from sequence data produced by instruments that are agnostic as to the species-of-origin may be confounded contaminant DNA. Whether introduced during sample processing or through co-extraction alongside DNA, if insufficient care is taken assembly process, final assembled genome a mixture several species. Such assemblies can confound sequence-based biological inference and, when deposited in public databases, included downstream analyses users unaware...

10.1534/g3.119.400908 article EN cc-by G3 Genes Genomes Genetics 2020-02-19

The minimum information about a genome sequence (MIGS) specification

OPENALEX - Publications

Dawn Field George M Garrity Tanya Gray Norman Morrison Jeremy Selengut and 67 more

10.1038/nbt1360 article EN Nature Biotechnology 2008-05-01

Marine DNA Viral Macro- and Microdiversity from Pole to Pole

OPENALEX - Publications

Ann Gregory Ahmed A. Zayed Nádia Conceição‐Neto Ben Temperton Benjamin Bolduc and 56 more

10.1016/j.cell.2019.03.040 article EN publisher-specific-oa Cell 2019-04-25

The International Nucleotide Sequence Database Collaboration

OPENALEX - Publications

Ilene Karsch‐Mizrachi Yasukazu Nakamura Guy Cochrane

The members of the International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org) set out to capture, preserve and present globally comprehensive public domain nucleotide sequence information. work long-standing collaboration includes provision data formats, annotation conventions routine global exchange. Among many developments INSDC resources in 2011 are newly launched BioProject database improved handling assembly In this article, we outline services update reader on 2011.

10.1093/nar/gkr1006 article EN cc-by-nc Nucleic Acids Research 2011-11-12

Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications

OPENALEX - Publications

Pelin Yilmaz Renzo Kottmann Dawn Field Rob Knight James R. Cole and 93 more

10.1038/nbt.1823 article EN Nature Biotechnology 2011-05-01

The European Nucleotide Archive

OPENALEX - Publications

Rasko Leinonen R.A. Akhtar Ewan Birney L. Bower Ana Cerdeño-Tárraga and 16 more

The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena ) is Europe's primary nucleotide-sequence repository. ENA consists of three main databases: the Sequence Read (SRA), Trace and EMBL-Bank. objective to support promote use nucleotide sequencing as an experimental research platform by providing data submission, archive, search download services. In this article, we outline these services describe major changes improvements introduced during 2010. These include extended EMBL-Bank...

10.1093/nar/gkq967 article EN Nucleic Acids Research 2010-10-23

Toward an Online Repository of Standard Operating Procedures (SOPs) for (Meta)genomic Annotation

OPENALEX - Publications

Samuel V. Angiuoli Aaron Gussman William Klimke Guy Cochrane Dawn Field and 8 more

The methodologies used to generate genome and metagenome annotations are diverse vary between groups laboratories. Descriptions of the annotation process helpful in interpreting data. Some have produced Standard Operating Procedures (SOPs) that describe process, but standards lacking for structure content these descriptions. In addition, there is no central repository store disseminate procedures protocols annotation. We highlight importance SOPs endorse an online SOPs.

10.1089/omi.2008.0017 article EN OMICS A Journal of Integrative Biology 2008-04-16

Minimum Information about an Uncultivated Virus Genome (MIUViG)

OPENALEX - Publications

Simon Roux Evelien M. Adriaenssens Bas E. Dutilh Eugene V. Koonin Andrew M. Kropinski and 56 more

This paper presents standards and best practices for reporting genome sequences of uncultivated viruses. We present an extension the Minimum Information about any (x) Sequence (MIxS) standard virus genomes. Uncultivated Virus Genome (MIUViG) were developed within Genomic Standards Consortium framework include origin, quality, annotation, taxonomic classification, biogeographic distribution in silico host prediction. Community-wide adoption MIUViG standards, which complement a Single...

10.1038/nbt.4306 article EN cc-by Nature Biotechnology 2018-12-17

MGnify: the microbiome analysis resource in 2020

OPENALEX - Publications

Alex Mitchell Alexandre Almeida Martín Beracochea Miguel Boland Josephine Burgin and 12 more

MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over past 2 years, (formerly EBI Metagenomics) has more than doubled number publicly available analysed datasets held within resource. Recently, an updated approach been unveiled (version 5.0), replacing previous single pipeline with multiple pipelines tailored...

10.1093/nar/gkz1035 article EN cc-by Nucleic Acids Research 2019-10-23

Oregon Subduction Zone: Venting, Fauna, and Carbonates

OPENALEX - Publications

L. D. Kulm Erwin Suess J. Casey Moore Bobb Carson Brian T. R. Lewis and 9 more

Transects of the submersible Alvin across rock outcrops in Oregon subduction zone have furnished information on structural and stratigraphic framework this accretionary complex. Communities clams tube worms, authigenic carbonate mineral precipitates, are associated with venting sites cool fluids located a fault-bend anticline at water depth 2036 meters. The distribution animals carbonates suggests up-dip migration from both shallow deep sources along permeable strata or fault zones within...

10.1126/science.231.4738.561 article EN Science 1986-02-07

Efficient storage of high throughput DNA sequencing data using reference-based compression

OPENALEX - Publications

Markus Hsi-Yang Fritz Rasko Leinonen Guy Cochrane Ewan Birney

Data storage costs have become an appreciable proportion of total cost in the creation and analysis DNA sequence data. Of particular concern is that rate increase sequencing significantly outstripping disk capacity. In this paper we present a new reference-based compression method efficiently compresses sequences for storage. Our approach works resequencing experiments target well-studied genomes. We align to reference genome then encode differences between most efficient when allow...

10.1101/gr.114819.110 article EN cc-by-nc Genome Research 2011-01-18

Global Trends in Marine Plankton Diversity across Kingdoms of Life

OPENALEX - Publications

Federico M. Ibarbalz Nicolas Henry Manoela C. Brandão Séverine Martini Greta Busseni and 69 more

The ocean is home to myriad small planktonic organisms that underpin the functioning of marine ecosystems. However, their spatial patterns diversity and underlying drivers remain poorly known, precluding projections responses global changes. Here we investigate latitudinal gradients predictors plankton across archaea, bacteria, eukaryotes, major virus clades using both molecular imaging data from Tara Oceans. We show a decline for most groups toward poles, mainly driven by decreasing...

10.1016/j.cell.2019.10.008 article EN cc-by-nc-nd Cell 2019-11-01

Gene Expression Changes and Community Turnover Differentially Shape the Global Ocean Metatranscriptome

OPENALEX - Publications

Guillem Salazar Lucas Paoli Adriana Alberti Jaime Huerta‐Cepas Hans‐Joachim Ruscheweyh and 66 more

Ocean microbial communities strongly influence the biogeochemistry, food webs, and climate of our planet. Despite recent advances in understanding their taxonomic genomic compositions, little is known about how transcriptomes vary globally. Here, we present a dataset 187 metatranscriptomes 370 metagenomes from 126 globally distributed sampling stations establish resource 47 million genes to study community-level across depth layers pole-to-pole. We examine gene expression changes community...

10.1016/j.cell.2019.10.014 article EN cc-by-nc-nd Cell 2019-11-01

RNAcentral: a hub of information for non-coding RNA sequences

OPENALEX - Publications

Blake Sweeney Anton I. Petrov Boris Burkov ROBERT FINN Alex Bateman and 56 more

RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences, collating information on ncRNA sequences all types from broad range organisms. We have recently added new genome mapping pipeline that identifies genomic locations for in 296 species. also several functional annotations, such as tRNA secondary structures, Gene Ontology and miRNA-target interactions. A quality control mechanism based Rfam family assignments potential contamination, incomplete more. The has become...

10.1093/nar/gky1034 article EN cc-by Nucleic Acids Research 2018-10-16

The Genomic Standards Consortium

OPENALEX - Publications

Dawn Field Linda Amaral-Zettler Guy Cochrane James R. Cole Peter Dawyndt and 18 more

A vast and rich body of information has grown up as a result the world's enthusiasm for 'omics technologies. Finding ways to describe make available this that maximise its usefulness become major effort across world. At heart is Genomic Standards Consortium (GSC), an open-membership organization drives community-based standardization activities, Here we provide short history GSC, overview range current call scientific community join forces improve quality quantity contextual about our public...

10.1371/journal.pbio.1001088 article EN cc-by PLoS Biology 2011-06-21

RNAcentral: a comprehensive database of non-coding RNA sequences

OPENALEX - Publications

Anton I. Petrov Simon Kay Ioanna Kalvari Kevin Howe Kristian Gray and 47 more

RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides single entry point for accessing all types organisms. Since its launch in 2014, has integrated twelve new resources, taking the total number collaborating to 22, began importing data, such as modified nucleotides MODOMICS PDB. We created species-specific identifiers refer unique within context species. The website been subject continuous improvements focusing on...

10.1093/nar/gkw1008 article EN cc-by Nucleic Acids Research 2016-10-18

The international nucleotide sequence database collaboration

OPENALEX - Publications

Masanori Arita Ilene Karsch‐Mizrachi Guy Cochrane

Abstract The International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org/) has been the core infrastructure for collecting and providing nucleotide sequence data metadata &gt;30 years. Three partner organizations, DNA Data Bank of Japan (DDBJ) at National Institute Genetics in Mishima, Japan; European Archive (ENA) Molecular Biology Laboratory's Bioinformatics (EMBL-EBI) Hinxton, UK; GenBank Center Biotechnology Information (NCBI), Library Medicine, Institutes...

10.1093/nar/gkaa967 article EN cc-by Nucleic Acids Research 2020-10-09

EBI Metagenomics in 2017: enriching the analysis of microbial communities, from sequence reads to assemblies

OPENALEX - Publications

Alex Mitchell Maxim Scheremetjew Hubert Denise Simon Potter Aleksandra Tarkowska and 12 more

EBI metagenomics (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the analysis and archiving of sequence data derived from microbial populations found in particular environment. Over past two years, has increased number datasets analysed 10-fold. In addition throughput, underlying pipeline been overhauled include both new or updated tools reference databases. Of note is workflow taxonomic assignments that extended based on large small subunit RNA marker genes encompass...

10.1093/nar/gkx967 article EN cc-by Nucleic Acids Research 2017-10-12

MGnify: the microbiome sequence data analysis resource in 2023

OPENALEX - Publications

Lorna Richardson Ben Allen Germana Baldi Martín Beracochea Maxwell L. Bileschi and 16 more

Abstract The MGnify platform (https://www.ebi.ac.uk/metagenomics) facilitates the assembly, analysis and archiving of microbiome-derived nucleic acid sequences. provides access to taxonomic assignments functional annotations for nearly half a million analyses covering metabarcoding, metatranscriptomic, metagenomic datasets, which are derived from wide range different environments. Over past 3 years, has not only grown in terms number datasets contained but also increased breadth provided,...

10.1093/nar/gkac1080 article EN cc-by Nucleic Acids Research 2022-12-07

Cryptic and abundant marine viruses at the evolutionary origins of Earth’s RNA virome

OPENALEX - Publications

Ahmed A. Zayed James M. Wainaina Guillermo Domínguez-Huerta Éric Pelletier Jiarong Guo and 52 more

Whereas DNA viruses are known to be abundant, diverse, and commonly key ecosystem players, RNA insufficiently studied outside disease settings. In this study, we analyzed ≈28 terabases of Global Ocean sequences expand Earth's virus catalogs their taxonomy, investigate evolutionary origins, assess marine biogeography from pole pole. Using new approaches optimize discovery classification, identified that necessitate substantive revisions taxonomy (doubling phyla adding >50% classes)...

10.1126/science.abm5847 article EN Science 2022-04-07

The International Nucleotide Sequence Database Collaboration

OPENALEX - Publications

Guy Cochrane Ilene Karsch‐Mizrachi Toshihisa Takagi International Nucleotide Sequence Database Collaboration

The International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org) comprises three global partners committed to capturing, preserving and providing comprehensive public-domain nucleotide sequence information. INSDC establishes standards, formats protocols for data metadata make it easier individuals organisations submit their reliably public archives. This work enables the continuous, exchange of information about living things. Here we present an update in 2015,...

10.1093/nar/gkv1323 article EN cc-by Nucleic Acids Research 2015-12-10

The international nucleotide sequence database collaboration

OPENALEX - Publications

Ilene Karsch‐Mizrachi Toshihisa Takagi Guy Cochrane

For more than 30 years, the International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org/) has been committed to capturing, preserving and providing access comprehensive public domain nucleotide sequence associated metadata which enables discovery in biomedicine, biodiversity biological sciences. Since 1987, DNA Data Bank of Japan (DDBJ) at National Institute for Genetics Mishima, Japan; European Archive (ENA) Molecular Biology Laboratory's Bioinformatics (EMBL-EBI)...

10.1093/nar/gkx1097 article EN cc-by-nc Nucleic Acids Research 2017-10-26

Viral to metazoan marine plankton nucleotide sequences from the Tara Oceans expedition

OPENALEX - Publications

Adriana Alberti Julie Poulain Stéfan Engelen Karine Labadie Sarah Romac and 95 more

Abstract A unique collection of oceanic samples was gathered by the Tara Oceans expeditions (2009–2013), targeting plankton organisms ranging from viruses to metazoans, and providing rich environmental context measurements. Thanks recent advances in field genomics, extensive sequencing has been performed for a deep genomic analysis this huge samples. strategy based on different approaches, such as metabarcoding, metagenomics, single-cell genomics metatranscriptomics, chosen size-fractionated...

10.1038/sdata.2017.93 article EN cc-by Scientific Data 2017-08-01

Coming Soon ...