NFDI4DS | UHH-SEMS - Publication Details

Clustal W and Clustal X version 2.0

OPENALEX - Publications

Mark Larkin, Gordon Blackshields, Nigel P. Brown, Chenna Ramu, Paul McGettigan, and 8 more

Summary: The Clustal W and Clustal X multiple sequence alignment programs have been completely rewritten in C++. This will facilitate the further development of the alignment algorithms in the future and has allowed proper porting of the programs to the latest versions of Linux, Macintosh and Windows operating systems. Availability: The programs can be run on-line from the EBI web server: http://www.ebi.ac.uk/tools/clustalw2. The source code and executables for Windows, Linux and Macintosh computers are available from the EBI ftp site ftp://ftp.ebi.ac.uk/pub/software/clustalw2/ Contact:...

10.1093/bioinformatics/btm404 article EN Bioinformatics 2007-09-10

Fast, scalable generation of high‐quality protein multiple sequence alignments using Clustal Omega

Fabian Sievers, Andreas Wilm, David Dineen, Toby J. Gibson, Kevin Karplus, and 7 more

10.1038/msb.2011.75 article EN Molecular Systems Biology 2011-01-01

InterProScan 5: genome-scale protein function classification

Philip Jones, David Binns, Hsin-Yu Chang, Matthew Fraser, Weizhong Li, and 12 more

Motivation: Robust large-scale sequence analysis is a major challenge in modern genomic science, where biologists are frequently trying to characterize many millions of sequences. Here, we describe a new Java-based architecture for the widely used protein function prediction software package InterProScan. Developments include improvements and additions to the outputs of the software and the complete reimplementation of the software framework, resulting in a flexible and stable system that is able to use both multiprocessor machines and/or conventional...

10.1093/bioinformatics/btu031 article EN cc-by Bioinformatics 2014-01-23

A new bioinformatics analysis tools framework at EMBL-EBI

M. Goujon, Hamish McWilliam, Weizhong Li, F. Valentin, Silvano Squizzato, and 2 more

The EMBL-EBI provides access to various mainstream sequence analysis applications. These include sequence similarity search services such as BLAST, FASTA, InterProScan and multiple sequence alignment tools such as ClustalW, T-Coffee and MUSCLE. Through the sequence similarity search services, users can search mainstream sequence databases such as EMBL-Bank and UniProt, and more than 2000 completed genomes and proteomes. We present here a new framework aimed at both novice as well as expert users that exposes novel methods of obtaining annotations and visualizing results through one uniform and consistent...

10.1093/nar/gkq313 article EN Nucleic Acids Research 2010-05-03

Analysis Tool Web Services from the EMBL-EBI

Hamish McWilliam, Weizhong Li, Mahmut Uludağ, Silvano Squizzato, Young Mi Park, and 3 more

Since 2004 the European Bioinformatics Institute (EMBL-EBI) has provided access to a wide range of databases and analysis tools via Web Services interfaces. This comprises services to search across the databases available from the EMBL-EBI and to explore the network of cross-references present in the data (e.g. EB-eye), services to retrieve entry data in various data formats and to access the data in specific fields (e.g. dbfetch), and analysis tool services, for example, sequence similarity search (e.g. FASTA and NCBI BLAST), multiple sequence alignment (e.g. Clustal Omega and MUSCLE), pairwise sequence alignment and protein functional analysis (e.g. InterProScan...

10.1093/nar/gkt376 article EN cc-by Nucleic Acids Research 2013-05-11

The EMBL-EBI bioinformatics web and programmatic tools framework

Weizhong Li, Andrew Cowley, Mahmut Uludağ, Tamer Gur, Hamish McWilliam, and 4 more

Since 2009 the EMBL-EBI Job Dispatcher framework has provided free access to a range of mainstream sequence analysis applications. These include sequence similarity search services (https://www.ebi.ac.uk/Tools/sss/) such as BLAST, FASTA and PSI-Search, multiple sequence alignment tools (https://www.ebi.ac.uk/Tools/msa/) such as Clustal Omega, MAFFT and T-Coffee, and other sequence analysis tools (https://www.ebi.ac.uk/Tools/pfa/) such as InterProScan. Through these services users can search mainstream sequence databases such as ENA, UniProt and Ensembl Genomes, utilising a uniform web interface or...

10.1093/nar/gkv279 article EN cc-by-nc Nucleic Acids Research 2015-04-06

The IMGT/HLA database

James Robinson, Matthew Waller, Sylvie C. Fail, Hamish McWilliam, Rodrigo López, and 2 more

It is 14 years since the IMGT/HLA database was first released, providing the HLA community with a searchable repository of highly curated HLA sequences. The HLA complex is located within the 6p21.3 region of human chromosome 6 and contains more than 220 genes of diverse function. Of these, 21 genes encode proteins of the immune system that are highly polymorphic. The naming of these HLA genes and alleles and their quality control is the responsibility of the World Health Organization Nomenclature Committee for Factors of the HLA System. Through the work of the HLA Informatics Group and in collaboration...

10.1093/nar/gkn662 article EN cc-by-nc Nucleic Acids Research 2008-10-07

The IMGT/HLA database

James Robinson, Jason A. Halliwell, Hamish McWilliam, Rodrigo López, Peter Parham, and 1 more

It is 14 years since the IMGT/HLA database was first released, providing the HLA community with a searchable repository of highly curated HLA sequences. The HLA complex is located within the 6p21.3 region of human chromosome 6 and contains more than 220 genes of diverse function. Of these, 21 genes encode proteins of the immune system that are highly polymorphic. The naming of these HLA genes and alleles and their quality control is the responsibility of the World Health Organization Nomenclature Committee for Factors of the HLA System. Through the work of the HLA Informatics Group and in collaboration...

10.1093/nar/gks949 article EN cc-by-nc Nucleic Acids Research 2012-10-17

IPD—the Immuno Polymorphism Database

James Robinson, Kavita Mistry, Hamish McWilliam, Rodrigo López, Steven G. E. Marsh

The Immuno Polymorphism Database (IPD), http://www.ebi.ac.uk/ipd/ is a set of specialist databases related to the study of polymorphic genes in the immune system. The IPD project works with specialist groups or nomenclature committees who provide and curate individual sections before they are submitted to IPD for online publication. The IPD project stores all the data in a set of related databases. IPD currently consists of four databases: IPD-KIR, contains the allelic sequences of killer-cell immunoglobulin-like receptors, IPD-MHC, a database of sequences of the major histocompatibility...

10.1093/nar/gkp879 article EN cc-by-nc Nucleic Acids Research 2009-10-29

The IMGT/HLA database

James Robinson, Kavita Mistry, Hamish McWilliam, Rodrigo López, Peter Parham, and 1 more

It is 12 years since the IMGT/HLA database was first released, providing the HLA community with a searchable repository of highly curated HLA sequences. The HLA complex is located within the 6p21.3 region of human chromosome 6 and contains more than 220 genes of diverse function. Many of the genes encode proteins of the immune system and are highly polymorphic. The naming of these HLA genes and alleles and their quality control is the responsibility of the WHO Nomenclature Committee for Factors of the HLA System. Through the work of the HLA Informatics Group and in collaboration with the European Bioinformatics Institute,...

10.1093/nar/gkq998 article EN cc-by-nc Nucleic Acids Research 2010-11-11

EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats

Jon Ison, Matúš Kalaš, Inge Jonassen, Dan Bolser, Mahmut Uludağ, and 5 more

Motivation: Advancing the search, publication and integration of bioinformatics tools and resources demands consistent machine-understandable descriptions. A comprehensive ontology allowing such descriptions is therefore required. Results: EDAM is an ontology of bioinformatics operations (tool or workflow functions), types of data and identifiers, application domains and data formats. EDAM supports semantic annotation of diverse entities such as Web services, databases, programmatic libraries, standalone tools, interactive applications,...

10.1093/bioinformatics/btt113 article EN cc-by Bioinformatics 2013-03-11

The EMBL Nucleotide Sequence Database

T. Kulikova, R.A. Akhtar, P. Aldebert, N. Althorpe, Martin Andersson, and 29 more

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl), maintained at the European Bioinformatics Institute (EBI) near Cambridge, UK, is a comprehensive collection of nucleotide sequences and annotation from available public sources. The database is part of an international collaboration with DDBJ (Japan) and GenBank (USA). Data are exchanged daily between the collaborating institutes to achieve swift synchrony. Webin is the preferred tool for individual submissions of nucleotide sequences, including Third Party...

10.1093/nar/gki098 article EN other-oa Nucleic Acids Research 2004-12-17

EMBL Nucleotide Sequence Database in 2006

T. Kulikova, R. Akhtar, P. Aldebert, N. Althorpe, Martin Andersson, and 29 more

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl) at the European Bioinformatics Institute, UK, offers a large and freely accessible collection of nucleotide sequences and accompanying annotation. The database is maintained in collaboration with DDBJ and GenBank. Data are exchanged between the collaborating databases on a daily basis to achieve optimal synchrony. Webin is the preferred tool for individual submissions of nucleotide sequences, including Third Party Annotation, alignments and bulk data. Automated...

10.1093/nar/gkl913 article EN cc-by-nc Nucleic Acids Research 2006-12-06

IPD—the Immuno Polymorphism Database

James Robinson, Jason A. Halliwell, Hamish McWilliam, Rodrigo López, Steven G. E. Marsh

The Immuno Polymorphism Database (IPD), http://www.ebi.ac.uk/ipd/ is a set of specialist databases related to the study of polymorphic genes in the immune system. The IPD project works with specialist groups or nomenclature committees who provide and curate individual sections before they are submitted to IPD for online publication. The IPD project stores all the data in a set of related databases. IPD currently consists of four databases: IPD-KIR, contains the allelic sequences of killer-cell immunoglobulin-like receptors, IPD-MHC, a database of sequences of the major histocompatibility...

10.1093/nar/gks1140 article EN cc-by-nc Nucleic Acids Research 2012-11-23

Petabyte-scale innovations at the European Nucleotide Archive

Guy Cochrane, R.A. Akhtar, James Bonfield, L. Bower, F. Demiralp, and 22 more

Dramatic increases in the throughput of nucleotide sequencing machines, and the promise of ever greater performance, have thrust bioinformatics into the era of petabyte-scale data sets. Sequence repositories, which provide the feed for these data sets into the worldwide computational infrastructure, are challenged by the impact of these data volumes. The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/embl), comprising the EMBL Nucleotide Sequence Database and the Ensembl Trace Archive, has identified challenges in the storage, movement, analysis, interpretation...

10.1093/nar/gkn765 article EN cc-by-nc Nucleic Acids Research 2008-11-01

Facing growth in the European Nucleotide Archive

Guy Cochrane, Blaise Alako, Clara Amid, Lawrence Bower, Ana Cerdeño-Tárraga, and 24 more

The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/) collects, maintains and presents comprehensive nucleic acid sequence and related information as part of the permanent public scientific record. Here, we provide brief updates on ENA content developments and major service enhancements in 2012 and describe in more detail two important areas of development and policy that are driven by ongoing growth in sequencing technologies. First, we describe the ENA data warehouse, a resource for which we provide a programmatic entry point to...

10.1093/nar/gks1175 article EN cc-by-nc Nucleic Acids Research 2012-11-29

Priorities for nucleotide trace, sequence and annotation data capture at the Ensembl Trace Archive and the EMBL Nucleotide Sequence Database

Guy Cochrane, R.A. Akhtar, P. Aldebert, N. Althorpe, A. Baldwin, and 31 more

The Ensembl Trace Archive (http://trace.ensembl.org/) and the EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/), known together as the European Nucleotide Archive, continue to see growth in data volume and diversity. Selected major developments of 2007 are presented briefly, along with data submission and retrieval information. In the face of increasing requirements for nucleotide trace, sequence and annotation data archiving, data capture priority decisions have been taken at the European Nucleotide Archive. Priorities are discussed in terms of how reliably...

10.1093/nar/gkm1018 article EN cc-by-nc Nucleic Acids Research 2007-11-27

Web services at the European Bioinformatics Institute-2009

Hamish McWilliam, F. Valentin, M. Goujon, Weizhong Li, N. Mathivanan, and 3 more

The European Bioinformatics Institute (EMBL-EBI) has been providing access to mainstream databases and tools in bioinformatics since 1997. In addition to the traditional web form based interfaces, APIs exist for core data resources such as EMBL-Bank, Ensembl, UniProt, InterPro, PDB and ArrayExpress. These APIs are based on Web Services (SOAP/REST) interfaces that allow users to systematically access databases and analytical tools. From the user's point of view, these Web Services provide the same functionality as the browser-based forms. However, using the APIs frees...

10.1093/nar/gkp302 article EN cc-by-nc Nucleic Acids Research 2009-05-12

Improvements to services at the European Nucleotide Archive

Rasko Leinonen, Ruth A. Akhtar, Ewan Birney, James Bonfield, Lawrence Bower, and 24 more

The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena) is Europe's primary nucleotide sequence archival resource, safeguarding open nucleotide data access, engaging in worldwide collaborative data exchange and integrating with the scientific publication process. ENA has made significant contributions to the collaborative nucleotide archival arena as an active proponent of extending the traditional collaboration to cover capillary and next-generation sequencing information. We have continued to co-develop data and metadata representation formats with our...

10.1093/nar/gkp998 article EN cc-by-nc Nucleic Acids Research 2009-11-10

Assembly information services in the European Nucleotide Archive

Nima Pakseresht, Blaise Alako, Clara Amid, Ana Cerdeño-Tárraga, Iain Cleland, and 25 more

The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena) is a repository for the world public domain nucleotide sequence data output. ENA content covers a spectrum of data types including raw reads, assembly data and functional annotation. ENA has faced a dramatic growth in genome assembly submission rates, data volumes and complexity of datasets. This has prompted a broad reworking of assembly submission services, for which we now reach the end of a major programme of work and many enhancements have already been made available over the year to components of the submission service. In this...

10.1093/nar/gkt1082 article EN cc-by Nucleic Acids Research 2013-11-08

The EBI Search engine: providing search and retrieval functionality for biological data from EMBL-EBI

Silvano Squizzato, Young Mi Park, Nicola Buso, Tamer Gur, Andrew Cowley, and 6 more

The European Bioinformatics Institute (EMBL-EBI; https://www.ebi.ac.uk) provides free and unrestricted access to data across all major areas of biology and biomedicine. Searching and extracting knowledge across these domains requires a fast and scalable solution that addresses the requirements of domain experts as well as casual users. We present the EBI Search engine, referred to here as 'EBI Search', an easy-to-use fast text search and indexing system with powerful data navigation and retrieval capabilities. API integration provides access to analytical tools,...

10.1093/nar/gkv316 article EN cc-by-nc Nucleic Acids Research 2015-04-08

Fast and efficient searching of biological data resources--using EB-eye

F. Valentin, Silvano Squizzato, M. Goujon, Hamish McWilliam, J. Paern, and 1 more

The EB-eye is a fast and efficient search engine that provides easy and uniform access to the biological data resources hosted at EMBL-EBI. Currently, users can access information from more than 62 distinct datasets covering some 400 million entries. The data resources represented in the EB-eye include: nucleotide and protein sequences at both the genomic and proteomic levels, structures ranging from chemicals to macro-molecular complexes, gene-expression experiments, binary level molecular interactions as well as reaction maps and pathway models, functional...

10.1093/bib/bbp065 article EN Briefings in Bioinformatics 2010-02-11

PSI-Search: iterative HOE-reduced profile SSEARCH searching

Weizhong Li, Hamish McWilliam, Mickael Goujon, Andrew Cowley, Rodrigo López, and 1 more

Iterative similarity searches with PSI-BLAST position-specific score matrices (PSSMs) find many more homologs than single searches, but PSSMs can be contaminated when homologous alignments are extended into unrelated protein domains (homologous over-extension, HOE). PSI-Search combines an optimal Smith-Waterman local alignment sequence search, using SSEARCH, with the PSI-BLAST profile construction strategy. An optional sequence boundary-masking procedure, which prevents alignments from being extended after they are initially included,...

10.1093/bioinformatics/bts240 article EN cc-by-nc Bioinformatics 2012-04-25

BioCatalogue: A Curated Web Service Registry For The Life Science Community

Franck Tanoh, Carole Goble, Khalid Belhajjame, Franck Tanoh, Jiten Bhagat, and 6 more

Web Services have gained a momentum as a means for packaging existing data and computational resources in a form that is amenable for use and composition by third party applications. The life science community is certainly among the first adopters of Web Services. For example, Taverna (http://www.mygrid.org.uk), a workflow workbench popular within the life science community, provides access to over 3500 thousands web services that can be composed by scientists for constructing and enacting their in silico experiments. However, one of the main issues...

10.1038/npre.2009.3132 preprint EN Nature Precedings 2009-04-22

Using EMBL‐EBI Services via Web Interface and Programmatically via Web Services

Rodrigo López, Andrew Cowley, Weizhong Li, Hamish McWilliam

The European Bioinformatics Institute (EMBL-EBI) provides access to a wide range of databases and analysis tools that are of key importance in bioinformatics. As well as providing Web interfaces to these resources, Web Services are available using SOAP and REST protocols that enable programmatic access to our resources and allow their integration into other applications and analytical workflows. This unit describes the various options available to a typical researcher or bioinformatician who wishes to use our resources via Web interface or programmatically via a range of programming languages.

10.1002/0471250953.bi0312s48 article EN Current Protocols in Bioinformatics 2014-12-01

Hamish McWilliam