NFDI4DS | UHH-SEMS - Publication Details

Eugene Kulesha

ORCID: 0000-0002-4285-6232

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5015292353

Research Areas

Genomics and Phylogenetic Studies
Genetic Associations and Epidemiology
Machine Learning in Bioinformatics
Epigenetics and DNA Methylation
Genomics and Rare Diseases
Biomedical Text Mining and Ontologies
Gene expression and cancer classification
Genetics, Bioinformatics, and Biomedical Research
Glycosylation and Glycoproteins Research
Genomics and Chromatin Dynamics
Chromosomal and Genetic Variations
Genetic diversity and population structure
Genomic variations and chromosomal abnormalities
Bioinformatics and Genomic Networks
Genetic Mapping and Diversity in Plants and Animals
Microbial Metabolic Engineering and Bioproduction
Protein Structure and Dynamics
RNA regulation and disease
Enzyme Structure and Function
RNA and protein synthesis mechanisms
RNA modifications and cancer
Plant Disease Resistance and Genetics
RNA Research and Splicing
Nutrition, Genetics, and Disease
Zoonotic diseases and public health

European Bioinformatics Institute
2007-2016

Wellcome Trust
2006-2016

Wellcome Sanger Institute
2006-2016

Oxford Metrics (United Kingdom)
2015

MRC Laboratory of Molecular Biology
2013

Cold Spring Harbor Laboratory
2013

Cornell University
2013

A global reference for human genetic variation

OPENALEX - Publications

Adam Auton Gonçalo R. Abecasis David Altshuler Richard Durbin Gonçalo R. Abecasis and 95 more

The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing diverse individuals from multiple populations. Here we report completion the project, having reconstructed genomes 2,504 26 populations using combination low-coverage sequencing, deep exome and dense microarray genotyping. We characterized broad spectrum variation, in total over 88 million variants (84.7 single nucleotide polymorphisms (SNPs), 3.6...

10.1038/nature15393 article EN cc-by-nc-sa Nature 2015-09-29

Ensembl 2014

OPENALEX - Publications

Paul Flicek M Ridwan Amode Daniel Barrell Kathryn Beal Konstantinos Billis and 47 more

Ensembl (http://www.ensembl.org) creates tools and data resources to facilitate genomic analysis in chordate species with an emphasis on human, major vertebrate model organisms farm animals. Over the past year we have increased number of that support 77 expanded our genome browser a new scrollable overview improved variation phenotype views. We also report updates core datasets improvements gene homology relationships from addition species. Our REST service has been extended additional for...

10.1093/nar/gkt1196 article EN cc-by Nucleic Acids Research 2013-12-06

Ensembl 2013

OPENALEX - Publications

Paul Flicek Ikhlak Ahmed M Ridwan Amode Daniel Barrell Kathryn Beal and 50 more

The Ensembl project (http://www.ensembl.org) provides genome information for sequenced chordate genomes with a particular focus on human, mouse, zebrafish and rat. Our resources include evidenced-based gene sets all supported species; large-scale whole multiple species alignments across vertebrates clade-specific eutherian mammals, primates, birds fish; variation data 17 regulation annotations based ENCODE other sets. are accessible through the browser at http://www.ensembl.org tools...

10.1093/nar/gks1236 article EN cc-by-nc Nucleic Acids Research 2012-11-30

Ensembl 2012

OPENALEX - Publications

Paul Flicek M Ridwan Amode Daniel Barrell Kathryn Beal Shannon E. Brent and 52 more

The Ensembl project (http://www.ensembl.org) provides genome resources for chordate genomes with a particular focus on human data as well key model organisms such mouse, rat and zebrafish. Five additional species were added in the last year including gibbon (Nomascus leucogenys) Tasmanian devil (Sarcophilus harrisii) bringing total number of supported to 61 release 64 (September 2011). Of these, 55 appear main website six are provided preview site (Pre!Ensembl; http://pre.ensembl.org)...

10.1093/nar/gkr991 article EN cc-by-nc Nucleic Acids Research 2011-11-15

Ensembl 2009

OPENALEX - Publications

Tim Hubbard Bronwen Aken Sarah Ayling Benoît Ballester Kathryn Beal and 53 more

The Ensembl project (http://www.ensembl.org) is a comprehensive genome information system featuring an integrated set of annotation, databases, and other for chordate, selected model organism disease vector genomes. As release 51 (November 2008), fully supports 45 species, three additional species have preliminary support. New in the past year include orangutan six low coverage mammalian Major additions improvements to since our previous report major redesign website; generation multiple...

10.1093/nar/gkn828 article EN cc-by-nc Nucleic Acids Research 2008-11-25

A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis

OPENALEX - Publications

Thomas A. Down Vardhman K. Rakyan Daniel J. Turner Paul Flicek Heng Li and 17 more

10.1038/nbt1414 article EN Nature Biotechnology 2008-07-01

Ensembl 2007

OPENALEX - Publications

Tim Hubbard Bronwen Aken Kathryn Beal Benoît Ballester Mario Cáccamo and 53 more

The Ensembl (http://www.ensembl.org/) project provides a comprehensive and integrated source of annotation chordate genome sequences. Over the past year number genomes available from has increased 15 to 33, with addition sites for mammalian elephant, rabbit, armadillo, tenrec, platypus, pig, cat, bush baby, common shrew, microbat european hedgehog; fish stickleback medaka second example sea squirt (Ciona savignyi) mosquito (Aedes aegypti). Some major features added during include first...

10.1093/nar/gkl996 article EN Nucleic Acids Research 2006-12-06

Ensembl 2011

OPENALEX - Publications

Paul Flicek M Ridwan Amode Daniel Barrell Kathryn Beal Shannon E. Brent and 47 more

The Ensembl project ( http://www.ensembl.org ) seeks to enable genomic science by providing high quality, integrated annotation on chordate and selected eukaryotic genomes within a consistent accessible infrastructure. All supported species include comprehensive, evidence-based gene annotations set of includes additional data focused variation, comparative, evolutionary, functional regulatory annotation. most advanced resources are provided for key including human, mouse, rat zebrafish...

10.1093/nar/gkq1064 article EN cc-by-nc Nucleic Acids Research 2010-11-02

Ensembl Genomes 2016: more genomes, more complexity

OPENALEX - Publications

Paul Kersey James E. Allen Irina M. Armean Sanjay Boddu Bruce J. Bolt and 33 more

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources vertebrate genomics developed in context of project (http://www.ensembl.org). Together, two provide a consistent set programmatic and interactive interfaces to rich range including reference sequence, gene models, transcriptional data, genetic variation comparative analysis. This paper provides update previous publications about resource,...

10.1093/nar/gkv1209 article EN cc-by Nucleic Acids Research 2015-11-17

Ensembl comparative genomics resources

OPENALEX - Publications

Javier Herrero Matthieu Muffato Kathryn Beal Stephen Fitzgerald Leo I. Gordon and 9 more

Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets homologous genes other relevant datasets in order evaluate answer evolutionary-related questions. However, complexity computational requirements producing such are substantial: this has led only a small number reference resources that used for most analyses. Ensembl one set...

10.1093/database/bav096 article EN cc-by Database 2016-01-01

The 1000 Genomes Project: data management and community access

OPENALEX - Publications

Laura Clarke Xiangqun Zheng-Bradley Richard Smith Eugene Kulesha Chunlin Xiao and 7 more

The 1000 Genomes Project was launched as one of the largest distributed data collection and analysis projects ever undertaken in biology. In addition to primary scientific goals creating both a deep catalog human genetic variation extensive methods accurately discover characterize using new sequencing technologies, project makes all its publicly available. Members coordination center have developed deployed several tools enable widespread access.

10.1038/nmeth.1974 article EN cc-by-nc-sa Nature Methods 2012-04-27

Ensembl 2008

OPENALEX - Publications

Paul Flicek Bronwen Aken Kathryn Beal Benoît Ballester Mario Cáccamo and 54 more

The Ensembl project (http://www.ensembl.org) is a comprehensive genome information system featuring an integrated set of annotation, databases and other for chordate selected model organism disease vector genomes. As release 47 (October 2007), fully supports 35 species, with preliminary support six additional species. New species in the past year include platypus horse. Major additions improvements to since our previous report extensive functional genomics data form specialized database,...

10.1093/nar/gkm988 article EN cc-by-nc Nucleic Acids Research 2007-11-14

An integrated resource for genome-wide identification and analysis of human tissue-specific differentially methylated regions (tDMRs)

OPENALEX - Publications

Vardhman K. Rakyan Thomas A. Down Natalie Thorne Paul Flicek Eugene Kulesha and 15 more

We report a novel resource (methylation profiles of DNA, or mPod) for human genome-wide tissue-specific DNA methylation profiles. mPod consists three fully integrated parts, reference 13 normal somatic tissues, placenta, sperm, and an immortalized cell line, visualization tool that has been with the Ensembl genome browser new algorithm analysis immunoprecipitation-based demonstrate utility our by identifying first comprehensive set differentially methylated regions (tDMRs) may play role in...

10.1101/gr.077479.108 article EN Genome Research 2008-06-24

The SCOP database in 2020: expanded classification of representative family and superfamily domains of known protein structures

OPENALEX - Publications

Antonina Andreeva Eugene Kulesha Julian Gough Alexey G. Murzin

Abstract The Structural Classification of Proteins (SCOP) database is a classification protein domains organised according to their evolutionary and structural relationships. We report major effort increase the coverage data, aiming provide almost all domain superfamilies with representatives in PDB. have also improved schema, provided new API modernised web interface. This by far most significant update since SCOP 1.75 builds on advances schema from 2 prototype. accessible...

10.1093/nar/gkz1064 article EN cc-by Nucleic Acids Research 2019-10-30

SCOP2 prototype: a new approach to protein structure mining

OPENALEX - Publications

Antonina Andreeva Dave Howorth Cyrus Chothia Eugene Kulesha Alexey G. Murzin

We present a prototype of new structural classification proteins, SCOP2 (http://scop2.mrc-lmb.cam.ac.uk/), that we have developed recently. is successor to the Structural Classification Proteins (SCOP, http://scop.mrc-lmb.cam.ac.uk/scop/) database. Similarly SCOP, main focus organize structurally characterized proteins according their and evolutionary relationships. was designed provide more advanced framework for protein structure annotation classification. It defines approach essentially...

10.1093/nar/gkt1242 article EN cc-by Nucleic Acids Research 2013-11-29

Ensembl's 10th year

OPENALEX - Publications

Paul Flicek Bronwen Aken Benoît Ballester Kathryn Beal Eugene Bragin and 52 more

Ensembl(http://www.ensembl.org)integrates genomic information for a comprehensive set of chordate genomes with particular focus on resources human, mouse, rat, zebrafish and other high-value sequenced genomes.We provide complete gene annotations all supported species in addition to specific that target genome variation, function evolution.Ensembl data is accessible variety formats including via our browser, API BioMart.This year marks the tenth anniversary Ensembl time project has grown...

10.1093/nar/gkp972 article EN cc-by-nc Nucleic Acids Research 2009-11-11

Ensembl Genomes 2013: scaling up access to genome-wide data

OPENALEX - Publications

Paul Kersey James E. Allen Mikkel Christensen Paul A. Davis Lee J. Falin and 28 more

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies genome annotation, analysis dissemination, developed in the context of vertebrate-focused project, provides a complementary set resources species through consistent programmatic interactive interfaces. These provide access to including reference sequence, gene models, transcriptional data, polymorphisms comparative...

10.1093/nar/gkt979 article EN cc-by Nucleic Acids Research 2013-10-25

Ensembl Genomes: an integrative resource for genome-scale data from non-vertebrate species

OPENALEX - Publications

Paul Kersey D. Staines Daniel Lawson Eugene Kulesha Paul Derwent and 16 more

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrative resource for genome-scale data from non-vertebrate species. The project exploits and extends technology (for genome annotation, analysis dissemination) developed in the context of (vertebrate-focused) provides a complementary set resources species through consistent programmatic interactive interfaces. These provide access to including reference sequence, gene models, transcriptional data, polymorphisms comparative analysis....

10.1093/nar/gkr895 article EN cc-by-nc Nucleic Acids Research 2011-11-08

Ensembl Genomes: Extending Ensembl across the taxonomic space

OPENALEX - Publications

Paul Kersey Daniel Lawson Ewan Birney Paul Derwent Matthias Haimel and 15 more

Ensembl Genomes (http://www.ensemblgenomes.org) is a new portal offering integrated access to genome-scale data from non-vertebrate species of scientific interest, developed using the genome annotation and visualisation platform. consists five sub-portals (for bacteria, protists, fungi, plants invertebrate metazoa) designed complement availability vertebrate genomes in Ensembl. Many databases supporting have been built close collaboration with community, which we consider as essential for...

10.1093/nar/gkp871 article EN Nucleic Acids Research 2009-10-31

Ensembl variation resources

OPENALEX - Publications

Yuan Chen Fiona Cunningham Daniel Ríos William McLaren James Smith and 8 more

Abstract Background The maturing field of genomics is rapidly increasing the number sequenced genomes and producing more information from those previously sequenced. Much this additional variation data derived sampling multiple individuals a given species with goal discovering new variants characterising population frequencies that are already known. These have immense value for many studies, including designed to understand evolution connect genotype phenotype. Maximising utility requires...

10.1186/1471-2164-11-293 article EN cc-by BMC Genomics 2010-05-11

What's in my pot? Real-time species identification on the MinION

OPENALEX - Publications

Sissel Juul Fernando Izquierdo Adam M. Hurst Xiaoguang Dai Amber Wright and 3 more

Abstract Whole genome sequencing on next-generation instruments provides an unbiased way to identify the organisms present in complex metagenomic samples. However, time-to-result can be protracted because of fixed-time runs and cumbersome bioinformatics workflows. This limits utility approach settings where rapid species identification is crucial, such as quality control food-chain components, or during outbreak infectious disease. Here we What’s my Pot? (WIMP), a laboratory analysis...

10.1101/030742 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2015-11-06

RNAcentral: an international database of ncRNA sequences

OPENALEX - Publications

Anton I. Petrov Simon Kay Richard Gibson Eugene Kulesha D. Staines and 35 more

The field of non-coding RNA biology has been hampered by the lack availability a comprehensive, up-to-date collection accessioned sequences. Here we present first release RNAcentral, database that collates and integrates information from an international consortium established sequence databases. initial contains over 8.1 million sequences, including representatives all major functional classes. A web portal (http://rnacentral.org) provides free access to data, search functionality,...

10.1093/nar/gku991 article EN cc-by Nucleic Acids Research 2014-10-28

Integrating biological data – the Distributed Annotation System

OPENALEX - Publications

Andy Jenkinson Mario Albrecht Ewan Birney Hagen Blankenburg Thomas A. Down and 10 more

The Distributed Annotation System (DAS) is a widely adopted protocol for dynamically integrating wide range of biological data from geographically diverse sources. DAS continues to expand its applicability and evolve in response new challenges facing integrative bioinformatics. Here we describe the various infrastructure components present extended version specification. Version 1.53E incorporates several recent developments, including extension serve types an ontology protein features. Our...

10.1186/1471-2105-9-s8-s3 article EN cc-by BMC Bioinformatics 2008-07-22

Coming Soon ...