NFDI4DS | UHH-SEMS - Publication Details

Jennifer Harrow

ORCID: 0000-0003-0338-3070

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5080927998

Research Areas

Genomics and Phylogenetic Studies
RNA and protein synthesis mechanisms
RNA modifications and cancer
Genomics and Chromatin Dynamics
RNA Research and Splicing
Genetics, Bioinformatics, and Biomedical Research
Cancer-related molecular mechanisms research
Chromosomal and Genetic Variations
Machine Learning in Bioinformatics
Olfactory and Sensory Function Studies
Scientific Computing and Data Management
Genetic Mapping and Diversity in Plants and Animals
Animal Genetics and Reproduction
Research Data Management Practices
Genomics and Rare Diseases
Molecular Biology Techniques and Applications
Biochemical Analysis and Sensing Techniques
Genetic and phenotypic traits in livestock
Advanced Proteomics Techniques and Applications
Gene expression and cancer classification
Immune Cell Function and Interaction
Epigenetics and DNA Methylation
T-cell and B-cell Immunology
Animal Virus Infections Studies
Genomic variations and chromosomal abnormalities

AstraZeneca (Brazil)
2024

AstraZeneca (United Kingdom)
2023-2024

Genomics (United Kingdom)
2023-2024

Wellcome Sanger Institute
2013-2022

European Bioinformatics Institute
2007-2019

Illumina (United Kingdom)
2017-2019

Max Planck Institute for Developmental Biology
2013

University of California, Santa Cruz
2012-2013

University College London
2013

University of Cambridge
2013

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

OPENALEX - Publications

Ewan Birney J Stamatoyannopoulos Anindya Dutta Roderic Guigó T Gingeras and 95 more

10.1038/nature05874 article EN Nature 2007-06-01

Landscape of transcription in human cells

OPENALEX - Publications

Sarah Djebali Carrie Davis Angelika Merkel Alexander Dobin Timo Lassmann and 80 more

Eukaryotic cells make many types of primary and processed RNAs that are found either in specific subcellular compartments or throughout the cells. A complete catalogue these is not yet available their characteristic localizations also poorly understood. Because RNA represents direct output genetic information encoded by genomes a significant proportion cell's regulatory capabilities focused on its synthesis, processing, transport, modification translation, generation such crucial for...

10.1038/nature11233 article EN cc-by-nc-sa Nature 2012-09-01

The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression

OPENALEX - Publications

Thomas Derrien Rory Johnson Giovanni Bussotti Andrea Tanzer Sarah Djebali and 22 more

The human genome contains many thousands of long noncoding RNAs (lncRNAs). While several studies have demonstrated compelling biological and disease roles for individual examples, analytical experimental approaches to investigate these genes been hampered by the lack comprehensive lncRNA annotation. Here, we present analyze most complete annotation date, produced GENCODE consortium within framework ENCODE project comprising 9277 manually annotated producing 14,880 transcripts. Our analyses...

10.1101/gr.132159.111 article EN cc-by-nc Genome Research 2012-09-01

GENCODE: The reference human genome annotation for The ENCODE Project

OPENALEX - Publications

Jennifer Harrow Adam Frankish José M. González Electra Tapanari Mark Diekhans and 36 more

The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since first public release this annotation data set, few new protein-coding loci have been added, yet number alternative splicing transcripts annotated has steadily increased. 7 contains 20,687 9640 long noncoding RNA 33,977 coding not represented UCSC genes RefSeq. It also most comprehensive (lncRNA) publicly available...

10.1101/gr.135350.111 article EN cc-by-nc Genome Research 2012-09-01

A conditional knockout resource for the genome-wide study of mouse gene function

OPENALEX - Publications

William C. Skarnes Barry P. Rosen Anthony P. West Manousos Koutsourakis Wendy Bushell and 13 more

10.1038/nature10163 article EN Nature 2011-06-01

Analyses of pig genomes provide insight into porcine demography and evolution

OPENALEX - Publications

Martien A. M. Groenen Alan Archibald Hirohide Uenishi Christopher K. Tuggle Yasuhiro Takeuchi and 95 more

For 10,000 years pigs and humans have shared a close complex relationship. From domestication to modern breeding practices, shaped the genomes of domestic pigs. Here we present assembly analysis genome sequence female Duroc pig (Sus scrofa) comparison with wild from Europe Asia. Wild emerged in South East Asia subsequently spread across Eurasia. Our results reveal deep phylogenetic split between European Asian boars ∼1 million ago, selective sweep indicates selection on genes involved RNA...

10.1038/nature11622 article EN cc-by-nc-sa Nature 2012-11-01

Ensembl 2016

OPENALEX - Publications

Andrew Yates Wasiu Akanni M Ridwan Amode Daniel Barrell Konstantinos Billis and 42 more

The Ensembl project (http://www.ensembl.org) is a system for genome annotation, analysis, storage and dissemination designed to facilitate the access of genomic annotation from chordates key model organisms. It provides data 87 species across our main early Pre! websites. This year we introduced three newly annotated released numerous updates supported with concentration on latest assemblies human, mouse, zebrafish rat. We also provided two previous human assembly, GRCh37, through dedicated...

10.1093/nar/gkv1157 article EN cc-by Nucleic Acids Research 2015-12-19

A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes

OPENALEX - Publications

Daniel G. MacArthur Suganthi Balasubramanian Adam Frankish Ni Huang James Morris and 46 more

Defective Gene Detective Identifying genes that give rise to diseases is one of the major goals sequencing human genomes. However, putative loss-of-function genes, which are often some first identified targets genome and exome sequencing, have turned out be errors rather than true genetic variants. In order identify scope within genome, MacArthur et al. (p. 823 ; see Perspective by Quintana-Murci ) extensively validated genomes from 1000 Genomes Project, as well an additional European...

10.1126/science.1215040 article EN Science 2012-02-16

Ensembl 2014

OPENALEX - Publications

Paul Flicek M Ridwan Amode Daniel Barrell Kathryn Beal Konstantinos Billis and 47 more

Ensembl (http://www.ensembl.org) creates tools and data resources to facilitate genomic analysis in chordate species with an emphasis on human, major vertebrate model organisms farm animals. Over the past year we have increased number of that support 77 expanded our genome browser a new scrollable overview improved variation phenotype views. We also report updates core datasets improvements gene homology relationships from addition species. Our REST service has been extended additional for...

10.1093/nar/gkt1196 article EN cc-by Nucleic Acids Research 2013-12-06

Ensembl 2015

OPENALEX - Publications

Fiona Cunningham M Ridwan Amode Daniel Barrell Kathryn Beal Konstantinos Billis and 44 more

Ensembl (http://www.ensembl.org) is a genomic interpretation system providing the most up-to-date annotations, querying tools and access methods for chordates key model organisms. This year we released updated annotation (gene models, comparative genomics, regulatory regions variation) on new human assembly, GRCh38, although continue to support researchers using GRCh37.p13 assembly through dedicated site (http://grch37.ensembl.org). Our Regulatory Build has been revamped identify of interest...

10.1093/nar/gku1010 article EN cc-by Nucleic Acids Research 2014-10-28

Ensembl 2013

OPENALEX - Publications

Paul Flicek Ikhlak Ahmed M Ridwan Amode Daniel Barrell Kathryn Beal and 50 more

The Ensembl project (http://www.ensembl.org) provides genome information for sequenced chordate genomes with a particular focus on human, mouse, zebrafish and rat. Our resources include evidenced-based gene sets all supported species; large-scale whole multiple species alignments across vertebrates clade-specific eutherian mammals, primates, birds fish; variation data 17 regulation annotations based ENCODE other sets. are accessible through the browser at http://www.ensembl.org tools...

10.1093/nar/gks1236 article EN cc-by-nc Nucleic Acids Research 2012-11-30

Ensembl 2012

OPENALEX - Publications

Paul Flicek M Ridwan Amode Daniel Barrell Kathryn Beal Shannon E. Brent and 52 more

The Ensembl project (http://www.ensembl.org) provides genome resources for chordate genomes with a particular focus on human data as well key model organisms such mouse, rat and zebrafish. Five additional species were added in the last year including gibbon (Nomascus leucogenys) Tasmanian devil (Sarcophilus harrisii) bringing total number of supported to 61 release 64 (September 2011). Of these, 55 appear main website six are provided preview site (Pre!Ensembl; http://pre.ensembl.org)...

10.1093/nar/gkr991 article EN cc-by-nc Nucleic Acids Research 2011-11-15

Assessment of transcript reconstruction methods for RNA-seq

OPENALEX - Publications

Tamara Steijger Josep F. Abril Pär G. Engström Felix Kokocinski Martin Akerman and 53 more

We evaluated 25 protocol variants of 14 independent computational methods for exon identification, transcript reconstruction and expression-level quantification from RNA-seq data. Our results show that most algorithms are able to identify discrete components with high success rates but assembly complete isoform structures poses a major challenge even when all constituent elements identified. Expression-level estimates also varied widely across methods, based on similar models. Consequently,...

10.1038/nmeth.2714 article EN cc-by-nc-sa Nature Methods 2013-11-03

GENCODE: producing a reference annotation for ENCODE

OPENALEX - Publications

Jennifer Harrow France Denœud Adam Frankish Alexandre Reymond Chao-Kung Chen and 10 more

The GENCODE consortium was formed to identify and map all protein-coding genes within the ENCODE regions. This achieved by a combination of initial manual annotation HAVANA team, experimental validation refinement based on these results.The gene features are divided into eight different categories which only first two (known novel coding sequence) confidently predicted be genes. 5' rapid amplification cDNA ends (RACE) RT-PCR were used experimentally verify annotation. Of 420 loci tested, 229...

10.1186/gb-2006-7-s1-s4 article EN cc-by Genome biology 2006-08-07

The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes

OPENALEX - Publications

Kim D. Pruitt Jennifer Harrow Rachel Harte Craig Wallin Mark Diekhans and 44 more

Effective use of the human and mouse genomes requires reliable identification genes their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation genes, transcripts, proteins. The collaborative consensus coding sequence (CCDS) project tracks protein annotations on reference with a stable identifier (CCDS ID), ensures they consistently represented NCBI, Ensembl, UCSC Genome Browsers. Importantly,...

10.1101/gr.080531.108 article EN cc-by-nc Genome Research 2009-06-04

Multiple evidence strands suggest that there may be as few as 19 000 human protein-coding genes

OPENALEX - Publications

Iakes Ezkurdia David Juan José Manuel Rodrı́guez Adam Frankish Mark Diekhans and 4 more

Determining the full complement of protein-coding genes is a key goal genome annotation. The most powerful approach for confirming potential detection cellular protein expression through peptide mass spectrometry (MS) experiments. Here, we mapped peptides detected in seven large-scale proteomics studies to almost 60% GENCODE annotation human genome. We found strong relationship between experiments and both gene family age cross-species conservation. Most which were highly conserved. >96%...

10.1093/hmg/ddu309 article EN cc-by-nc Human Molecular Genetics 2014-06-16

Systematic evaluation of spliced alignment programs for RNA-seq data

OPENALEX - Publications

Pär G. Engström Tamara Steijger Botond Sipos Gregory R. Grant André Kahles and 6 more

Authors compare RNA-seq aligners on mouse and human data sets using benchmarks such as alignment yield, splice junction accuracy suitability for transcript reconstruction. The work highlights the strength of each program discusses outstanding needs in analysis. High-throughput RNA sequencing is an increasingly accessible method studying gene structure activity a genome-wide scale. A critical step analysis partial reads to reference genome sequence. To assess performance current mapping...

10.1038/nmeth.2722 article EN cc-by-nc-sa Nature Methods 2013-11-03

Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel

OPENALEX - Publications

Olivier Delaneau Jonathan Marchini Gil McVean Peter Donnelly Gerton Lunter and 95 more

10.1038/ncomms4934 article EN Nature Communications 2014-06-13

The vertebrate genome annotation (Vega) database

OPENALEX - Publications

Laurens Wilming James Gilbert Kerstin Howe Stephen J. Trevanion Tim Hubbard and 1 more

The Vertebrate Genome Annotation (Vega) database ( http://vega.sanger.ac.uk ) was first made public in 2004 and has been designed to view manual annotation of human, mouse zebrafish genomic sequences produced at the Wellcome Trust Sanger Institute. Since its initial release, number human annotated loci more than doubled close 33 000 now contains comprehensive on 20 24 chromosomes, four whole chromosomes around 40% Danio rerio genome. In addition, we offer a haplotype regions comparative...

10.1093/nar/gkm987 article EN cc-by-nc Nucleic Acids Research 2007-11-15

The GENCODE pseudogene resource

OPENALEX - Publications

Baikang Pei Cristina Sisu Adam Frankish Cédric Howald Lukas Habegger and 9 more

Pseudogenes have long been considered as nonfunctional genomic sequences. However, recent evidence suggests that many of them might some form biological activity, and the possibility functionality has increased interest in their accurate annotation integration with functional genomics data. As part GENCODE human genome, we present first genome-wide pseudogene assignment for protein-coding genes, based on both large-scale manual silico pipelines. A key aspect this coupled approach is it...

10.1186/gb-2012-13-9-r51 article EN cc-by Genome biology 2012-01-01

Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project

OPENALEX - Publications

Roger W. Horton Richard Gibson Penny Coggill Marcos Miretti Richard Allcock and 22 more

The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and recognised as most variable region in genome. primary aim MHC Haplotype Project was to provide a comprehensively annotated reference sequence single, leukocyte antigen-homozygous haplotype use it basis against which variations could be assessed from seven other similarly homozygous cell lines, representative common haplotypes European population. Comparison sequences,...

10.1007/s00251-007-0262-2 article EN cc-by-nc Immunogenetics 2008-01-01

Comparative analysis of the transcriptome across distant species

OPENALEX - Publications

Mark Gerstein Joel Rozowsky Koon‐Kiu Yan Daifeng Wang Chao Cheng and 91 more

Uniform processing and detailed annotation of human, worm fly RNA-sequencing data reveal ancient, conserved features the transcriptome, shared co-expression modules (many enriched in developmental genes), matched expression patterns across development similar extent non-canonical, non-coding transcription; furthermore, are used to create a single, universal model predict gene-expression levels for all three organisms from chromatin at promoter. In this paper modENCODE consortium reports on...

10.1038/nature13424 article EN cc-by-nc-sa Nature 2014-08-26

Towards FAIR principles for research software

OPENALEX - Publications

Anna‐Lena Lamprecht Leyla García Mateusz Kuzak Carlos Martínez-Ortiz Ricardo Arcila and 13 more

The FAIR Guiding Principles, published in 2016, aim to improve the findability, accessibility, interoperability and reusability of digital research objects for both humans machines.Until now principles have been mostly applied data.The ideas behind these are, however, also directly relevant software.Hence there is a distinct need explore how can be software.In this work, we summarize current status debate around software, as basis development community-agreed software future.We discuss what...

10.3233/ds-190026 article EN cc-by-nc Data Science 2019-11-13

Coming Soon ...