NFDI4DS | UHH-SEMS - Publication Details

Michael L. Tress

ORCID: 0000-0001-9046-6370

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5088945131

Research Areas

RNA and protein synthesis mechanisms
Genomics and Phylogenetic Studies
RNA modifications and cancer
Protein Structure and Dynamics
RNA Research and Splicing
Enzyme Structure and Function
Machine Learning in Bioinformatics
Bioinformatics and Genomic Networks
Molecular Biology Techniques and Applications
Advanced Proteomics Techniques and Applications
Genomics and Chromatin Dynamics
Computational Drug Discovery Methods
Chromosomal and Genetic Variations
Cancer-related molecular mechanisms research
Genomics and Rare Diseases
Protein Degradation and Inhibitors
Genetics, Bioinformatics, and Biomedical Research
Microbial Metabolic Engineering and Bioproduction
Microbial Natural Products and Biosynthesis
Genetics and Neurodevelopmental Disorders
Genetic and phenotypic traits in livestock
Monoclonal and Polyclonal Antibodies Research
Biomedical Text Mining and Ontologies
Machine Learning in Materials Science
PI3K/AKT/mTOR signaling in cancer

Spanish National Cancer Research Centre
2015-2024

Centro de Investigación del Cáncer
2012-2016

Spanish National Centre for Cardiovascular Research
2015

Dana-Farber Cancer Institute
2012

Cancer Research Center
2012

Biocom
2009

University of Manchester
2007

European Bioinformatics Institute
2007

Instituto de Salud Carlos III
2007

Consejo Superior de Investigaciones Científicas
2004-2006

GENCODE: The reference human genome annotation for The ENCODE Project

OPENALEX - Publications

Jennifer Harrow Adam Frankish José M. González Electra Tapanari Mark Diekhans and 36 more

The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since first public release this annotation data set, few new protein-coding loci have been added, yet number alternative splicing transcripts annotated has steadily increased. 7 contains 20,687 9640 long noncoding RNA 33,977 coding not represented UCSC genes RefSeq. It also most comprehensive (lncRNA) publicly available...

10.1101/gr.135350.111 article EN cc-by-nc Genome Research 2012-09-01

GENCODE reference annotation for the human and mouse genomes

OPENALEX - Publications

Adam Frankish Mark Diekhans Anne-Maud Ferreira Rory Johnson Irwin Jungreis and 50 more

The accurate identification and description of the genes in human mouse genomes is a fundamental requirement for high quality analysis data informing both genome biology clinical genomics. Over last 15 years, GENCODE consortium has been producing reference gene annotations to provide this foundational resource. includes experimental computational groups who work together improve extend annotation. Specifically, we generate primary data, create bioinformatics tools support expert manual...

10.1093/nar/gky955 article EN cc-by Nucleic Acids Research 2018-10-08

GENCODE 2021

OPENALEX - Publications

Adam Frankish Mark Diekhans Irwin Jungreis Julien Lagarde Jane Loveland and 51 more

Abstract The GENCODE project annotates human and mouse genes transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology clinical genomics. annotation processes make use of primary bioinformatic tools analysis generated both within the consortium externally to support creation transcript structures determination their function. Here, we present improvements our infrastructure, bioinformatics tools, analysis, advances they in...

10.1093/nar/gkaa1087 article EN cc-by Nucleic Acids Research 2020-10-25

Multiple evidence strands suggest that there may be as few as 19 000 human protein-coding genes

OPENALEX - Publications

Iakes Ezkurdia David Juan José Manuel Rodrı́guez Adam Frankish Mark Diekhans and 4 more

Determining the full complement of protein-coding genes is a key goal genome annotation. The most powerful approach for confirming potential detection cellular protein expression through peptide mass spectrometry (MS) experiments. Here, we mapped peptides detected in seven large-scale proteomics studies to almost 60% GENCODE annotation human genome. We found strong relationship between experiments and both gene family age cross-species conservation. Most which were highly conserved. >96%...

10.1093/hmg/ddu309 article EN cc-by-nc Human Molecular Genetics 2014-06-16

SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification

OPENALEX - Publications

Manuel Tardáguila Lorena de la Fuente Cristina Martí Cécile Pereira Francisco J. Pardo-Palacios and 14 more

High-throughput sequencing of full-length transcripts using long reads has paved the way for discovery thousands novel transcripts, even in well-annotated mammalian species. The advances technology have created a need studies and tools that can characterize these variants. Here, we present SQANTI, an automated pipeline classification long-read assess quality data preprocessing 47 unique descriptors. We apply SQANTI to neuronal mouse transcriptome Pacific Biosciences (PacBio) illustrate how...

10.1101/gr.222976.117 article EN cc-by-nc Genome Research 2018-02-09

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

OPENALEX - Publications

Naihui Zhou Yuxiang Jiang Timothy Bergquist Alexandra Lee Balint Z. Kacsoh and 95 more

Abstract Background The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation protein function. Results Here, we report on results third CAFA challenge, CAFA3, that featured expanded analysis over previous rounds, both in terms volume data analyzed types performed. In a novel major new development, predictions assessment goals drove some experimental assays, resulting functional annotations for...

10.1186/s13059-019-1835-8 article EN cc-by Genome biology 2019-11-19

GENCODE: reference annotation for the human and mouse genomes in 2023

OPENALEX - Publications

Adam Frankish Sílvia Carbonell Sala Mark Diekhans Irwin Jungreis Jane Loveland and 57 more

GENCODE produces high quality gene and transcript annotation for the human mouse genomes. All is supported by experimental data serves as a reference genome biology clinical genomics. The consortium generates targeted data, develops bioinformatic tools carries out analyses that, along with externally produced methods, support identification of structures determination their function. Here, we present an update on genes, including developments in tools, major collaborations which underpin...

10.1093/nar/gkac1071 article EN cc-by Nucleic Acids Research 2022-11-24

APPRIS: annotation of principal and alternative splice isoforms

OPENALEX - Publications

José Manuel Rodrı́guez Paolo Maietta Iakes Ezkurdia Alessandro Pietrelli Jan‐Jaap Wesselink and 3 more

Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. has been designed to provide value manual the genome by adding reliable protein structural and functional data information from cross-species conservation. The visual representation provided for each gene allows annotators researchers alike easily identify changes brought about splicing events. In addition collecting, integrating analyzing predictions effect events, also...

10.1093/nar/gks1058 article EN cc-by-nc Nucleic Acids Research 2012-11-17

Evidence for Transcript Networks Composed of Chimeric RNAs in Human Cells

OPENALEX - Publications

Sarah Djebali Julien Lagarde Philipp Kapranov Vincent Lacroix Christelle Borel and 33 more

The classic organization of a gene structure has followed the Jacob and Monod bacterial model proposed more than 50 years ago. Since then, empirical determinations complexity transcriptomes found in yeast to human blurred definition physical boundaries genes. Using multiple analysis approaches we have characterized individual mapping on chromosomes 21 22. Analyses locations 5′ 3′ transcriptional termini 492 protein coding genes revealed that for 85% these extend beyond current annotated...

10.1371/journal.pone.0028213 article EN cc-by PLoS ONE 2012-01-04

Extreme genomic erosion after recurrent demographic bottlenecks in the highly endangered Iberian lynx

OPENALEX - Publications

Federico Abascal André Corvelo Fernando Cruz José Luis Villanueva‐Cañas Anna Vlasova and 45 more

Genomic studies of endangered species provide insights into their evolution and demographic history, reveal patterns genomic erosion that might limit viability, offer tools for effective conservation. The Iberian lynx (Lynx pardinus) is the most felid a unique example on brink extinction.

10.1186/s13059-016-1090-1 article EN cc-by Genome biology 2016-12-01

The implications of alternative splicing in the ENCODE protein complement

OPENALEX - Publications

Michael L. Tress Pier Luigi Martelli Adam Frankish Gabrielle A. Reeves Jan Jaap Wesselink and 41 more

Alternative premessenger RNA splicing enables genes to generate more than one gene product. Splicing events that occur within protein coding regions have the potential alter biological function of expressed and even create new functions. has been suggested as explanation for discrepancy between number human functional complexity. Here, we carry out a detailed study alternatively spliced products annotated in ENCODE pilot project. We find alternative is frequent commonly suggested,...

10.1073/pnas.0700800104 article EN Proceedings of the National Academy of Sciences 2007-03-20

Progress and challenges in predicting protein-protein interaction sites

OPENALEX - Publications

Iakes Ezkurdia Lisa Bartoli Piero Fariselli Rita Casadio Alfonso Valencia and 1 more

The identification of protein-protein interaction sites is an essential intermediate step for mutant design and the prediction protein networks. In recent years a significant number methods have been developed to predict these interface residues here we review current status field. Progress in this area requires clear view methodology applied, data sets used training testing systems, evaluation procedures. We analysed impact representative set features algorithms highlighted problems...

10.1093/bib/bbp021 article EN Briefings in Bioinformatics 2008-12-06

APPRIS 2017: principal isoforms for multiple gene sets

OPENALEX - Publications

José Manuel Rodrı́guez Juan Rodriguez-Rivas Tomás Di Domenico Jesús Vázquez Alfonso Valencia and 1 more

The APPRIS database (http://appris-tools.org) uses protein structural and functional features information from cross-species conservation to annotate splice isoforms in protein-coding genes. selects a single isoform, the 'principal' as reference for each gene based on these annotations. A main isoform reflects biological reality most coding genes principal are best predictors of proteins isoforms. Here, we present updates database, new developments that include addition three species...

10.1093/nar/gkx997 article EN cc-by-nc Nucleic Acids Research 2017-10-19

Chimeras taking shape: Potential functions of proteins encoded by chimeric RNA transcripts

OPENALEX - Publications

Milana Frenkel‐Morgenstern Vincent Lacroix Iakes Ezkurdia Yishai Levin Alexandra Gabashvili and 6 more

Chimeric RNAs comprise exons from two or more different genes and have the potential to encode novel proteins that alter cellular phenotypes. To date, numerous putative chimeric transcripts been identified among ESTs isolated several organisms using high throughput RNA sequencing. The few corresponding protein products characterized mostly result chromosomal translocations are associated with cancer. Here, we systematically establish some of genuinely expressed in human cells. Using...

10.1101/gr.130062.111 article EN cc-by-nc Genome Research 2012-05-15

Most Highly Expressed Protein-Coding Genes Have a Single Dominant Isoform

OPENALEX - Publications

Iakes Ezkurdia José Manuel Rodrı́guez Enrique Carrillo de Santa Pau Jesús Vázquez Alfonso Valencia and 1 more

Although eukaryotic cells express a wide range of alternatively spliced transcripts, it is not clear whether genes tend to transcripts simultaneously across cells, or produce dominant isoforms in manner that either tissue-specific regardless tissue. To date, large-scale investigations into the pattern transcript expression distinct tissues have produced contradictory results. Here, we attempt determine splice variant at protein level. We interrogate peptides from eight human proteomics...

10.1021/pr501286b article EN Journal of Proteome Research 2015-03-03

Comparative Proteomics Reveals a Significant Bias Toward Alternative Protein Isoforms with Conserved Structure and Function

OPENALEX - Publications

Iakes Ezkurdia Ángela del Pozo Adam Frankish José Manuel Rodrı́guez Jennifer Harrow and 3 more

Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool genome annotation projects. Peptides detected experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% genes annotated by GENCODE consortium for human as part a comprehensive analysis experimental spectra from two large publicly available databases. We protein "novel" "putative"...

10.1093/molbev/mss100 article EN cc-by-nc Molecular Biology and Evolution 2012-03-22

Alternatively Spliced Homologous Exons Have Ancient Origins and Are Highly Expressed at the Protein Level

OPENALEX - Publications

Federico Abascal Iakes Ezkurdia Juan Rodriguez-Rivas José Manuel Rodrı́guez Ángela del Pozo and 3 more

Alternative splicing of messenger RNA can generate a wide variety mature transcripts, and these transcripts may produce protein isoforms with diverse cellular functions. While there is much supporting evidence for the expression alternative same not true alternatively spliced products. Large-scale mass spectroscopy experiments have identified at level, but conflicting results. Here we carried out rigorous analysis peptide from eight large-scale proteomics to assess scale that detectable by...

10.1371/journal.pcbi.1004325 article EN cc-by PLoS Computational Biology 2015-06-10

An analysis of tissue-specific alternative splicing at the protein level

OPENALEX - Publications

José Manuel Rodrı́guez Fernando Campo del Pozo Tomás Di Domenico Jesús Vázquez Michael L. Tress

The role of alternative splicing is one the great unanswered questions in cellular biology. There strong evidence for at transcript level, and transcriptomics experiments show that many splice events are tissue specific. It has been suggested evolved order to remodel tissue-specific protein-protein networks. Here we investigated among isoforms detected a large-scale proteomics analysis. Although data supporting limited protein clear patterns emerged small numbers could detect data. More than...

10.1371/journal.pcbi.1008287 article EN cc-by PLoS Computational Biology 2020-10-05

Loose ends: almost one in five human genes still have unresolved coding status

OPENALEX - Publications

Federico Abascal David Juan Irwin Jungreis Laura Martínez Gómez Maria Rigau and 3 more

Seventeen years after the sequencing of human genome, proteome is still under revision. One in eight 22 210 coding genes listed by Ensembl/GENCODE, RefSeq and UniProtKB reference databases are annotated differently across three sets. We have carried out an in-depth investigation on 2764 classified as one or more sets manual curators not others. Data from large-scale genetic variation analyses suggests that most protein-like purifying selection so unlikely to code for functional proteins. A...

10.1093/nar/gky587 article EN cc-by-nc Nucleic Acids Research 2018-06-18

APPRIS: selecting functionally important isoforms

OPENALEX - Publications

José Manuel Rodrı́guez Fernando Campo del Pozo Daniel Cerdán-Vélez Tomás Di Domenico Jesús Vázquez and 1 more

APPRIS (https://appris.bioinfo.cnio.es) is a well-established database housing annotations for protein isoforms range of species. selects principal based on structure and function features cross-species conservation. Most coding genes produce single main isoform the chosen by best represent this cellular isoform. Human genetic data, experimental evidence distribution clinical variants all support relevance isoforms. have now been expanded to 10 model organisms. In paper we highlight most...

10.1093/nar/gkab1058 article EN cc-by-nc Nucleic Acids Research 2021-10-21

Genomic and phenotypic characterization of 404 individuals with neurodevelopmental disorders caused by CTNNB1 variants

OPENALEX - Publications

Sayaka Kayumi Luis A. Pérez‐Jurado María Palomares Sneha Rangu Sarah E. Sheppard and 70 more

Germline loss-of-function variants in CTNNB1 cause neurodevelopmental disorder with spastic diplegia and visual defects (NEDSDV; OMIM 615075) are the most frequent, recurrent monogenic of cerebral palsy (CP). We investigated range clinical phenotypes owing to disruptions determine association between NEDSDV CP.Genetic information from 404 individuals collectively 392 pathogenic were ascertained for study. From these, detailed 52 previously unpublished collected combined 68 published...

10.1016/j.gim.2022.08.006 article EN cc-by-nc-nd Genetics in Medicine 2022-09-09

GENCODE 2025: reference gene annotation for human and mouse

OPENALEX - Publications

Jonathan M. Mudge Sílvia Carbonell Sala Mark Diekhans José M. González Toby Hunt and 51 more

GENCODE produces comprehensive reference gene annotation for human and mouse. Entering its twentieth year, the project remains highly active as new technologies methodologies allow us to catalog genome at ever-increasing granularity. In particular, long-read transcriptome sequencing enables identify large numbers of missing transcripts substantially improve existing models, our long non-coding RNA catalogs have undergone a dramatic expansion reconfiguration result. Meanwhile, we are...

10.1093/nar/gkae1078 article EN cc-by Nucleic Acids Research 2024-11-20

Coming Soon ...