Michael L. Tress

ORCID: 0000-0001-9046-6370
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • RNA and protein synthesis mechanisms
  • Genomics and Phylogenetic Studies
  • RNA modifications and cancer
  • Protein Structure and Dynamics
  • RNA Research and Splicing
  • Enzyme Structure and Function
  • Machine Learning in Bioinformatics
  • Bioinformatics and Genomic Networks
  • Molecular Biology Techniques and Applications
  • Advanced Proteomics Techniques and Applications
  • Genomics and Chromatin Dynamics
  • Computational Drug Discovery Methods
  • Chromosomal and Genetic Variations
  • Cancer-related molecular mechanisms research
  • Genomics and Rare Diseases
  • Protein Degradation and Inhibitors
  • Genetics, Bioinformatics, and Biomedical Research
  • Microbial Metabolic Engineering and Bioproduction
  • Microbial Natural Products and Biosynthesis
  • Genetics and Neurodevelopmental Disorders
  • Genetic and phenotypic traits in livestock
  • Monoclonal and Polyclonal Antibodies Research
  • Biomedical Text Mining and Ontologies
  • Machine Learning in Materials Science
  • PI3K/AKT/mTOR signaling in cancer

Spanish National Cancer Research Centre
2015-2024

Centro de Investigación del Cáncer
2012-2016

Spanish National Centre for Cardiovascular Research
2015

Dana-Farber Cancer Institute
2012

Cancer Research Center
2012

Biocom
2009

University of Manchester
2007

European Bioinformatics Institute
2007

Instituto de Salud Carlos III
2007

Consejo Superior de Investigaciones Científicas
2004-2006

The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since first public release this annotation data set, few new protein-coding loci have been added, yet number alternative splicing transcripts annotated has steadily increased. 7 contains 20,687 9640 long noncoding RNA 33,977 coding not represented UCSC genes RefSeq. It also most comprehensive (lncRNA) publicly available...

10.1101/gr.135350.111 article EN cc-by-nc Genome Research 2012-09-01

The accurate identification and description of the genes in human mouse genomes is a fundamental requirement for high quality analysis data informing both genome biology clinical genomics. Over last 15 years, GENCODE consortium has been producing reference gene annotations to provide this foundational resource. includes experimental computational groups who work together improve extend annotation. Specifically, we generate primary data, create bioinformatics tools support expert manual...

10.1093/nar/gky955 article EN cc-by Nucleic Acids Research 2018-10-08

Abstract The GENCODE project annotates human and mouse genes transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology clinical genomics. annotation processes make use of primary bioinformatic tools analysis generated both within the consortium externally to support creation transcript structures determination their function. Here, we present improvements our infrastructure, bioinformatics tools, analysis, advances they in...

10.1093/nar/gkaa1087 article EN cc-by Nucleic Acids Research 2020-10-25

Determining the full complement of protein-coding genes is a key goal genome annotation. The most powerful approach for confirming potential detection cellular protein expression through peptide mass spectrometry (MS) experiments. Here, we mapped peptides detected in seven large-scale proteomics studies to almost 60% GENCODE annotation human genome. We found strong relationship between experiments and both gene family age cross-species conservation. Most which were highly conserved. >96%...

10.1093/hmg/ddu309 article EN cc-by-nc Human Molecular Genetics 2014-06-16

High-throughput sequencing of full-length transcripts using long reads has paved the way for discovery thousands novel transcripts, even in well-annotated mammalian species. The advances technology have created a need studies and tools that can characterize these variants. Here, we present SQANTI, an automated pipeline classification long-read assess quality data preprocessing 47 unique descriptors. We apply SQANTI to neuronal mouse transcriptome Pacific Biosciences (PacBio) illustrate how...

10.1101/gr.222976.117 article EN cc-by-nc Genome Research 2018-02-09
Naihui Zhou Yuxiang Jiang Timothy Bergquist Alexandra Lee Balint Z. Kacsoh and 95 more Alex W. Crocker Kimberley A. Lewis George P. Georghiou Huy Nguyen Md-Nafiz Hamid L. Taylor Davis Tunca Doğan Volkan Atalay Ahmet Süreyya Rifaioğlu Alperen Dalkıran Rengül Çetin-Atalay Chengxin Zhang Rebecca L. Hurto Peter L. Freddolino Yang Zhang Prajwal Bhat Fran Supek José M. Fernández Branislava Gemović Vladimir Perović Radoslav Davidović Neven Šumonja Nevena Veljković Ehsaneddin Asgari Mohammad R. K. Mofrad Giuseppe Profiti Castrense Savojardo Pier Luigi Martelli Rita Casadio Florian Boecker Heiko Schoof Indika Kahanda Natalie Thurlby Alice C. McHardy Alexandre Renaux Rabie Saidi Julian Gough Alex A. Freitas Magdalena Antczak Fábio Fabris Mark N. Wass Jie Hou Jianlin Cheng Zheng Wang Alfonso E. Romero Alberto Paccanaro Haixuan Yang Tatyana Goldberg Chenguang Zhao Liisa Holm Petri Törönen Alan Medlar Elaine Zosa Itamar Borukhov Ilya B. Novikov Angela D. Wilkins Olivier Lichtarge Po-Han Chi Wei-Cheng Tseng Michal Linial Peter W. Rose Christophe Dessimoz Vedrana Vidulin Sašo Džeroski Ian Sillitoe Sayoni Das Jonathan Lees David T. Jones Cen Wan Domenico Cozzetto Rui Fa Mateo Torres Alex Warwick Vesztrocy José Manuel Rodrı́guez Michael L. Tress Marco Frasca Marco Notaro Giuliano Grossi Alessandro Petrini Matteo Ré Giorgio Valentini Marco Mesiti Daniel B. Roche Jonas Reeb David W. Ritchie Sabeur Aridhi Seyed Ziaeddin Alborzi Marie‐Dominique Devignes Da Chen Emily Koo Richard Bonneau Vladimir Gligorijević Meet Barot Hai Fang Stefano Toppo Enrico Lavezzo

Abstract Background The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation protein function. Results Here, we report on results third CAFA challenge, CAFA3, that featured expanded analysis over previous rounds, both in terms volume data analyzed types performed. In a novel major new development, predictions assessment goals drove some experimental assays, resulting functional annotations for...

10.1186/s13059-019-1835-8 article EN cc-by Genome biology 2019-11-19

GENCODE produces high quality gene and transcript annotation for the human mouse genomes. All is supported by experimental data serves as a reference genome biology clinical genomics. The consortium generates targeted data, develops bioinformatic tools carries out analyses that, along with externally produced methods, support identification of structures determination their function. Here, we present an update on genes, including developments in tools, major collaborations which underpin...

10.1093/nar/gkac1071 article EN cc-by Nucleic Acids Research 2022-11-24

Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. has been designed to provide value manual the genome by adding reliable protein structural and functional data information from cross-species conservation. The visual representation provided for each gene allows annotators researchers alike easily identify changes brought about splicing events. In addition collecting, integrating analyzing predictions effect events, also...

10.1093/nar/gks1058 article EN cc-by-nc Nucleic Acids Research 2012-11-17

The classic organization of a gene structure has followed the Jacob and Monod bacterial model proposed more than 50 years ago. Since then, empirical determinations complexity transcriptomes found in yeast to human blurred definition physical boundaries genes. Using multiple analysis approaches we have characterized individual mapping on chromosomes 21 22. Analyses locations 5′ 3′ transcriptional termini 492 protein coding genes revealed that for 85% these extend beyond current annotated...

10.1371/journal.pone.0028213 article EN cc-by PLoS ONE 2012-01-04

Alternative premessenger RNA splicing enables genes to generate more than one gene product. Splicing events that occur within protein coding regions have the potential alter biological function of expressed and even create new functions. has been suggested as explanation for discrepancy between number human functional complexity. Here, we carry out a detailed study alternatively spliced products annotated in ENCODE pilot project. We find alternative is frequent commonly suggested,...

10.1073/pnas.0700800104 article EN Proceedings of the National Academy of Sciences 2007-03-20

The identification of protein-protein interaction sites is an essential intermediate step for mutant design and the prediction protein networks. In recent years a significant number methods have been developed to predict these interface residues here we review current status field. Progress in this area requires clear view methodology applied, data sets used training testing systems, evaluation procedures. We analysed impact representative set features algorithms highlighted problems...

10.1093/bib/bbp021 article EN Briefings in Bioinformatics 2008-12-06

The APPRIS database (http://appris-tools.org) uses protein structural and functional features information from cross-species conservation to annotate splice isoforms in protein-coding genes. selects a single isoform, the 'principal' as reference for each gene based on these annotations. A main isoform reflects biological reality most coding genes principal are best predictors of proteins isoforms. Here, we present updates database, new developments that include addition three species...

10.1093/nar/gkx997 article EN cc-by-nc Nucleic Acids Research 2017-10-19

Chimeric RNAs comprise exons from two or more different genes and have the potential to encode novel proteins that alter cellular phenotypes. To date, numerous putative chimeric transcripts been identified among ESTs isolated several organisms using high throughput RNA sequencing. The few corresponding protein products characterized mostly result chromosomal translocations are associated with cancer. Here, we systematically establish some of genuinely expressed in human cells. Using...

10.1101/gr.130062.111 article EN cc-by-nc Genome Research 2012-05-15

Although eukaryotic cells express a wide range of alternatively spliced transcripts, it is not clear whether genes tend to transcripts simultaneously across cells, or produce dominant isoforms in manner that either tissue-specific regardless tissue. To date, large-scale investigations into the pattern transcript expression distinct tissues have produced contradictory results. Here, we attempt determine splice variant at protein level. We interrogate peptides from eight human proteomics...

10.1021/pr501286b article EN Journal of Proteome Research 2015-03-03

Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool genome annotation projects. Peptides detected experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% genes annotated by GENCODE consortium for human as part a comprehensive analysis experimental spectra from two large publicly available databases. We protein "novel" "putative"...

10.1093/molbev/mss100 article EN cc-by-nc Molecular Biology and Evolution 2012-03-22

Alternative splicing of messenger RNA can generate a wide variety mature transcripts, and these transcripts may produce protein isoforms with diverse cellular functions. While there is much supporting evidence for the expression alternative same not true alternatively spliced products. Large-scale mass spectroscopy experiments have identified at level, but conflicting results. Here we carried out rigorous analysis peptide from eight large-scale proteomics to assess scale that detectable by...

10.1371/journal.pcbi.1004325 article EN cc-by PLoS Computational Biology 2015-06-10

The role of alternative splicing is one the great unanswered questions in cellular biology. There strong evidence for at transcript level, and transcriptomics experiments show that many splice events are tissue specific. It has been suggested evolved order to remodel tissue-specific protein-protein networks. Here we investigated among isoforms detected a large-scale proteomics analysis. Although data supporting limited protein clear patterns emerged small numbers could detect data. More than...

10.1371/journal.pcbi.1008287 article EN cc-by PLoS Computational Biology 2020-10-05

Seventeen years after the sequencing of human genome, proteome is still under revision. One in eight 22 210 coding genes listed by Ensembl/GENCODE, RefSeq and UniProtKB reference databases are annotated differently across three sets. We have carried out an in-depth investigation on 2764 classified as one or more sets manual curators not others. Data from large-scale genetic variation analyses suggests that most protein-like purifying selection so unlikely to code for functional proteins. A...

10.1093/nar/gky587 article EN cc-by-nc Nucleic Acids Research 2018-06-18

APPRIS (https://appris.bioinfo.cnio.es) is a well-established database housing annotations for protein isoforms range of species. selects principal based on structure and function features cross-species conservation. Most coding genes produce single main isoform the chosen by best represent this cellular isoform. Human genetic data, experimental evidence distribution clinical variants all support relevance isoforms. have now been expanded to 10 model organisms. In paper we highlight most...

10.1093/nar/gkab1058 article EN cc-by-nc Nucleic Acids Research 2021-10-21

Germline loss-of-function variants in CTNNB1 cause neurodevelopmental disorder with spastic diplegia and visual defects (NEDSDV; OMIM 615075) are the most frequent, recurrent monogenic of cerebral palsy (CP). We investigated range clinical phenotypes owing to disruptions determine association between NEDSDV CP.Genetic information from 404 individuals collectively 392 pathogenic were ascertained for study. From these, detailed 52 previously unpublished collected combined 68 published...

10.1016/j.gim.2022.08.006 article EN cc-by-nc-nd Genetics in Medicine 2022-09-09

GENCODE produces comprehensive reference gene annotation for human and mouse. Entering its twentieth year, the project remains highly active as new technologies methodologies allow us to catalog genome at ever-increasing granularity. In particular, long-read transcriptome sequencing enables identify large numbers of missing transcripts substantially improve existing models, our long non-coding RNA catalogs have undergone a dramatic expansion reconfiguration result. Meanwhile, we are...

10.1093/nar/gkae1078 article EN cc-by Nucleic Acids Research 2024-11-20
Coming Soon ...