Giuseppe Insana

ORCID: 0000-0002-8186-1026
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Genomics and Phylogenetic Studies
  • Machine Learning in Bioinformatics
  • Advanced Proteomics Techniques and Applications
  • Biomedical Text Mining and Ontologies
  • Scientific Computing and Data Management
  • Bioinformatics and Genomic Networks
  • Research Data Management Practices
  • Data Mining Algorithms and Applications
  • Natural Language Processing Techniques
  • Genomics and Rare Diseases
  • RNA and protein synthesis mechanisms
  • Renaissance Literature and Culture
  • Classical Antiquity Studies
  • Fractal and DNA sequence analysis
  • Computational Drug Discovery Methods
  • Language and cultural evolution
  • Genetics, Bioinformatics, and Biomedical Research
  • Semantic Web and Ontologies
  • Enzyme Structure and Function

European Bioinformatics Institute
2019-2024

Alex Bateman María Martin Sandra Orchard Michele Magrane Rahat Agivetova and 95 more Shadab Ahmad Emanuele Alpi Emily Bowler-Barnett Ramona Britto Borisas Bursteinas Hema Bye‐A‐Jee Ray Coetzee Austra Cukura Alan Da Silva Paul Denny Tunca Doğan ThankGod E. Ebenezer Jun Fan Leyla Jael Castro Penelope Garmiri George P. Georghiou Leonardo Jose da Costa Gonzales Emma Hatton-Ellis Abdulrahman Hussein Alexandr Ignatchenko Giuseppe Insana Rizwan Ishtiaq Petteri Jokinen Vishal Joshi Dushyanth Jyothi Antonia Lock Rodrigo López Aurélien Luciani Jie Luo Yvonne Lussi Alistair MacDougall Fábio Madeira Mahdi Mahmoudy M. Menchi Alok Mishra Katie Moulang Andrew Nightingale Carla Susana Oliveira Sangya Pundir Guoying Qi Shriya Raj Daniel L Rice M. Rodríguez-López Rabie Saidi J. H. Sampson Tony Sawford Elena Speretta E. B. Turner Nidhi Tyagi Preethi Vasudev Vladimir Volynkin Kate Warner Xavier Watkins Rossana Zaru Hermann Zellner Alan Bridge Sylvain Poux Nicole Redaschi Lucila Aimo Ghislaine Argoud‐Puy Andrea Auchincloss Kristian B. Axelsen Parit Bansal Delphine Baratin Marie-Claude Blatter Jerven Bolleman Emmanuel Boutet Lionel Breuza Cristina Casals‐Casas Leyla Jael Castro Kamal Chikh Echioukh Elisabeth Coudert Beatrice Cuche Mikael Doche Dolnide Dornevil Anne Estreicher Maria Livia Famiglietti Marc Feuermann Elisabeth Gasteiger Sébastien Géhant Vivienne Baillie Gerritsen Arnaud Gos Nadine Gruaz-Gumowski Ursula Hinz Chantal Hulo Nevila Hyka‐Nouspikel Florence Jungo G. Keller Arnaud Kerhornou V. Lara Philippe Le Mercier Damien Lieberherr Thierry Lombardot Xavier Martín Patrick Masson

The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set protein sequences annotated functional information. In this article, we describe significant updates that have made over last two years resource. number in UniProtKB has risen approximately 190 million, despite continued work reduce sequence redundancy at proteome level. We adopted new methods assessing completeness quality. continue extract detailed annotations from...

10.1093/nar/gkaa1100 article EN cc-by Nucleic Acids Research 2020-11-02
Alex Bateman María Martin Sandra Orchard Michele Magrane Shadab Ahmad and 95 more Emanuele Alpi Emily Bowler-Barnett Ramona Britto Hema Bye‐A‐Jee Austra Cukura Paul Denny Tunca Doğan ThankGod E. Ebenezer Jun Fan Penelope Garmiri Leonardo Jose da Costa Gonzales Emma Hatton-Ellis Abdulrahman Hussein Alexandr Ignatchenko Giuseppe Insana Rizwan Ishtiaq Vishal Joshi Dushyanth Jyothi Swaathi Kandasaamy Antonia Lock Aurélien Luciani Marija Lugaric Jie Luo Yvonne Lussi Alistair MacDougall Fábio Madeira Mahdi Mahmoudy Alok Mishra Katie Moulang Andrew Nightingale Sangya Pundir Guoying Qi Shriya Raj Pedro Raposo Daniel L Rice Rabie Saidi Rafael Santos Elena Speretta James Stephenson Prabhat Totoo E. B. Turner Nidhi Tyagi Preethi Vasudev Kate Warner Xavier Watkins Rossana Zaru Hermann Zellner Alan Bridge Lucila Aimo Ghislaine Argoud‐Puy Andrea Auchincloss Kristian B. Axelsen Parit Bansal Delphine Baratin Teresa M Batista Neto Marie-Claude Blatter Jerven Bolleman Emmanuel Boutet Lionel Breuza Blanca Cabrera Gil Cristina Casals‐Casas Kamal Chikh Echioukh Elisabeth Coudert Beatrice Cuche Edouard de Castro Anne Estreicher Maria Livia Famiglietti Marc Feuermann Elisabeth Gasteiger Pascale Gaudet Sébastien Géhant Vivienne Baillie Gerritsen Arnaud Gos Nadine Gruaz Chantal Hulo Nevila Hyka‐Nouspikel Florence Jungo Arnaud Kerhornou Philippe Le Mercier Damien Lieberherr Patrick Masson Anne Morgat Venkatesh Muthukrishnan Salvo Paesano Ivo Pedruzzi Sandrine Pilbout Lucille Pourcel Sylvain Poux Monica Pozzato Manuela Pruess Nicole Redaschi Catherine Rivoire Christian Sigrist Karin Sonesson Shyamala Sundaram

Abstract The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set protein sequences annotated functional information. In this publication we describe enhancements made our data processing pipeline website adapt an ever-increasing information content. number in UniProtKB has risen over 227 million are working towards including reference proteome for each taxonomic group. We continue extract detailed annotations from literature...

10.1093/nar/gkac1052 article EN cc-by Nucleic Acids Research 2022-11-21
Elisabeth Coudert Sébastien Géhant Edouard de Castro Monica Pozzato Delphine Baratin and 95 more Teresa Batista Neto Christian Sigrist Nicole Redaschi Alan Bridge Alan Bridge Lucila Aimo Ghislaine Argoud‐Puy Andrea Auchincloss Kristian B. Axelsen Parit Bansal Delphine Baratin Teresa M Batista Neto Marie-Claude Blatter Jerven Bolleman Emmanuel Boutet Lionel Breuza Blanca Cabrera Gil Cristina Casals‐Casas Kamal Chikh Echioukh Elisabeth Coudert Beatrice Cuche Edouard de Castro Anne Estreicher Maria Livia Famiglietti Marc Feuermann Elisabeth Gasteiger Pascale Gaudet Sébastien Géhant Vivienne Baillie Gerritsen Arnaud Gos Nadine Gruaz Chantal Hulo Nevila Hyka‐Nouspikel Florence Jungo Arnaud Kerhornou Philippe Le Mercier Damien Lieberherr Patrick Masson Anne Morgat Venkatesh Muthukrishnan Salvo Paesano Ivo Pedruzzi Sandrine Pilbout Lucille Pourcel Sylvain Poux Monica Pozzato Manuela Pruess Nicole Redaschi Catherine Rivoire Christian Sigrist Karin Sonesson Shyamala Sundaram Alex Bateman María Martin Sandra Orchard Michele Magrane Shadab Ahmad Emanuele Alpi Emily Bowler-Barnett Ramona Britto Hema Bye- A-Jee Austra Cukura Paul Denny Tunca Doğan ThankGod E. Ebenezer Jun Fan Penelope Garmiri Leonardo Jose da Costa Gonzales Emma Hatton-Ellis Abdulrahman Hussein Alexandr Ignatchenko Giuseppe Insana Rizwan Ishtiaq Vishal Joshi Dushyanth Jyothi Swaathi Kandasaamy Antonia Lock Aurélien Luciani Marija Lugaric Jie Luo Yvonne Lussi Alistair MacDougall Fábio Madeira Mahdi Mahmoudy Alok Mishra Katie Moulang Andrew Nightingale Sangya Pundir Guoying Qi Shriya Raj Pedro Raposo Daniel L Rice Rabie Saidi Rafael Santos Elena Speretta

Abstract Motivation To provide high quality, computationally tractable annotation of binding sites for biologically relevant (cognate) ligands in UniProtKB using the chemical ontology ChEBI (Chemical Entities Biological Interest), to better support efforts study and predict functionally interactions between protein sequences structures small molecule ligands. Results We structured data model cognate ligand site annotations performed a complete reannotation all stable unique identifiers from...

10.1093/bioinformatics/btac793 article EN cc-by Bioinformatics 2022-12-08
Alex Bateman María Martin Sandra Orchard Michele Magrane Aduragbemi S. Adesina and 94 more Shadab Ahmad Emily Bowler-Barnett Hema Bye‐A‐Jee David C. J. Carpentier Paul Denny Jun Fan Penelope Garmiri Leonardo Jose da Costa Gonzales Abdulrahman Hussein Alexandr Ignatchenko Giuseppe Insana Rizwan Ishtiaq Vishal Joshi Dushyanth Jyothi Swaathi Kandasaamy Antonia Lock Aurélien Luciani Jie Luo Yvonne Lussi Juan Marín Pedro Raposo Daniel L Rice Rafael Silva Santos Elena Speretta James Stephenson Prabhat Totoo Nidhi Tyagi Nadya Urakova Preethi Vasudev Kate Warner Supun Wijerathne C. Yu Rossana Zaru Alan Bridge Lucila Aimo Ghislaine Argoud‐Puy Andrea Auchincloss Kristian B. Axelsen Parit Bansal Delphine Baratin Teresa M Batista Neto Marie-Claude Blatter Jerven Bolleman Emmanuel Boutet Lionel Breuza Blanca Cabrera Gil Cristina Casals‐Casas Kamal Chikh Echioukh Elisabeth Coudert Beatrice Cuche Edouard de Castro Anne Estreicher Maria Livia Famiglietti Marc Feuermann Elisabeth Gasteiger Pascale Gaudet Sébastien Géhant Vivienne Baillie Gerritsen Arnaud Gos Nadine Gruaz Chantal Hulo Nevila Hyka‐Nouspikel Florence Jungo Arnaud Kerhornou Philippe Le Mercier Damien Lieberherr Patrick Masson Anne Morgat Salvo Paesano Ivo Pedruzzi Sandrine Pilbout Lucille Pourcel Sylvain Poux Monica Pozzato Manuela Pruess Nicole Redaschi Catherine Rivoire Christian Sigrist Karin Sonesson Shyamala Sundaram Anastasia Sveshnikova Cathy Wu Cecilia Arighi Chuming Chen Chuming Chen Hongzhan Huang Kati Laiho Minna Lehväslaiho Peter B. McGarvey Darren A. Natale Karen Ross C R Vinayaka Yuqi Wang Jian Zhang

The aim of the UniProt Knowledgebase (UniProtKB; https://www.uniprot.org/) is to provide users with a comprehensive, high-quality and freely accessible set protein sequences annotated functional information. In this publication, we describe ongoing changes our production pipeline limit available in UniProtKB high-quality, non-redundant reference proteomes. We continue manually curate scientific literature add latest data use machine learning techniques. also encourage community curation...

10.1093/nar/gkae1010 article EN cc-by Nucleic Acids Research 2024-11-18
Leyla García Jerven Bolleman Sébastien Géhant Nicole Redaschi María Martin and 95 more Alex Bateman Michele Magrane María Martin Sandra Orchard Shriya Raj Shadab Ahmad Emanuele Alpi Emily Bowler-Barnett Ramona Britto Borisas Bursteinas Hema Bye‐A‐Jee Tunca Doğan Leyla García Penelope Garmiri George P. Georghiou Leonardo Jose da Costa Gonzales Emma Hatton-Ellis Alexandr Ignatchenko Giuseppe Insana Rizwan Ishtiaq Vishal Joshi Dushyanth Jyothi Jie Luo Yvonne Lussi Alistair MacDougall Mahdi Mahmoudy Andrew Nightingale Carla Oliveira Joseph Onwubiko Vivek Poddar Sangya Pundir Guoying Qi Ahmet Süreyya Rifaioğlu Daniel L Rice Rabie Saidi Elena Speretta E. B. Turner Nidhi Tyagi Preethi Vasudev Vladimir Volynkin Kate Warner Xavier Watkins Rossana Zaru Hermann Zellner Alan Bridge Lionel Breuza Elisabeth Coudert Damien Lieberherr Ivo Pedruzzi Sylvain Poux Manuela Pruess Nicole Redaschi Lucila Aimo Ghislaine Argoud‐Puy Andrea Auchincloss Kristian B. Axelsen Parit Bansal Delphine Baratin Teresa Batista Neto Marie-Claude Blatter Jerven Bolleman Emmanuel Boutet Cristina Casals‐Casas Beatrice Cuche Leyla Jael Castro Anne Estreicher L. Famiglietti Marc Feuermann Elisabeth Gasteiger Sébastien Géhant Vivienne Baillie Gerritsen Arnaud Gos Nadine Gruaz Ursula Hinz Chantal Hulo Nevila Hyka‐Nouspikel Florence Jungo Arnaud Kerhornou Philippe Le Mercier Thierry Lombardot Patrick Masson Anne Morgat Sandrine Pilbout Monica Pozzato Catherine Rivoire Christian Sigrist Shyamala Sundaram Cathy Wu Cecilia Arighi Hongzhan Huang Peter B. McGarvey Darren A. Natale Leslie Arminski Chuming Chen Chuming Chen

UniProt continues to support the ongoing process of making scientific data FAIR. Here we contribute this with a FAIRness assessment our UniProtKB dataset followed by critical reflection on challenges and future directions adoption validation FAIR principles metrics.

10.1038/s41597-019-0180-9 article EN cc-by Scientific Data 2019-09-20

Abstract The ‘canonical’ protein sets distributed by UniProt are widely used for similarity searching, and functional structural annotation. For many investigators, canonical sequences the only version of a examined. However, higher eukaryotes often encode multiple isoforms from single gene. unreviewed (UniProtKB/TrEMBL) sequences, longest sequence in Gene-Centric group is chosen as canonical. This choice can create inconsistencies, selecting >95% identical orthologs with dramatically...

10.1093/nargab/lqae066 article EN cc-by NAR Genomics and Bioinformatics 2024-04-04

The "canonical" protein sets distributed by UniProt are widely used for similarity searching, and functional structural annotation. For many investigators, canonical sequences the only version of a examined. However, higher eukaryotes often encode multiple isoforms from single gene. unreviewed (UniProtKB/TrEMBL) sequences, longest sequence in Gene-Centric group is chosen as canonical. This choice can create inconsistencies, selecting >95% identical orthologs with dramatically different...

10.1101/2024.03.04.583387 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2024-03-06
Giuseppe Insana Alexandr Ignatchenko María Martin Alex Bateman Alex Bateman and 92 more María Martin Sandra Orchard Michele Magrane Shadab Ahmad Emily Bowler-Barnett Hema Bye‐A‐Jee Paul Denny Tunca Doğan ThankGod E. Ebenezer Jun Fan Leonardo Jose da Costa Gonzales Abdulrahman Hussein Alexandr Ignatchenko Giuseppe Insana Rizwan Ishtiaq Vishal Joshi Dushyanth Jyothi Swaathi Kandasaamy Antonia Lock Aurélien Luciani Jie Luo Yvonne Lussi Pedro Raposo Daniel L Rice Rabie Saidi Rafael Santos Elena Speretta James Stephenson Prabhat Totoo Nidhi Tyagi Preethi Vasudev Kate Warner Rossana Zaru Supun Wijerathne Khawaja Talal Ibrahim Minjoon Kim Juan Marín Alan Bridge Lucila Aimo Ghislaine Argoud‐Puy Andrea Auchincloss Kristian B. Axelsen Parit Bansal Delphine Baratin Teresa M Batista Neto Jerven Bolleman Emmanuel Boutet Lionel Breuza Blanca Cabrera Gil Cristina Casals‐Casas Elisabeth Coudert Beatrice Cuche Edouard de Castro Anne Estreicher Maria Livia Famiglietti Marc Feuermann Elisabeth Gasteiger Sébastien Géhant Arnaud Gos Nadine Gruaz Chantal Hulo Nevila Hyka‐Nouspikel Florence Jungo Arnaud Kerhornou Philippe Le Mercier Damien Lieberherr Patrick Masson Anne Morgat Ivo Pedruzzi Sandrine Pilbout Lucille Pourcel Sylvain Poux Monica Pozzato Manuela Pruess Nicole Redaschi Catherine Rivoire Christian Sigrist Shyamala Sundaram Anastasia Sveshnikova Cathy Wu Cecilia Arighi Chuming Chen Chuming Chen Hongzhan Huang Kati Laiho Minna Lehväslaiho Peter B. McGarvey Darren A. Natale Karen Ross C R Vinayaka Yuqi Wang Jian Zhang

Abstract Motivation There now exist thousands of molecular biology databases covering every aspect biological data. This database infrastructure takes significant effort and funding to develop maintain. The creators these need make strong justifications funders prove their impact or importance. are many publication metrics tools available such as Google Scholar measure citation AltMetrics multiple measures including social media coverage. Results In this article, we describe a series novel...

10.1093/bioadv/vbad180 article EN cc-by Bioinformatics Advances 2023-01-01
Coming Soon ...