NFDI4DS | UHH-SEMS - Publication Details

Castrense Savojardo

ORCID: 0000-0002-7359-0633

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5069439283

Research Areas

Machine Learning in Bioinformatics
Genomics and Phylogenetic Studies
RNA and protein synthesis mechanisms
Protein Structure and Dynamics
Genomics and Rare Diseases
Bioinformatics and Genomic Networks
Microbial Metabolic Engineering and Bioproduction
Enzyme Structure and Function
Identification and Quantification in Food
Genetics, Bioinformatics, and Biomedical Research
Genomic variations and chromosomal abnormalities
Genetic diversity and population structure
Biomedical Text Mining and Ontologies
Computational Drug Discovery Methods
CRISPR and Genetic Engineering
Cancer Genomics and Diagnostics
Advanced Proteomics Techniques and Applications
Genetic factors in colorectal cancer
Evolution and Genetic Dynamics
Genetics and Neurodevelopmental Disorders
Glycosylation and Glycoproteins Research
Insect and Arachnid Ecology and Behavior
Metabolism and Genetic Disorders
Plant and animal studies
Lipid Membrane Structure and Behavior

University of Bologna
2016-2025

Biocom
2015

Zambon (Italy)
2013

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

OPENALEX - Publications

Naihui Zhou Yuxiang Jiang Timothy Bergquist Alexandra Lee Balint Z. Kacsoh and 95 more

Abstract Background The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation protein function. Results Here, we report on results third CAFA challenge, CAFA3, that featured expanded analysis over previous rounds, both in terms volume data analyzed types performed. In a novel major new development, predictions assessment goals drove some experimental assays, resulting functional annotations for...

10.1186/s13059-019-1835-8 article EN cc-by Genome biology 2019-11-19

BUSCA: an integrative web server to predict subcellular localization of proteins

OPENALEX - Publications

Castrense Savojardo Pier Luigi Martelli Piero Fariselli Giuseppe Profiti Rita Casadio

Here, we present BUSCA (http://busca.biocomp.unibo.it), a novel web server that integrates different computational tools for predicting protein subcellular localization. combines methods identifying signal and transit peptides (DeepSig TPpred3), GPI-anchors (PredGPI) transmembrane domains (ENSEMBLE3.0 BetAware) with discriminating localization of both globular membrane proteins (BaCelLo, MemLoci SChloro). Outcomes from the are processed integrated annotating eukaryotic bacterial sequences....

10.1093/nar/gky320 article EN cc-by-nc Nucleic Acids Research 2018-04-17

INPS-MD: a web server to predict stability of protein variants from sequence and structure

OPENALEX - Publications

Castrense Savojardo Piero Fariselli Pier Luigi Martelli Rita Casadio

Abstract Motivation: Protein function depends on its structural stability. The effects of single point variations protein stability can elucidate the molecular mechanisms human diseases and help in developing new drugs. Recently, we introduced INPS, a method suited to predict effect from sequence whose performance is competitive with available state-of-the-art tools. Results: In this article, describe INPS-MD (Impact Non synonymous Stability-Multi-Dimension), web server for prediction...

10.1093/bioinformatics/btw192 article EN Bioinformatics 2016-04-10

DOME: recommendations for supervised machine learning validation in biology

OPENALEX - Publications

Ian Walsh Dmytro Fishman Dario García-Gasulla Tiina Titma Gianluca Pollastri and 31 more

10.1038/s41592-021-01205-4 article EN Nature Methods 2021-07-27

Evaluating predictors of kinase activity of STK11 variants identified in primary human non-small cell lung cancers

OPENALEX - Publications

Yile Chen Kyoungyeul Lee Junwoo Woo Dong Wook Kim Changwon Keum and 18 more

Abstract Critical evaluation of computational tools for predicting variant effects is important considering their increased use in disease diagnosis and driving molecular discoveries. In the sixth edition Assessment Genome Interpretation (CAGI) challenge, a dataset 28 STK11 rare variants (27 missense, 1 single amino acid deletion), identified primary non-small cell lung cancer biopsies, was experimentally assayed to characterize methods from four participating teams five publicly available...

10.1007/s00439-025-02726-0 article EN cc-by Human Genetics 2025-02-12

INPS: predicting the impact of non-synonymous variations on protein stability from sequence

OPENALEX - Publications

Piero Fariselli Pier Luigi Martelli Castrense Savojardo Rita Casadio

Abstract Motivation: A tool for reliably predicting the impact of variations on protein stability is extremely important both engineering and understanding effects Mendelian somatic mutations in genome. Next Generation Sequencing studies are constantly increasing number sequences. Given huge disproportion between sequences structures, there a need tools suited to annotate effect starting from sequence without relying structure. Here, we describe INPS, novel approach annotating non-synonymous...

10.1093/bioinformatics/btv291 article EN Bioinformatics 2015-05-07

DeepSig: deep learning improves signal peptide detection in proteins

OPENALEX - Publications

Castrense Savojardo Pier Luigi Martelli Piero Fariselli Rita Casadio

The identification of signal peptides in protein sequences is an important step toward localization and function characterization.Here, we present DeepSig, improved approach for peptide detection cleavage-site prediction based on deep learning methods. Comparative benchmarks performed updated independent dataset proteins show that DeepSig the current best performing method, scoring better than other available state-of-the-art approaches both precise identification.DeepSig as standalone...

10.1093/bioinformatics/btx818 article EN cc-by-nc Bioinformatics 2017-12-20

Solvent Accessibility of Residues Undergoing Pathogenic Variations in Humans: From Protein Structures to Protein Sequences

OPENALEX - Publications

Castrense Savojardo Matteo Manfredi Pier Luigi Martelli Rita Casadio

Solvent accessibility (SASA) is a key feature of proteins for determining their folding and stability. SASA computed from protein structures with different algorithms, sequences machine-learning based approaches trained on solved structures. Here we ask the question as to which extent solvent exposure residues can be associated pathogenicity variation. By this, wild-type residue acquires role in context functional annotation single-residue variations (SRVs). mapping curated database human...

10.3389/fmolb.2020.626363 article EN cc-by Frontiers in Molecular Biosciences 2021-01-07

DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks

OPENALEX - Publications

Castrense Savojardo Niccolò Bruciaferri Giacomo Tartari Pier Luigi Martelli Rita Casadio

The correct localization of proteins in cell compartments is a key issue for their function. Particularly, mitochondrial are physiologically active different and aberrant contributes to the pathogenesis human pathologies. Many computational methods exist assign protein sequences subcellular such as nucleus, cytoplasm organelles. However, substantial lack experimental evidence public sequence databases hampered so far finer grain discrimination, including also intra-organelle compartments.We...

10.1093/bioinformatics/btz512 article EN cc-by Bioinformatics 2019-06-17

Machine learning solutions for predicting protein–protein interactions

OPENALEX - Publications

Rita Casadio Pier Luigi Martelli Castrense Savojardo

Abstract Proteins are “social molecules.” Recent experimental evidence supports the notion that large protein aggregates, known as biomolecular condensates, affect structurally and functionally many biological processes. Condensate formation may be permanent and/or time dependent, suggesting processes can occur locally, depending on cell needs. The question then arises to which extent we monitor protein‐aggregate formation, both experimentally theoretically predict/simulate functional...

10.1002/wcms.1618 article EN cc-by Wiley Interdisciplinary Reviews Computational Molecular Science 2022-03-29

Dispersion of antimicrobial resistant bacteria in pig farms and in the surrounding environment

OPENALEX - Publications

Daniel Scicchitano Daniela Leuzzi Giulia Babbi Giorgia Palladino Silvia Turroni and 11 more

Abstract Background Antimicrobial resistance has been identified as a major threat to global health. The pig food chain is considered an important source of antimicrobial genes (ARGs). However, there still lack knowledge on the dispersion ARGs in production system, including external environment. Results In present study, we longitudinally followed one swine farm located Italy from weaning phase slaughterhouse comprehensively assess diversity ARGs, their diffusion, and bacteria associated...

10.1186/s42523-024-00305-8 article EN cc-by Animal Microbiome 2024-03-30

ISPRED-SEQ: Deep Neural Networks and Embeddings for Predicting Interaction Sites in Protein Sequences

OPENALEX - Publications

Matteo Manfredi Castrense Savojardo Pier Luigi Martelli Rita Casadio

The knowledge of protein–protein interaction sites (PPIs) is crucial for protein functional annotation. Here we address the problem focusing on prediction putative PPIs considering as input sequences. issue important given huge volume sequences compared to experimental and/or computed structures. Taking advantage language models, recently developed, and Deep Neural networks, here describe ISPRED-SEQ, which overpasses state-of-the-art predictors addressing same problem. ISPRED-SEQ freely...

10.1016/j.jmb.2023.167963 article EN cc-by-nc Journal of Molecular Biology 2023-01-13

Alpha&ESMhFolds: A Web Server for Comparing AlphaFold2 and ESMFold Models of the Human Reference Proteome

OPENALEX - Publications

Matteo Manfredi Castrense Savojardo Georgii Iardukhin Davide Salomoni Alessandro Costantini and 2 more

We develop a novel database Alpha&ESMhFolds which allows the direct comparison of AlphaFold2 and ESMFold predicted models for 42,942 proteins Reference Human Proteome, when available, their with 2,900 directly associated PDB structures at least structure to sequence coverage 70%. Statistics indicate that good quality tend overlap TM-score >0.6 as long some structural information is available. As expected, model superimposition highlights are slightly superior ones. However, 55% endowed...

10.1016/j.jmb.2024.168593 article EN cc-by-nc-nd Journal of Molecular Biology 2024-05-06

Critical assessment of missense variant effect predictors on disease-relevant variant data

OPENALEX - Publications

Ruchir Rastogi Ryan Chung Sindy Li Chang Li Kyoungyeul Lee and 31 more

Abstract Regular, systematic, and independent assessment of computational tools used to predict the pathogenicity missense variants is necessary evaluate their clinical research utility suggest directions for future improvement. Here, as part sixth edition Critical Assessment Genome Interpretation (CAGI) challenge, we assess variant effect predictors (or impact predictors) on an evaluation dataset rare from disease-relevant databases. Our evaluates submitted CAGI6 Annotate-All-Missense...

10.1101/2024.06.06.597828 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2024-06-08

Critical assessment of variant prioritization methods for rare disease diagnosis within the rare genomes project

OPENALEX - Publications

Sarah L. Stenton Melanie O’Leary Gabrielle Lemire Grace E. VanNoy Stephanie DiTroia and 69 more

Abstract Background A major obstacle faced by families with rare diseases is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years and causal variants are identified in under 50%, even when capturing genome-wide. To aid the interpretation prioritization of vast number detected, computational methods proliferating. Knowing which tools most effective remains unclear. evaluate performance methods, to encourage innovation method development, we designed Critical...

10.1186/s40246-024-00604-w article EN cc-by Human Genomics 2024-04-29

eDGAR: a database of Disease-Gene Associations with annotated Relationships among genes

OPENALEX - Publications

Giulia Babbi Pier Luigi Martelli Giuseppe Profiti Samuele Bovo Castrense Savojardo and 1 more

Genetic investigations, boosted by modern sequencing techniques, allow dissecting the genetic component of different phenotypic traits. These efforts result in compilation lists genes related to diseases and show that an increasing number is associated with multiple genes. Investigating functional relations among same disease contributes highlighting molecular mechanisms pathogenesis. We present eDGAR, a database collecting organizing data on gene/disease associations as derived from OMIM,...

10.1186/s12864-017-3911-3 article EN cc-by BMC Genomics 2017-08-01

TPpred3 detects and discriminates mitochondrial and chloroplastic targeting peptides in eukaryotic proteins

OPENALEX - Publications

Castrense Savojardo Pier Luigi Martelli Piero Fariselli Rita Casadio

Abstract Motivation: Molecular recognition of N-terminal targeting peptides is the most common mechanism controlling import nuclear-encoded proteins into mitochondria and chloroplasts. When experimental information lacking, computational methods can annotate peptides, determine their cleavage sites for characterizing protein localization, function, mature sequences. The problem discriminating mitochondrial from chloroplastic propeptides particularly relevant when annotating proteomes...

10.1093/bioinformatics/btv367 article EN Bioinformatics 2015-06-16

E-SNPs&GO: embedding of protein sequence and function improves the annotation of human pathogenic variants

OPENALEX - Publications

Matteo Manfredi Castrense Savojardo Pier Luigi Martelli Rita Casadio

The advent of massive DNA sequencing technologies is producing a huge number human single-nucleotide polymorphisms occurring in protein-coding regions and possibly changing their sequences. Discriminating harmful protein variations from neutral ones one the crucial challenges precision medicine. Computational tools based on artificial intelligence provide models for sequence encoding, bypassing database searches evolutionary information. We leverage new encoding schemes an efficient...

10.1093/bioinformatics/btac678 article EN cc-by Bioinformatics 2022-10-10

Discriminating physiological from non‐physiological interfaces in structures of protein complexes: A community‐wide study

OPENALEX - Publications

Hugo Schweke Qifang Xu Gerardo Tauriello Lorenzo Pantolini Torsten Schwede and 41 more

Abstract Reliably scoring and ranking candidate models of protein complexes assigning their oligomeric state from the structure crystal lattice represent outstanding challenges. A community‐wide effort was launched to tackle these The latest resources on interfaces were exploited derive a benchmark dataset consisting 1677 homodimer structures, including balanced mix physiological non‐physiological complexes. in selected bury similar or larger interface area than counterparts, making it more...

10.1002/pmic.202200323 article EN publisher-specific-oa PROTEOMICS 2023-06-27

CoCoNat: a novel method based on deep learning for coiled-coil prediction

OPENALEX - Publications

Giovanni Madeo Castrense Savojardo Matteo Manfredi Pier Luigi Martelli Rita Casadio

Coiled-coil domains (CCD) are widespread in all organisms and perform several crucial functions. Given their relevance, the computational detection of CCD is very important for protein functional annotation. State-of-the-art prediction methods include precise identification boundaries, annotation typical heptad repeat pattern along coiled-coil helices as well oligomerization state.In this article, we describe CoCoNat, a novel method predicting helix residue-level register annotation, state....

10.1093/bioinformatics/btad495 article EN cc-by Bioinformatics 2023-08-01

DDGemb: predicting protein stability change upon single- and multi-point variations with embeddings and deep learning

OPENALEX - Publications

Castrense Savojardo Matteo Manfredi Pier Luigi Martelli Rita Casadio

Abstract Motivation The knowledge of protein stability upon residue variation is an important step for functional design and understanding how variants can promote disease onset. Computational methods are to complement experimental approaches allow a fast screening large datasets variations. Results In this work we present DDGemb, novel method combining language model embeddings transformer architectures predict ΔΔG both single- multi-point DDGemb has been trained on high-quality dataset...

10.1093/bioinformatics/btaf019 article EN cc-by Bioinformatics 2025-01-12

A genome-annotated bacterial collection of the plant food system microbiota

OPENALEX - Publications

Laura Pietrantonio Marion Devers‐Lamrani Peter Thorpe Catherine Arnton Senga Robertson-Albertyn and 5 more

This study reports draft genomes of 30 bacteria representative the plant food system microbiota and isolated from different sources in Italy France. Individual were reconstructed using PacBIO DNA sequencing: taxonomic classification distribution genes involved microbe-environment interactions are reported to facilitate strains' characterization utilization.

10.1128/mra.01221-24 article EN Microbiology Resource Announcements 2025-01-14

AlphaFold2 and ESMFold: a large-scale pairwise model comparison of human enzymes upon Pfam functional annotation

OPENALEX - Publications

Matteo Manfredi Gabriele Vazzana Castrense Savojardo Pier Luigi Martelli Rita Casadio

AlphaFold2 predicts protein structures from structural and functional knowledge. Alternatively, ESMFold does the same adopting language models. Here, we map available Pfam domains on pairs of models human reference proteome computed with both procedures compare mapped regions relevant for annotation. We find that, rather irrespectively global superimposition pairwise models, Pfam-containing overlap a TM-score above 0.8 predicted local distance difference test (pLDDT) which is higher than...

10.1016/j.csbj.2025.01.008 article EN cc-by-nc-nd Computational and Structural Biotechnology Journal 2025-01-01

Assessing the predicted impact of single amino acid substitutions in MAPK proteins for CAGI6 challenges

OPENALEX - Publications

Paola Turina Maria Petrosino Carlos A. Enriquez Sandoval Leonore Novak Alessandra Pasquo and 34 more

10.1007/s00439-024-02724-8 article EN Human Genetics 2025-02-20

Coming Soon ...