Raúl Fernández-Díaz

ORCID: 0000-0002-7383-6568
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Computational Drug Discovery Methods
  • vaccines and immunoinformatics approaches
  • Biomedical Text Mining and Ontologies
  • Machine Learning in Bioinformatics
  • Receptor Mechanisms and Signaling
  • Advanced Graph Neural Networks
  • Genetics, Bioinformatics, and Biomedical Research
  • Topic Modeling
  • Metabolomics and Mass Spectrometry Studies
  • Protein Structure and Dynamics
  • Fault Detection and Control Systems
  • Chemical Synthesis and Analysis

University College Dublin
2023-2025

IBM Research - Ireland
2023-2025

Conway School of Landscape Design
2023

Bioactive peptides are an important class of natural products with great functional diversity. Chemical modifications can improve their pharmacology, yet structural diversity presents unique challenges for computational modeling. Furthermore, data canonical (non-modified) is more abundant than non-canonical (chemically modified). We explored whether current methods sufficient to generalize from datasets. To do this, we first considered two critical aspects the modeling problem, namely choice...

10.26434/chemrxiv-2025-ggp8n preprint EN cc-by-nc-nd 2025-03-24

Molecular modelling is a vital tool in the discovery and characterisation of bioactive peptides, providing insights into their structural properties interactions with biological targets. Many models predicting peptide function or structure rely on intrinsic properties, including influence amino acid composition, sequence, chain length, which impact stability, folding, aggregation, target interaction. Homology predicts structures based known templates. Peptide–protein can be explored using...

10.3390/biom15040524 article EN cc-by Biomolecules 2025-04-03

Automated machine learning (AutoML) solutions can bridge the gap between new computational advances and their real-world applications by enabling experimental scientists to build own custom models. We examine different steps in development life-cycle of peptide bioactivity binary predictors identify key where automation not only result a more accessible method, but also robust interpretable evaluation leading trustworthy

10.1093/bioinformatics/btae555 article EN cc-by Bioinformatics 2024-09-18

Quantifying model generalization to out-of-distribution data has been a longstanding challenge in machine learning. Addressing this issue is crucial for leveraging learning scientific discovery, where models must generalize new molecules or materials. Current methods typically split into train and test sets using various criteria, like temporal, sequence identity, scaffold, random cross-validation; before evaluating performance. However, with so many splitting criteria available, existing...

10.1101/2024.03.14.584508 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-03-16

Recent research on predicting the binding affinity between drug molecules and proteins use representations learned, through unsupervised learning techniques, from large databases of molecule SMILES protein sequences. While these have significantly enhanced predictions, they are usually based a limited set modalities, do not exploit available knowledge about existing relations among proteins. In this study, we demonstrate that by incorporating graphs diverse sources modalities into sequences...

10.48550/arxiv.2306.12802 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Abstract Motivation Automated machine learning (AutoML) solutions can bridge the gap between new computational advances and their real-world applications by enabling experimental scientists to build own custom models. We examine different steps in development life-cycle of peptide bioactivity binary predictors identify key where automation not only result a more accessible method, but also robust interpretable evaluation leading trustworthy Results present automated method for drawing...

10.1101/2023.11.13.566825 preprint EN cc-by-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-11-15

Recent research on predicting the binding affinity between drug molecules and proteins use representations learned, through unsupervised learning techniques, from large databases of molecule SMILES protein sequences. While these have significantly enhanced predictions, they are usually based a limited set modalities, do not exploit available knowledge about existing relations among proteins. Our study reveals that representations, derived multimodal graphs describing proteins, lead to...

10.1609/aaai.v38i9.28924 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24
Coming Soon ...