NFDI4DS | UHH-SEMS - Publication Details

Raúl Fernández-Díaz

ORCID: 0000-0002-7383-6568

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5092249392

Research Areas

Computational Drug Discovery Methods
vaccines and immunoinformatics approaches
Biomedical Text Mining and Ontologies
Machine Learning in Bioinformatics
Receptor Mechanisms and Signaling
Advanced Graph Neural Networks
Genetics, Bioinformatics, and Biomedical Research
Topic Modeling
Metabolomics and Mass Spectrometry Studies
Protein Structure and Dynamics
Fault Detection and Control Systems
Chemical Synthesis and Analysis

University College Dublin
2023-2025

IBM Research - Ireland
2023-2025

Conway School of Landscape Design
2023

How to generalize machine learning models to both canonical and non-canonical peptides

OPENALEX - Publications

Raúl Fernández-Díaz Rodrigo Ochoa Thanh Lam Hoang Vanessa López Denis C. Shields

Bioactive peptides are an important class of natural products with great functional diversity. Chemical modifications can improve their pharmacology, yet structural diversity presents unique challenges for computational modeling. Furthermore, data canonical (non-modified) is more abundant than non-canonical (chemically modified). We explored whether current methods sufficient to generalize from datasets. To do this, we first considered two critical aspects the modeling problem, namely choice...

10.26434/chemrxiv-2025-ggp8n preprint EN cc-by-nc-nd 2025-03-24

Molecular Modelling in Bioactive Peptide Discovery and Characterisation

OPENALEX - Publications

Clement Agoni Raúl Fernández-Díaz Patrick Brendan Timmons Alessandro Adelfio Hansel Gómez and 1 more

Molecular modelling is a vital tool in the discovery and characterisation of bioactive peptides, providing insights into their structural properties interactions with biological targets. Many models predicting peptide function or structure rely on intrinsic properties, including influence amino acid composition, sequence, chain length, which impact stability, folding, aggregation, target interaction. Homology predicts structures based known templates. Peptide–protein can be explored using...

10.3390/biom15040524 article EN cc-by Biomolecules 2025-04-03

AutoPeptideML: A study on how to build more trustworthy peptide bioactivity predictors

OPENALEX - Publications

Raúl Fernández-Díaz Rodrigo Cossio-Pérez Clement Agoni Thanh Lam Hoang Vanessa López and 1 more

Automated machine learning (AutoML) solutions can bridge the gap between new computational advances and their real-world applications by enabling experimental scientists to build own custom models. We examine different steps in development life-cycle of peptide bioactivity binary predictors identify key where automation not only result a more accessible method, but also robust interpretable evaluation leading trustworthy

10.1093/bioinformatics/btae555 article EN cc-by Bioinformatics 2024-09-18

A new framework for evaluating model out-of-distribution for the biochemical domain*

OPENALEX - Publications

Raúl Fernández-Díaz Thanh Lam Hoang Vanessa López Denis C. Shields

Quantifying model generalization to out-of-distribution data has been a longstanding challenge in machine learning. Addressing this issue is crucial for leveraging learning scientific discovery, where models must generalize new molecules or materials. Current methods typically split into train and test sets using various criteria, like temporal, sequence identity, scaffold, random cross-validation; before evaluating performance. However, with so many splitting criteria available, existing...

10.1101/2024.03.14.584508 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-03-16

Otter-Knowledge: benchmarks of multimodal knowledge graph representation learning from different sources for drug discovery

OPENALEX - Publications

Hoang Thanh Lam Marco Luca Sbodio Marcos Martínez Gallindo Mykhaylo Zayats Raúl Fernández-Díaz and 4 more

Recent research on predicting the binding affinity between drug molecules and proteins use representations learned, through unsupervised learning techniques, from large databases of molecule SMILES protein sequences. While these have significantly enhanced predictions, they are usually based a limited set modalities, do not exploit available knowledge about existing relations among proteins. In this study, we demonstrate that by incorporating graphs diverse sources modalities into sequences...

10.48550/arxiv.2306.12802 preprint EN cc-by arXiv (Cornell University) 2023-01-01

AutoPeptideML: A study on how to build more trustworthy peptide bioactivity predictors

OPENALEX - Publications

Raúl Fernández-Díaz Rodrigo Cossio-Pérez Clement Agoni Hoang Thanh Lam Vanessa López and 1 more

Abstract Motivation Automated machine learning (AutoML) solutions can bridge the gap between new computational advances and their real-world applications by enabling experimental scientists to build own custom models. We examine different steps in development life-cycle of peptide bioactivity binary predictors identify key where automation not only result a more accessible method, but also robust interpretable evaluation leading trustworthy Results present automated method for drawing...

10.1101/2023.11.13.566825 preprint EN cc-by-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-11-15

Knowledge Enhanced Representation Learning for Drug Discovery

OPENALEX - Publications

Thanh Lam Hoang Marco Luca Sbodio Marcos Martínez Galindo Mykhaylo Zayats Raúl Fernández-Díaz and 4 more

Recent research on predicting the binding affinity between drug molecules and proteins use representations learned, through unsupervised learning techniques, from large databases of molecule SMILES protein sequences. While these have significantly enhanced predictions, they are usually based a limited set modalities, do not exploit available knowledge about existing relations among proteins. Our study reveals that representations, derived multimodal graphs describing proteins, lead to...

10.1609/aaai.v38i9.28924 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Enhancing foundation models for scientific discovery via multimodal knowledge graph representations

OPENALEX - Publications

Vanessa López Lam Thanh Hoang Marcos Martinez-Galindo Raúl Fernández-Díaz Marco Luca Sbodio and 4 more

10.1016/j.websem.2024.100845 article EN cc-by-nc-nd Journal of Web Semantics 2024-12-01

Coming Soon ...