NFDI4DS | UHH-SEMS - Publication Details

Esben Jannik Bjerrum

ORCID: 0000-0003-1614-7376

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5046421751

Research Areas

Computational Drug Discovery Methods
Machine Learning in Materials Science
Chemical Synthesis and Analysis
Analytical Chemistry and Chromatography
Innovative Microfluidic and Catalytic Techniques Innovation
Protein Structure and Dynamics
Machine Learning in Bioinformatics
Chemistry and Chemical Engineering
Microbial Natural Products and Biosynthesis
Various Chemistry Research Topics
Genetics, Bioinformatics, and Biomedical Research
Receptor Mechanisms and Signaling
Web Data Mining and Analysis
Neuroscience and Neuropharmacology Research
Advanced Database Systems and Queries
Machine Learning and Algorithms
Natural Language Processing Techniques
Data Management and Algorithms
Spectroscopy and Chemometric Analyses
Semantic Web and Ontologies
Topic Modeling
Bioinformatics and Genomic Networks
RNA and protein synthesis mechanisms
Nitric Oxide and Endothelin Effects
Model-Driven Software Engineering Techniques

Chemi (Italy)
2024

Odyssey Therapeutics (United States)
2022-2023

AstraZeneca (Sweden)
2019-2022

AstraZeneca (Singapore)
2021-2022

AstraZeneca (Brazil)
2019-2020

AstraZeneca (Netherlands)
2020

Rigshospitalet
2017

University of Copenhagen
2003-2016

Randomized SMILES strings improve the quality of molecular generative models

OPENALEX - Publications

Josep Arús‐Pous Simon Johansson Oleksii Prykhodko Esben Jannik Bjerrum Christian Tyrchan and 3 more

Recurrent Neural Networks (RNNs) trained with a set of molecules represented as unique (canonical) SMILES strings, have shown the capacity to create large chemical spaces valid and meaningful structures. Herein we perform an extensive benchmark on models subsets GDB-13 different sizes (1 million, 10,000 1000), variants (canonical, randomized DeepSMILES), two recurrent cell types (LSTM GRU) hyperparameter combinations. To guide benchmarks new metrics were developed that define how well model...

10.1186/s13321-019-0393-0 article EN cc-by Journal of Cheminformatics 2019-11-21

A de novo molecular generation method using latent vector based generative adversarial network

OPENALEX - Publications

Oleksii Prykhodko Simon Johansson Panagiotis-Christos Kotsias Josep Arús‐Pous Esben Jannik Bjerrum and 2 more

Deep learning methods applied to drug discovery have been used generate novel structures. In this study, we propose a new deep architecture, LatentGAN, which combines an autoencoder and generative adversarial neural network for de novo molecular design. We the method in two scenarios: one random drug-like compounds another target-biased compounds. Our results show that works well both cases. Sampled from trained model can largely occupy same chemical space as training set also substantial...

10.1186/s13321-019-0397-9 article EN cc-by Journal of Cheminformatics 2019-12-01

SMILES Enumeration as Data Augmentation for Neural Network Modeling of Molecules

OPENALEX - Publications

Esben Jannik Bjerrum

Simplified Molecular Input Line Entry System (SMILES) is a single line text representation of unique molecule. One molecule can however have multiple SMILES strings, which reason that canonical been defined, ensures one to correspondence between string and Here the fact represent same explored as technique for data augmentation molecular QSAR dataset modeled by long short term memory (LSTM) cell based neural network. The augmented was 130 times bigger than original. network trained with...

10.48550/arxiv.1703.07076 preprint EN other-oa arXiv (Cornell University) 2017-01-01

AiZynthFinder: a fast, robust and flexible open-source software for retrosynthetic planning

OPENALEX - Publications

Samuel Genheden Amol Thakkar Veronika Chadimová Jean‐Louis Reymond Ola Engkvist and 1 more

We present the open-source AiZynthFinder software that can be readily used in retrosynthetic planning. The algorithm is based on a Monte Carlo tree search recursively breaks down molecule to purchasable precursors. guided by an artificial neural network policy suggests possible precursors utilizing library of known reaction templates. fast and typically find solution less than 10 s perform complete 1 min. Moreover, development code was range engineering principles such as automatic testing,...

10.1186/s13321-020-00472-1 article EN cc-by Journal of Cheminformatics 2020-11-17

Direct steering of de novo molecular generation with descriptor conditional recurrent neural networks

OPENALEX - Publications

Panagiotis-Christos Kotsias Josep Arús‐Pous Hongming Chen Ola Engkvist Christian Tyrchan and 1 more

10.1038/s42256-020-0174-5 article EN Nature Machine Intelligence 2020-05-18

Improving Chemical Autoencoder Latent Space and Molecular De Novo Generation Diversity with Heteroencoders

OPENALEX - Publications

Esben Jannik Bjerrum Boris Sattarov

Chemical autoencoders are attractive models as they combine chemical space navigation with possibilities for de novo molecule generation in areas of interest. This enables them to produce focused libraries around a single lead compound employment early drug discovery project. Here, it is shown that the choice representation, such strings from simplified molecular-input line-entry system (SMILES), has large influence on properties latent space. It further explored what extent translating...

10.3390/biom8040131 article EN cc-by Biomolecules 2018-10-30

SMILES-based deep generative scaffold decorator for de-novo drug design

OPENALEX - Publications

Josep Arús‐Pous Atanas Patronov Esben Jannik Bjerrum Christian Tyrchan Jean‐Louis Reymond and 2 more

Molecular generative models trained with small sets of molecules represented as SMILES strings can generate large regions the chemical space. Unfortunately, due to sequential nature strings, these are not able given a scaffold (i.e., partially-built explicit attachment points). Herein we report new SMILES-based molecular architecture that generates from scaffolds and be any arbitrary set. This approach is possible thanks set pre-processing algorithm exhaustively slices all combinations...

10.1186/s13321-020-00441-8 article EN cc-by Journal of Cheminformatics 2020-05-29

Chemformer: a pre-trained transformer for computational chemistry

OPENALEX - Publications

Ross Irwin Spyridon Dimitriadis Jiazhen He Esben Jannik Bjerrum

Abstract Transformer models coupled with a simplified molecular line entry system (SMILES) have recently proven to be powerful combination for solving challenges in cheminformatics. These models, however, are often developed specifically single application and can very resource-intensive train. In this work we present the Chemformer model—a Transformer-based model which quickly applied both sequence-to-sequence discriminative cheminformatics tasks. Additionally, show that self-supervised...

10.1088/2632-2153/ac3ffb article EN cc-by Machine Learning Science and Technology 2021-12-07

Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning

OPENALEX - Publications

Amol Thakkar Veronika Chadimová Esben Jannik Bjerrum Ola Engkvist Jean‐Louis Reymond

The retrosynthetic accessibility score (RAscore) is based on AI driven planning, and useful for rapid scoring of synthetic feasability pre-screening large datasets virtual/generated molecules.

10.1039/d0sc05401a article EN cc-by-nc Chemical Science 2021-01-01

De Novo Molecular Design by Combining Deep Autoencoder Recurrent Neural Networks with Generative Topographic Mapping

OPENALEX - Publications

Boris Sattarov Igor I. Baskin Dragos Horvath Gilles Marcou Esben Jannik Bjerrum and 1 more

Here we show that Generative Topographic Mapping (GTM) can be used to explore the latent space of SMILES-based autoencoders and generate focused molecular libraries interest. We have built a sequence-to-sequence neural network with Bidirectional Long Short-Term Memory layers trained it on SMILES strings from ChEMBL23. Very high reconstruction rates test set molecules were achieved (>98%), which are comparable ones reported in related publications. Using GTM, visualized autoencoder...

10.1021/acs.jcim.8b00751 article EN Journal of Chemical Information and Modeling 2019-02-20

Datasets and their influence on the development of computer assisted synthesis planning tools in the pharmaceutical domain

OPENALEX - Publications

Amol Thakkar Thierry Kogej Jean‐Louis Reymond Ola Engkvist Esben Jannik Bjerrum

Computer Assisted Synthesis Planning (CASP), datasets and their performance.

10.1039/c9sc04944d article EN cc-by Chemical Science 2019-11-05

Graph networks for molecular design

OPENALEX - Publications

Rocío Mercado Tobias Rastemo Edvard Lindelöf Günter Klambauer Ola Engkvist and 2 more

Abstract Deep learning methods applied to chemistry can be used accelerate the discovery of new molecules. This work introduces GraphINVENT, a platform developed for graph-based molecular design using graph neural networks (GNNs). GraphINVENT uses tiered deep network architecture probabilistically generate molecules single bond at time. All models implemented in quickly learn build resembling training set without any explicit programming chemical rules. The have been benchmarked MOSES...

10.1088/2632-2153/abcf91 article EN cc-by Machine Learning Science and Technology 2020-12-01

Data Augmentation of Spectral Data for Convolutional Neural Network (CNN) Based Deep Chemometrics

OPENALEX - Publications

Esben Jannik Bjerrum Mads Glahder Thomas Skov

Deep learning methods are used on spectroscopic data to predict drug content in tablets from near infrared (NIR) spectra. Using convolutional neural networks (CNNs), features ex- tracted the data. Extended multiplicative scatter correction (EMSC) and a novel spectral augmentation method benchmarked as preprocessing steps. The learned models perform better or par with hypothetical optimal partial least squares (PLS) for all combinations of preprocessing. Data subsequent EMSC combination gave...

10.48550/arxiv.1710.01927 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Molecular optimization by capturing chemist’s intuition using deep neural networks

OPENALEX - Publications

Jiazhen He huifang you Emil Sandström Eva Nittinger Esben Jannik Bjerrum and 3 more

Abstract A main challenge in drug discovery is finding molecules with a desirable balance of multiple properties. Here, we focus on the task molecular optimization, where goal to optimize given starting molecule towards This can be framed as machine translation problem natural language processing, our case, translated into optimized properties based SMILES representation. Typically, chemists would use their intuition suggest chemical transformations for being optimized. widely used strategy...

10.1186/s13321-021-00497-0 article EN cc-by Journal of Cheminformatics 2021-03-20

Transformer-based molecular optimization beyond matched molecular pairs

OPENALEX - Publications

Jiazhen He Eva Nittinger Christian Tyrchan Werngard Czechtizky Atanas Patronov and 2 more

Molecular optimization aims to improve the drug profile of a starting molecule. It is fundamental problem in discovery but challenging due (i) requirement simultaneous multiple properties and (ii) large chemical space explore. Recently, deep learning methods have been proposed solve this task by mimicking chemist's intuition terms matched molecular pairs (MMPs). Although MMPs widely used strategy medicinal chemists, it offers limited capability exploring structural modifications, therefore...

10.1186/s13321-022-00599-3 article EN cc-by Journal of Cheminformatics 2022-03-28

Molecular Generation with Recurrent Neural Networks (RNNs)

OPENALEX - Publications

Esben Jannik Bjerrum Richard Threlfall

The potential number of drug like small molecules is estimated to be between 10^23 and 10^60 while current databases known compounds are orders magnitude smaller with approximately 10^8 compounds. This discrepancy has led an interest in generating virtual libraries using hand crafted chemical rules fragment based methods cover a larger area space generate for use silico discovery endeavors. Here it explored what extent recurrent neural network long short term memory cells can figure out...

10.48550/arxiv.1705.04612 preprint EN other-oa arXiv (Cornell University) 2017-01-01

AiZynthFinder: A Fast Robust and Flexible Open-Source Software for Retrosynthetic Planning

OPENALEX - Publications

Samuel Genheden Amol Thakkar Veronika Chadimová Jean‐Louis Reymond Ola Engkvist and 1 more

We present the open-source AiZynthFinder software that can be readily used in retrosynthetic planning. The algorithm is based on a Monte Carlo tree search recursively breaks down molecule to purchasable precursors. guided by an artificial neural network policy suggests possible precursors utilizing library of known reaction templates. fast and typically find solution less than 10 seconds perform complete 1 minute. Moreover, writing code was range engineering principles such as automatic...

10.26434/chemrxiv.12465371 preprint EN cc-by-nc-nd 2020-06-15

PaRoutes: towards a framework for benchmarking retrosynthesis route predictions

OPENALEX - Publications

Samuel Genheden Esben Jannik Bjerrum

PaRoutes is a framework benchmarking multi-step retrosynthesis methods. It consists of synthetic routes extracted from the patent literature, stock compounds, as well scripts to compute route quality and diversity metrics.

10.1039/d2dd00015f article EN cc-by-nc Digital Discovery 2022-01-01

Randomized SMILES Strings Improve the Quality of Molecular Generative Models

OPENALEX - Publications

Josep Arús‐Pous Simon Johansson Oleksii Prykhodko Esben Jannik Bjerrum Christian Tyrchan and 3 more

Recurrent Neural Networks (RNNs) trained with a set of molecules represented as unique (canonical) SMILES strings, have shown the capacity to create large chemical spaces valid and meaningful structures. Herein we perform an extensive benchmark on models subsets GDB-13 different sizes (1 million , 10,000 1,000), variants (canonical, randomized DeepSMILES), two recurrent cell types (LSTM GRU) hyperparameter combinations. To guide benchmarks new metrics were developed that define generated...

10.26434/chemrxiv.8639942.v1 preprint EN 2019-07-05

A De Novo Molecular Generation Method Using Latent Vector Based Generative Adversarial Network

OPENALEX - Publications

Oleksii Prykhodko Simon Johansson Panagiotis-Christos Kotsias Josep Arús‐Pous Esben Jannik Bjerrum and 2 more

<p> </p><p>Deep learning methods applied to drug discovery have been used generate novel structures. In this study, we propose a new deep architecture, LatentGAN, which combines an autoencoder and generative adversarial neural network for de novo molecular design. We the method in two scenarios: one random drug-like compounds another target-biased compounds. Our results show that works well both cases: sampled from trained model can largely occupy same chemical space as...

10.26434/chemrxiv.8299544 preprint EN cc-by-nc 2019-06-20

Chemformer: A Pre-Trained Transformer for Computational Chemistry

OPENALEX - Publications

Ross Irwin Spyridon Dimitriadis Jiazhen He Esben Jannik Bjerrum

Transformer models coupled with Simplified Molecular Line Entry System (SMILES) have recently proven to be a powerful combination for solving challenges in cheminformatics. These models, however, are often developed specifically single application and can very resource-intensive train. In this work we present Chemformer model – Transformerbased which quickly applied both sequence-to-sequence discriminative cheminformatics tasks. Additionally, show that self-supervised pre-training improve...

10.26434/chemrxiv-2021-v2pnn preprint EN cc-by 2021-07-15

Human-in-the-loop assisted de novo molecular design

OPENALEX - Publications

Iiris Sundin Alexey Voronov Haoping Xiao Kostas Papadopoulos Esben Jannik Bjerrum and 4 more

A de novo molecular design workflow can be used together with technologies such as reinforcement learning to navigate the chemical space. bottleneck in that remains solved is how integrate human feedback exploration of space optimize molecules. drug designer still needs goal, expressed a scoring function for molecules captures designer's implicit knowledge about optimization task. Little support this task exists and, consequently, chemist usually resorts iteratively building objective...

10.1186/s13321-022-00667-8 article EN cc-by Journal of Cheminformatics 2022-12-28

Coming Soon ...