Avid M. Afzal

ORCID: 0000-0002-6186-6954
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Computational Drug Discovery Methods
  • Protein Structure and Dynamics
  • Metabolomics and Mass Spectrometry Studies
  • Machine Learning in Materials Science
  • Cell Image Analysis Techniques
  • Receptor Mechanisms and Signaling
  • Animal testing and alternatives
  • Microbial Natural Products and Biosynthesis
  • Bioinformatics and Genomic Networks
  • vaccines and immunoinformatics approaches
  • Drug-Induced Hepatotoxicity and Protection
  • Machine Learning in Bioinformatics
  • Pharmaceutical studies and practices
  • Viral Infectious Diseases and Gene Expression in Insects
  • Pharmacovigilance and Adverse Drug Reactions
  • Bayesian Modeling and Causal Inference
  • Microbial Metabolic Engineering and Bioproduction
  • Statistical Methods in Clinical Trials
  • Protein Degradation and Inhibitors
  • Pharmacogenetics and Drug Metabolism
  • Computational Physics and Python Applications
  • Traditional and Medicinal Uses of Annonaceae
  • Scientific Research and Discoveries
  • Phytochemical compounds biological activities
  • Particle physics theoretical and experimental studies

AstraZeneca (United Kingdom)
2020-2023

University of Cambridge
2013-2020

Unilever (United Kingdom)
2014

University of Glasgow
2014

In silico analyses are increasingly being used to support mode-of-action investigations; however many such approaches do not utilise the large amounts of inactive data held in chemogenomic repositories. The objective this work is concerned with integration bioactivity target prediction orphan compounds produce probability activity and inactivity for a range targets. To end, novel human set was constructed through assimilation over 195 million points deposited ChEMBL PubChem repositories,...

10.1186/s13321-015-0098-y article EN cc-by Journal of Cheminformatics 2015-10-24

The understanding of the mechanism-of-action (MoA) compounds and prediction potential drug targets play an important role in small-molecule discovery. aim this work was to compare chemical cell morphology information for bioactivity prediction. comparison performed using data from ExCAPE database, image (in form CellProfiler features) Cell Painting set (the largest publicly available images with ∼30,000 compound perturbations), extended connectivity fingerprints (ECFPs) multitask Bayesian...

10.1021/acs.jcim.0c00864 article EN Journal of Chemical Information and Modeling 2021-03-04

FAst MEtabolizer (FAME) is a fast and accurate predictor of sites metabolism (SoMs). It based on collection random forest models trained diverse chemical data sets more than 20 000 molecules annotated with their experimentally determined SoMs. Using comprehensive set available data, FAME aims to assess metabolic processes from holistic point view. not limited specific enzyme family or species. Besides global model, dedicated are for human, rat, dog metabolism; prediction phase I II also...

10.1021/ci400503s article EN Journal of Chemical Information and Modeling 2013-11-12

In silico approaches often fail to utilize bioactivity data available for orthologous targets due insufficient evidence highlighting the benefit such an approach. Deeper investigation into orthologue chemical space and its influence toward expanding compound target coverage is necessary improve confidence in this practice.Here we present analysis of ChEMBL PubChem impact on prediction. We highlight number conflicting bioactivities between human orthologues low annotations are overall...

10.1093/bioinformatics/btx525 article EN cc-by Bioinformatics 2017-08-25

According to Cobanoglu et al., it is now widely acknowledged that the single target paradigm (one protein/target, one disease, drug) has been dominant premise in drug development recent past untenable. More often than not, a drug-like compound (ligand) can be promiscuous - interact with more protein. In years, silico prediction methods promiscuity issue generally approached computationally three main ways: ligand-based methods; target-protein-based and integrative schemes. this study we...

10.1186/s13321-015-0071-9 article EN cc-by Journal of Cheminformatics 2015-05-29

Iterative screening has emerged as a promising approach to increase the efficiency of campaigns compared traditional high throughput approaches. By learning from subset compound library, inferences on what compounds screen next can be made by predictive models, resulting in more efficient screening. One way evaluate is consider cost gain associated with finding an active compound. In this work, we introduce conformal predictor coupled gain-cost function aim maximise iterative Using setup...

10.1186/s13321-018-0260-4 article EN cc-by Journal of Cheminformatics 2018-02-21

In the context of bioactivity prediction, question how to calibrate a score produced by machine learning method into probability binding protein target is not yet satisfactorily addressed. this study, we compared performance three such methods, namely, Platt scaling (PS), isotonic regression (IR), and Venn–ABERS predictors (VA), in calibrating prediction scores obtained from ligand–target comprising Naïve Bayes, support vector machines, random forest (RF) algorithms. Calibration quality was...

10.1021/acs.jcim.0c00476 article EN Journal of Chemical Information and Modeling 2020-08-31

Adverse drug reactions (ADRs) are undesired effects of medicines that can harm patients and a significant source attrition in development. ADRs anticipated by routinely screening drugs against secondary pharmacology protein panels. However, there is still lack quantitative information on the links between these off-target proteins reporting humans. Here, we present systematic analysis associations measured predicted vitro bioactivities adverse events (AEs) humans from two sources data: Side...

10.1021/acs.chemrestox.0c00294 article EN Chemical Research in Toxicology 2020-12-22

Abstract Measurements of protein–ligand interactions have reproducibility limits due to experimental errors. Any model based on such assays will consequentially unavoidable errors influencing their performance which should ideally be factored into modelling and output predictions, as the actual standard deviation measurements (σ) or associated comparability activity values between aggregated heterogenous units (i.e., K i versus IC 50 values) during dataset assimilation. However, are usually...

10.1186/s13321-021-00539-7 article EN cc-by Journal of Cheminformatics 2021-08-19

The nest is a protein motif of three consecutive amino acid residues with dihedral angles 1,2-αR αL (RL nests) or 1,2-αL αR (LR nests). Many nests form depression in which an anion δ-negative acceptor atom bound by hydrogen bonds from the main chain NH groups. We have determined extent and nature this bridging database structures using computer program written for purpose. Acceptor anions are pair 40% RL 20% LR nests. Two thirds bridges between groups at Positions 1 3 (N1N3-bridging)-which...

10.1002/prot.24663 article EN Proteins Structure Function and Bioinformatics 2014-08-08

Even though glucuronidations are the most frequent metabolic reactions of conjugation, both in quantitative and qualitative terms, they have rather seldom been investigated using computational approaches. To fill this gap, we used manually collected MetaQSAR reaction database to generate two models for prediction UGT-mediated metabolism, based on molecular descriptors implementing Random Forest algorithm. The first model predicts occurrence was internally validated with a Matthew correlation...

10.1021/acsmedchemlett.8b00603 article EN ACS Medicinal Chemistry Letters 2019-02-12

To improve our ability to extrapolate preclinical toxicity humans, there is a need understand and quantify the concordance of adverse events (AEs) between animal models clinical studies. In present work, we discovered 3011 statistically significant associations AEs caused by drugs reported in PharmaPendium database which 2952 were new toxicities encoded different Medical Dictionary for Regulatory Activities terms across species. find plausible testable candidate off-target drug activities...

10.1021/acs.chemrestox.0c00311 article EN Chemical Research in Toxicology 2020-12-18

Pattern classification methods assign an object to one of several predefined classes/categories based on features extracted from observed attributes the (pattern). When L discriminatory for pattern can be accurately determined, problem presents no difficulty. However, precise identification relevant a algorithm (classifier) able categorize real world patterns without errors is generally infeasible. In this case, often cast as devising classifier that minimizes misclassification rate. One way...

10.1016/j.patrec.2015.06.002 article EN cc-by Pattern Recognition Letters 2015-06-18

Current in vitro models for hepatotoxicity commonly suffer from low detection rates due to incomplete coverage of bioactivity space. Additionally, vivo exposure measures such as Cmax are used screening and unavailable early on. Here we propose a novel rule-based framework extract interpretable biologically meaningful multiconditional associations prioritize end points understand the associated physicochemical conditions. The data this study were derived 673 compounds 361 ToxCast measurements...

10.1021/acs.chemrestox.8b00382 article EN Chemical Research in Toxicology 2019-08-23

In silico protein target deconvolution is frequently used for mechanism-of-action investigations; however existing protocols usually do not predict compound functional effects, such as activation or inhibition, upon binding to their counterparts. This study hence concerned with including effects in prediction. To this end, we assimilated a bioactivity training set 332 targets, comprising 817,239 active data points unknown effect (binding data) and 20,761,260 inactive compounds, along 226,045...

10.3389/fphar.2018.00613 article EN cc-by Frontiers in Pharmacology 2018-06-11

Adverse events resulting from drug therapy can be a cause of withdrawal, reduced and or restricted clinical use, as well major economic burden for society. To increase the safety new drugs, there is need to better understand mechanisms causing adverse events. One way derive mechanistic hypotheses by linking data on with drugs' biological targets. In this study, we have used mining techniques mutual information statistical approaches find associations between reported collected FDA Event...

10.1021/acs.chemrestox.8b00159 article EN Chemical Research in Toxicology 2018-10-17

In the context of bioactivity prediction, question how to calibrate a score produced by machine learning method into reliable probability binding protein target is not yet satisfactorily addressed. this study, we compared performance three such methods, namely Platt Scaling, Isotonic Regression and Venn-ABERS in calibrating prediction scores for ligand-target comprising Naïve Bayes, Support Vector Machines Random Forest algorithms with data available at AstraZeneca (40 million points...

10.26434/chemrxiv.11526132 preprint EN cc-by-nc-nd 2020-01-08

Abstract Predicting the toxicity of a compound preclinically enables better decision making, thereby reducing development costs and increasing patient safety. It is complex issue, but in vitro assays physico-chemical properties compounds can be used to predict clinical toxicity. Neural networks (NNs) are popular predictive tool due their flexibility ability model non-linearities, they prone overfitting therefore not recommended for small data sets. Furthermore, don’t quantify uncertainty...

10.1101/2020.04.28.065532 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2020-04-30

Abstract A proteolysis targeting chimera (PROTAC) is a new technology that marks proteins for degradation in highly specific manner. During screening, PROTAC compounds are tested concentration-response (CR) assays to determine their potency, and parameters such as the half-maximal concentration (DC 50 ) estimated from fitted CR curves. These used rank compounds, with lower DC values indicating greater potency. However, data often exhibit bi-phasic poly-phasic relationships, making standard...

10.1101/2020.11.13.379883 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2020-11-13

In the context of bioactivity prediction, question how to calibrate a score produced by machine learning method into reliable probability binding protein target is not yet satisfactorily addressed. this study, we compared performance three such methods, namely Platt Scaling, Isotonic Regression and Venn-ABERS in calibrating prediction scores for ligand-target comprising Naïve Bayes, Support Vector Machines Random Forest algorithms with data available at AstraZeneca (40 million points...

10.26434/chemrxiv.11526132.v1 preprint EN cc-by-nc-nd 2020-01-08

A proteolysis-targeting chimera (PROTAC) is a new technology that marks proteins for degradation in highly specific manner. During screening, PROTAC compounds are tested concentration-response (CR) assays to determine their potency, and parameters such as the half-maximal concentration (DC50) estimated from fitted CR curves. These used rank compounds, with lower DC50 values indicating greater potency. However, data often exhibit biphasic polyphasic relationships, making standard sigmoidal...

10.1177/24725552211028142 article EN cc-by-nc-nd SLAS DISCOVERY 2021-09-20

In the context of small molecule property prediction, experimental errors are usually a neglected aspect during model generation. The main caveat to binary classification approaches is that they weight minority cases close threshold boundary equivalently in distinguishing between activity classes. For example, pXC50 value 5.1 or 4.9 treated equally important contributing opposing (e.g., 5), even though error may not afford such discriminatory accuracy. This detrimental practice and therefore...

10.26434/chemrxiv.14544291.v1 preprint EN cc-by-nc-nd 2021-05-07
Coming Soon ...