Chloé‐Agathe Azencott

ORCID: 0000-0003-1003-301X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Chromosomal and Genetic Variations
  • Gene expression and cancer classification
  • Genomic variations and chromosomal abnormalities
  • Bioinformatics and Genomic Networks
  • Genetic Associations and Epidemiology
  • Computational Drug Discovery Methods
  • CRISPR and Genetic Engineering
  • Genetic Mapping and Diversity in Plants and Animals
  • Genetic and phenotypic traits in livestock
  • Machine Learning in Materials Science
  • Protein Structure and Dynamics
  • Genetics, Bioinformatics, and Biomedical Research
  • Machine Learning in Healthcare
  • Global Cancer Incidence and Screening
  • Machine Learning in Bioinformatics
  • Biomedical Text Mining and Ontologies
  • Health Systems, Economic Evaluations, Quality of Life
  • Metabolomics and Mass Spectrometry Studies
  • Statistical Methods and Inference
  • Data Quality and Management
  • Cell Image Analysis Techniques
  • Molecular Biology Techniques and Applications
  • vaccines and immunoinformatics approaches
  • Single-cell and spatial transcriptomics
  • RNA modifications and cancer

Université Paris Sciences et Lettres
2015-2024

Institut Curie
2015-2024

Inserm
2015-2024

ParisTech
2018-2024

École Nationale Supérieure des Mines de Paris
2015-2023

Cancer et génome: Bioinformatique, biostatistiques et épidémiologie des systèmes complexes
2017-2023

Génomique Bioinformatique et Applications
2017-2023

Centre de Biologie du Développement
2017

Max Planck Institute for Intelligent Systems
2012-2015

Max Planck Institute for Developmental Biology
2015

Prioritizing missense variants for further experimental investigation is a key challenge in current sequencing studies exploring complex and Mendelian diseases. A large number of silico tools have been employed the task pathogenicity prediction, including PolyPhen-2, SIFT, FatHMM, MutationTaster-2, MutationAssessor, Combined Annotation Dependent Depletion, LRT, phyloP, GERP++, as well optimized methods combining tool scores, such Condel Logit. Due to wealth these methods, an important...

10.1002/humu.22768 article EN Human Mutation 2015-02-14

Abstract Background Variability in datasets is not only the product of biological processes: they are also technical biases. ComBat and ComBat-Seq among most widely used tools for correcting those biases, called batch effects, in, respectively, microarray RNA-Seq expression data. Results In this note, we present a new Python implementation ComBat-Seq. While mathematical framework strictly same, show here that our implementations: (i) have similar results terms effects correction; (ii) as...

10.1186/s12859-023-05578-5 article EN cc-by BMC Bioinformatics 2023-12-07

Being able to predict the course of arbitrary chemical reactions is essential theory and applications organic chemistry. Approaches reaction prediction problems can be organized around three poles corresponding to: (1) physical laws; (2) rule-based expert systems; (3) inductive machine learning. Previous approaches at these poles, respectively, are not high throughput, generalizable or scalable, lack sufficient data structure implemented. We propose a new approach utilizing elements from...

10.1021/ci200207y article EN Journal of Chemical Information and Modeling 2011-08-07
Federica Eduati Lara M. Mangravite Tao Wang Jing Tang J Christopher Bare and 95 more Rui Huang Thea Norman Mike Kellen Michael P. Menden Yang Yang Xiaowei Zhan Rui Zhong Guanghua Xiao Menghang Xia Nour Abdo Oksana Kosyk Stephen Friend Gustavo Stolovitzky Allen Dearry Raymond R. Tice Anton Simeonov Ivan Rusyn Fred A. Wright Yang Xie Salvatore Alaimo Alicia Amadoz Muhammad Ammad-ud-din Chloé‐Agathe Azencott Jaume Bacardit Pelham Barron Elsa Bernard Andreas Beyer Bin Shao Alena van Bömmel Karsten Borgwardt April M. Brys Brian E. Caffrey Jeffrey Chang Jungsoo Chang Eleni Christodoulou Mathieu Clément‐Ziza Trevor Cohen Marianne Cowherd Sofie Demeyer Joaquı́n Dopazo Joel D Elhard André O. Falcão Alfredo Ferro David A. Friedenberg Rosalba Giugno Yunguo Gong Jenni Gorospe Courtney A. Granville Dominik G. Grimm Matthias Heinig Rosa Hernansaiz-Ballesteros Sepp Hochreiter Hua Huang Matthew R. Huska Yunlong Jiao Günter Klambauer Michael Kuhn Miron B. Kursa Rintu Kutum Nicola Lazzarini Inhan Lee Michael K. K. Leung Weng Khong Lim C. Liu Felipe Llinares López Alessandro Mammana Andreas Mayr Tom Michoel Misael Mongiovı̀ Jonathan D. Moore R. Narasimhan Stephen O. Opiyo Gaurav Pandey Andrea L. Peabody Juliane Perner Alfredo Pulvirenti Konrad Rawlik Susanne Reinhardt Carol G Riffle Douglas M. Ruderfer Aaron Sander Richard S. Savage Erwan Scornet Patricia Sebastián-León Roded Sharan Carl Johann Simon-Gabriel Véronique Stoven Jingchun Sun Ana Lúcia Teixeira Albert Tenesa Jean‐Philippe Vert Martin Vingron Thomas Walter Sean Whalen Zofia Wiśniewska

When it becomes completely possible for one to computationally forecast the impacts of harmful substances on humans, would be easier attempt addressing shortcomings existing safety testing chemicals. In this paper, we relay outcomes a community-facing DREAM contest prognosticate nature environment-based compounds, considering their likelihood have disadvantageous health-related effects human populace. Our research quantified cytotoxicity levels in 156 compounds across 884 lymphoblastic lines...

10.18034/ajhal.v4i2.577 article EN cc-by-nc Asian Journal of Humanity Art and Literature 2017-12-31

Between 30% and 70% of patients with breast cancer have pre-existing chronic conditions, more than half are on long-term non-cancer medication at the time diagnosis. Preliminary epidemiological evidence suggests that some medications may affect risk, recurrence, survival. In this nationwide cohort study, we assessed association between use diagnosis We included 235,368 French women newly diagnosed non-metastatic cancer. analyzes 288 medications, identified eight positively associated either...

10.1038/s41467-024-47002-3 article EN cc-by Nature Communications 2024-04-05

Abstract Motivation: The performance of classifiers is often assessed using Receiver Operating Characteristic ROC [or (AC) accumulation curve or enrichment curve] curves and the corresponding areas under (AUCs). However, in many fundamental problems ranging from information retrieval to drug discovery, only very top ranked list predictions any interest ROCs AUCs are not useful. New metrics, visualizations optimization tools needed address this ‘early retrieval’ problem. Results: To early...

10.1093/bioinformatics/btq140 article EN Bioinformatics 2010-04-07
Solveig K. Sieberts Fan Zhu Javier Garcı́a-Garcı́a Eli A. Stahl Abhishek Pratap and 95 more Gaurav Pandey Dimitrios A. Pappas Daniel Aguilar Bernat Anton Jaume Bonet Ridvan Eksi Oriol Fornés Emre Güney Hongdong Li Manuel Alejandro Marín-López Bharat Panwar Joan Planas-Iglesias Daniel Poglayen Jing Cui André O. Falcão Christine Suver Bruce Hoff Venkat S. K. Balagurusamy Donna Dillenberger Elias Chaibub Neto Thea Norman Tero Aittokallio Muhammad Ammad-ud-din Chloé‐Agathe Azencott Víctor Bellón Valentina Boeva Kerstin Bunte Himanshu Chheda Lu Cheng Jukka Corander Michel Dumontier Anna Goldenberg Peddinti Gopalacharyulu Mohsen Hajiloo Daniel Hidru Alok Jaiswal Samuel Kaski Beyrem Khalfaoui Suleiman A. Khan Eric R. Kramer Pekka Marttinen Aziz M. Mezlini Bhuvan Molparia Matti Pirinen Janna Saarela Matthias Samwald Véronique Stoven Hao Tang Jing Tang Ali Torkamani Jean-Phillipe Vert Bo Wang Tao Wang Krister Wennerberg Nathan E. Wineinger Guanghua Xiao Yang Xie Rae S. M. Yeung Xiaowei Zhan Cheng Zhao Manuel Calaza Haitham Elmarakeby Lenwood S. Heath Quan Long Jonathan D. Moore Stephen O. Opiyo Richard S. Savage Jun Zhu Jeff Greenberg Joel Kremer Kaleb Michaud Anne Barton Marieke J. H. Coenen Xavier Mariette Corinne Miceli‐Richard Nancy A. Shadick Michael E. Weinblatt Niek de Vries Paul P. Tak Daniëlle M. Gerlag T. Huizinga Fina Kurreeman Cornelia F Allaart S. Louis Bridges Lindsey A. Criswell Larry W. Moreland Lars Klareskog Saedís Saevarsdóttir Leonid Padyukov Peter K. Gregersen Stephen Friend Robert Plenge Gustavo Stolovitzky Baldo Oliva Yuanfang Guan

Rheumatoid arthritis (RA) affects millions world-wide. While anti-TNF treatment is widely used to reduce disease progression, fails in ∼one-third of patients. No biomarker currently exists that identifies non-responders before treatment. A rigorous community-based assessment the utility SNP data for predicting efficacy RA patients was performed context a DREAM Challenge (http://www.synapse.org/RA_Challenge). An open challenge framework enabled comparative evaluation predictions developed by...

10.1038/ncomms12460 article EN cc-by Nature Communications 2016-08-23

Abstract Motivation Finding non-linear relationships between biomolecules and a biological outcome is computationally expensive statistically challenging. Existing methods have important drawbacks, including among others lack of parsimony, non-convexity computational overhead. Here we propose block HSIC Lasso, feature selector that does not present the previous drawbacks. Results We compare Lasso to other state-of-the-art selection techniques in both synthetic real data, experiments over...

10.1093/bioinformatics/btz333 article EN cc-by-nc Bioinformatics 2019-05-09

Abstract Motivation: As an increasing number of genome-wide association studies reveal the limitations attempt to explain phenotypic heritability by single genetic loci, there is a recent focus on associating complex phenotypes with sets loci. Although several methods for multi-locus mapping have been proposed, it often unclear how relate detected loci growing knowledge about gene pathways and networks. The few that take biological or networks into account are either restricted investigating...

10.1093/bioinformatics/btt238 article EN cc-by-nc Bioinformatics 2013-06-19

Many chemoinformatics applications, including high-throughput virtual screening, benefit from being able to rapidly predict the physical, chemical, and biological properties of small molecules screen large repositories identify suitable candidates. When training sets are available, machine learning methods provide an effective alternative ab initio for these predictions. Here, we leverage rich molecular representations 1D SMILES strings, 2D graphs bonds, 3D coordinates derive efficient...

10.1021/ci600397p article EN Journal of Chemical Information and Modeling 2007-03-06

Abstract Background Variability in datasets is not only the product of biological processes: they are also technical biases. ComBat and ComBat-Seq among most widely used tools for correcting those biases, called batch effects, in, respectively, microarray RNA-Seq expression data. Results In this note, we present a new Python implementation ComBat-Seq. While mathematical framework strictly same, show here that our implementations: ( i ) have similar results terms effects correction; ii as...

10.1101/2020.03.17.995431 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2020-03-18

Given activity training data from high-throughput screening (HTS) experiments, virtual (vHTS) methods aim to predict in silico the of untested chemicals. We present a novel method, Influence Relevance Voter (IRV), specifically tailored for vHTS task. The IRV is low-parameter neural network which refines k-nearest neighbor classifier by nonlinearly combining influences chemical's neighbors set. Influences are decomposed, also nonlinearly, into relevance component and vote component....

10.1021/ci8004379 article EN Journal of Chemical Information and Modeling 2009-03-26
Coming Soon ...