NFDI4DS | UHH-SEMS - Publication Details

Mohammed AlQuraishi

ORCID: 0000-0001-6817-1322

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5002987125

Research Areas

Protein Structure and Dynamics
Machine Learning in Bioinformatics
Computational Drug Discovery Methods
RNA and protein synthesis mechanisms
Genomics and Phylogenetic Studies
Enzyme Structure and Function
Bioinformatics and Genomic Networks
Machine Learning in Materials Science
SARS-CoV-2 and COVID-19 Research
vaccines and immunoinformatics approaches
Genetics, Bioinformatics, and Biomedical Research
CAR-T cell therapy research
Viral gastroenteritis research and epidemiology
CRISPR and Genetic Engineering
Genomics and Rare Diseases
Animal Virus Infections Studies
Advanced Proteomics Techniques and Applications
DNA and Nucleic Acid Chemistry
Cell Image Analysis Techniques
Advanced MEMS and NEMS Technologies
Nanotechnology research and applications
Nanopore and Nanochannel Transport Studies
Single-cell and spatial transcriptomics
RNA Research and Splicing
Microbial Metabolic Engineering and Bioproduction

Columbia University
2021-2025

Columbia University Irving Medical Center
2021-2024

Harvard University Press
2023

Center for Systems Biology
2015-2021

Harvard University
2014-2021

Stanford University
2011-2012

Unified rational protein engineering with sequence-based deep representation learning

OPENALEX - Publications

Ethan C. Alley Grigory Khimulya Surojit Biswas Mohammed AlQuraishi George M. Church

10.1038/s41592-019-0598-1 article EN Nature Methods 2019-10-21

End-to-End Differentiable Learning of Protein Structure

OPENALEX - Publications

Mohammed AlQuraishi

10.1016/j.cels.2019.03.006 article EN publisher-specific-oa Cell Systems 2019-04-01

Single-sequence protein structure prediction using a language model and deep learning

OPENALEX - Publications

Ratul Chowdhury Nazim Bouatta Surojit Biswas Christina Floristean Anant Kharkar and 7 more

10.1038/s41587-022-01432-w article EN Nature Biotechnology 2022-10-03

OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization

OPENALEX - Publications

Gustaf Ahdritz Nazim Bouatta Christina Floristean Sachin Kadyan Qinghui Xia and 24 more

Abstract AlphaFold2 revolutionized structural biology with the ability to predict protein structures exceptionally high accuracy. Its implementation, however, lacks code and data required train new models. These are necessary (i) tackle tasks, like protein-ligand complex structure prediction, (ii) investigate process by which model learns, remains poorly understood, (iii) assess model’s generalization capacity unseen regions of fold space. Here we report OpenFold, a fast, memory-efficient,...

10.1101/2022.11.20.517210 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2022-11-22

OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization

OPENALEX - Publications

Gustaf Ahdritz Nazim Bouatta Christina Floristean Sachin Kadyan Qinghui Xia and 29 more

AlphaFold2 revolutionized structural biology with the ability to predict protein structures exceptionally high accuracy. Its implementation, however, lacks code and data required train new models. These are necessary (1) tackle tasks, like protein–ligand complex structure prediction, (2) investigate process by which model learns (3) assess model's capacity generalize unseen regions of fold space. Here we report OpenFold, a fast, memory efficient trainable implementation AlphaFold2. We...

10.1038/s41592-024-02272-z article EN cc-by Nature Methods 2024-05-14

How to build the virtual cell with artificial intelligence: Priorities and opportunities

OPENALEX - Publications

Charlotte Bunne Yusuf Roohani Yanay Rosen Ankit Gupta Xikun Zhang and 37 more

Cells are essential to understanding health and disease, yet traditional models fall short of modeling simulating their function behavior. Advances in AI omics offer groundbreaking opportunities create an virtual cell (AIVC), a multi-scale, multi-modal large-neural-network-based model that can represent simulate the behavior molecules, cells, tissues across diverse states. This Perspective provides vision on design how collaborative efforts build AIVCs will transform biological research by...

10.1016/j.cell.2024.11.015 article EN cc-by Cell 2024-12-01

ProteinNet: a standardized data set for machine learning of protein structure

OPENALEX - Publications

Mohammed AlQuraishi

Rapid progress in deep learning has spurred its application to bioinformatics problems including protein structure prediction and design. In classic machine like computer vision, been driven by standardized data sets that facilitate fair assessment of new methods lower the barrier entry for non-domain experts. While sequence exist, they lack certain components critical learning, high-quality multiple alignments insulated training/validation splits account but only weakly detectable homology...

10.1186/s12859-019-2932-0 article EN cc-by BMC Bioinformatics 2019-06-11

Biophysical prediction of protein–peptide interactions and signaling networks using machine learning

OPENALEX - Publications

Joseph M. Cunningham Grigoriy Koytiger Peter K. Sorger Mohammed AlQuraishi

10.1038/s41592-019-0687-1 article EN Nature Methods 2020-01-06

Single-sequence protein structure prediction using language models from deep learning

OPENALEX - Publications

Ratul Chowdhury Nazim Bouatta Surojit Biswas Charlotte Rochereau George M. Church and 2 more

ABSTRACT AlphaFold2 and related systems use deep learning to predict protein structure from co-evolutionary relationships encoded in multiple sequence alignments (MSAs). Despite dramatic, recent increases accuracy, three challenges remain: (i) prediction of orphan rapidly evolving proteins for which an MSA cannot be generated, (ii) rapid exploration designed structures, (iii) understanding the rules governing spontaneous polypeptide folding solution. Here we report development end-to-end...

10.1101/2021.08.02.454840 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2021-08-04

High-throughput deep learning variant effect prediction with Sequence UNET

OPENALEX - Publications

Alistair S. Dunham Pedro Beltrão Mohammed AlQuraishi

Understanding coding mutations is important for many applications in biology and medicine but the vast mutation space makes comprehensive experimental characterisation impossible. Current predictors are often computationally intensive difficult to scale, including recent deep learning models. We introduce Sequence UNET, a highly scalable architecture that classifies predicts variant frequency from sequence alone using multi-scale representations fully convolutional compression/expansion...

10.1186/s13059-023-02948-3 article EN cc-by Genome biology 2023-05-09

Unified rational protein engineering with sequence-based deep representation learning

OPENALEX - Publications

Ethan Alley Grigory Khimulya Surojit Biswas Mohammed AlQuraishi George M. Church

Abstract This protocol describes the computational steps necessary to reproduce results described in paper " Unified rational protein engineering with sequence-only deep representation learning by Alley et al.

10.21203/rs.2.13774/v1 preprint EN cc-by Research Square (Research Square) 2019-11-20

Protein structure prediction by AlphaFold2: are attention and symmetries all you need?

OPENALEX - Publications

Nazim Bouatta Peter K. Sorger Mohammed AlQuraishi

The functions of most proteins result from their 3D structures, but determining structures experimentally remains a challenge, despite steady advances in crystallography, NMR and single-particle cryoEM. Computationally predicting the structure protein its primary sequence has long been grand challenge bioinformatics, intimately connected with understanding chemistry dynamics. Recent deep learning, combined availability genomic data for inferring co-evolutionary patterns, provide new approach...

10.1107/s2059798321007531 article EN cc-by Acta Crystallographica Section D Structural Biology 2021-07-29

Mapping variant effects on anti-tumor hallmarks of primary human T cells with base-editing screens

OPENALEX - Publications

Zachary Walsh Parin Shah Neeharika Kothapalli Shivem B. Shah Gergő Nikolényi and 12 more

10.1038/s41587-024-02235-x article EN Nature Biotechnology 2024-05-23

A multiscale statistical mechanical framework integrates biophysical and genomic data to assemble cancer networks

OPENALEX - Publications

Mohammed AlQuraishi Grigoriy Koytiger Anne Jenney Gavin MacBeath Peter K. Sorger

10.1038/ng.3138 article EN Nature Genetics 2014-11-02

Unified rational protein engineering with sequence-only deep representation learning

OPENALEX - Publications

Ethan C. Alley Grigory Khimulya Surojit Biswas Mohammed AlQuraishi George M. Church

Abstract Rational protein engineering requires a holistic understanding of function. Here, we apply deep learning to unlabelled amino acid sequences distill the fundamental features into statistical representation that is semantically rich and structurally, evolutionarily, biophysically grounded. We show simplest models built on top this uni fied rep resentation (UniRep) are broadly applicable generalize unseen regions sequence space. Our data-driven approach reaches near state-of-the-art or...

10.1101/589333 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2019-03-26

Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds

OPENALEX - Publications

Yeqing Lin Mohammed AlQuraishi

Proteins power a vast array of functional processes in living cells. The capability to create new proteins with designed structures and functions would thus enable the engineering cellular behavior development protein-based therapeutics materials. Structure-based protein design aims find that are designable (can be realized by sequence), novel (have dissimilar geometry from natural proteins), diverse (span wide range geometries). While advances structure prediction have made it possible...

10.48550/arxiv.2301.12485 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Recombination and lineage-specific mutations linked to the emergence of SARS-CoV-2

OPENALEX - Publications

Juan Ángel Patiño-Galindo Ioan Filip Ratul Chowdhury Costas D. Maranas Peter K. Sorger and 2 more

The emergence of SARS-CoV-2 underscores the need to better understand evolutionary processes that drive and adaptation zoonotic viruses in humans. In betacoronavirus genus, which also includes SARS-CoV MERS-CoV, recombination frequently encompasses receptor binding domain (RBD) Spike protein, is responsible for viral host cell receptors. this work, we reconstruct events have accompanied SARS-CoV-2, with a special emphasis on RBD its receptor, human ACE2.By means phylogenetic analyses, found...

10.1186/s13073-021-00943-6 article EN cc-by Genome Medicine 2021-08-06

Structural biology at the scale of proteomes

OPENALEX - Publications

Nazim Bouatta Mohammed AlQuraishi

10.1038/s41594-023-00924-w article EN Nature Structural & Molecular Biology 2023-02-01

Recombination and lineage-specific mutations linked to the emergence of SARS-CoV-2

OPENALEX - Publications

Juan Ángel Patiño-Galindo Ioan Filip Ratul Chowdhury Costas D. Maranas Peter K. Sorger and 2 more

Abstract The emergence of SARS-CoV-2 underscores the need to better understand evolutionary processes that drive and adaptation zoonotic viruses in humans. In betacoronavirus genus, which also includes SARS-CoV MERS-CoV, recombination frequently encompasses Receptor Binding Domain (RBD) Spike protein, which, turn, is responsible for viral binding host cell receptors. Here, we find evidence a event RBD involving ancestral linages both SARS-CoV-2. Although cannot specify recombinant nor...

10.1101/2020.02.10.942748 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2020-02-18

A Hybrid Structure-Based Machine Learning Approach for Predicting Kinase Inhibition by Small Molecules

OPENALEX - Publications

Changchang Liu Peter S. Kutchukian Nhan Duc Nguyen Mohammed AlQuraishi Peter K. Sorger

Kinases have been the focus of drug discovery programs for three decades leading to over 70 therapeutic kinase inhibitors and biophysical affinity measurements 130,000 kinase-compound pairs. Nonetheless, precise target spectrum many kinases remains only partly understood. In this study, we describe a computational approach unlocking qualitative quantitative kinome-wide binding structure-based machine learning. Our study has components: (i) Kinase Inhibitor Complex (KinCo) data set comprising...

10.1021/acs.jcim.3c00347 article EN cc-by-nc-nd Journal of Chemical Information and Modeling 2023-08-18

Coming Soon ...