NFDI4DS | UHH-SEMS - Publication Details

Sergey Ovchinnikov

ORCID: 0000-0003-2774-2744

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5084204507

Research Areas

Protein Structure and Dynamics
RNA and protein synthesis mechanisms
Machine Learning in Bioinformatics
Enzyme Structure and Function
Genomics and Phylogenetic Studies
Microbial Metabolic Engineering and Bioproduction
Bioinformatics and Genomic Networks
Bacterial Genetics and Biotechnology
Computational Drug Discovery Methods
Glycosylation and Glycoproteins Research
Photosynthetic Processes and Mechanisms
Protein purification and stability
Microbial Community Ecology and Physiology
Mass Spectrometry Techniques and Applications
Advanced Proteomics Techniques and Applications
Evolution and Genetic Dynamics
Genetic diversity and population structure
Peptidase Inhibition and Analysis
Machine Learning in Materials Science
Supramolecular Self-Assembly in Materials
Monoclonal and Polyclonal Antibodies Research
Bacteriophages and microbial interactions
Modular Robots and Swarm Intelligence
Endoplasmic Reticulum Stress and Disease
Engineering and Environmental Studies

Harvard University Press
2019-2025

Massachusetts Institute of Technology
2024-2025

Harvard University
2018-2024

University of Washington
2013-2023

Center for Systems Biology
2018-2023

Seoul National University
2021

The University of Tokyo
2021

Michigan State University
2021

Max Planck Institute for Biophysical Chemistry
2021

Seattle University
2014-2019

ColabFold: making protein folding accessible to all

OPENALEX - Publications

Milot Mirdita Konstantin Schütze Yoshitaka Moriwaki Lim Heo Sergey Ovchinnikov and 1 more

Abstract ColabFold offers accelerated prediction of protein structures and complexes by combining the fast homology search MMseqs2 with AlphaFold2 or RoseTTAFold. ColabFold’s 40−60-fold faster optimized model utilization enables close to 1,000 per day on a server one graphics processing unit. Coupled Google Colaboratory, becomes free accessible platform for folding. is open-source software available at https://github.com/sokrypton/ColabFold its novel environmental databases are...

10.1038/s41592-022-01488-1 article EN cc-by Nature Methods 2022-05-30

Accurate prediction of protein structures and interactions using a three-track neural network

OPENALEX - Publications

Minkyung Baek Frank DiMaio Ivan Anishchenko Justas Dauparas Sergey Ovchinnikov and 27 more

Deep learning takes on protein folding In 1972, Anfinsen won a Nobel prize for demonstrating connection between protein’s amino acid sequence and its three-dimensional structure. Since 1994, scientists have competed in the biannual Critical Assessment of Structure Prediction (CASP) protein-folding challenge. methods took center stage at CASP14, with DeepMind’s Alphafold2 achieving remarkable accuracy. Baek et al . explored network architectures based DeepMind framework. They used three-track...

10.1126/science.abj8754 article EN Science 2021-07-15

Improved protein structure prediction using predicted interresidue orientations

OPENALEX - Publications

Jianyi Yang Ivan Anishchenko Hahnbeom Park Zhenling Peng Sergey Ovchinnikov and 1 more

The prediction of interresidue contacts and distances from coevolutionary data using deep learning has considerably advanced protein structure prediction. Here, we build on these advances by developing a residual network for predicting orientations, in addition to distances, Rosetta-constrained energy-minimization protocol rapidly accurately generating models guided restraints. In benchmark tests 13th Community-Wide Experiment the Critical Assessment Techniques Protein Structure Prediction...

10.1073/pnas.1914677117 article EN Proceedings of the National Academy of Sciences 2020-01-02

De novo design of protein structure and function with RFdiffusion

OPENALEX - Publications

Joseph L. Watson David Juergens Nathaniel R. Bennett Brian L. Trippe Jason Yim and 23 more

Abstract There has been considerable recent progress in designing new proteins using deep-learning methods 1–9 . Despite this progress, a general framework for protein design that enables solution of wide range challenges, including de novo binder and higher-order symmetric architectures, yet to be described. Diffusion models 10,11 have had success image language generative modelling but limited when applied modelling, probably due the complexity backbone geometry sequence–structure...

10.1038/s41586-023-06415-8 article EN cc-by Nature 2023-07-11

Assessing the utility of coevolution-based residue–residue contact predictions in a sequence- and structure-rich era

OPENALEX - Publications

Hetunandan Kamisetty Sergey Ovchinnikov David Baker

Recently developed methods have shown considerable promise in predicting residue-residue contacts protein 3D structures using evolutionary covariance information. However, these require large numbers of evolutionarily related sequences to robustly assess the extent residue covariation, and larger family, more likely that contact information is unnecessary because a reasonable model can be built based on structure homolog. Here we describe method integrates sequence coevolution structural...

10.1073/pnas.1314045110 article EN Proceedings of the National Academy of Sciences 2013-09-05

Robust and accurate prediction of residue–residue interactions across protein interfaces using evolutionary information

OPENALEX - Publications

Sergey Ovchinnikov Hetunandan Kamisetty David Baker

Do the amino acid sequence identities of residues that make contact across protein interfaces covary during evolution? If so, such covariance could be used to predict contacts and assemble models biological complexes. We find residue pairs identified using a pseudo-likelihood-based method protein-protein in 50S ribosomal unit 28 additional bacterial complexes with known structure are almost always complex, provided number aligned sequences is greater than average length two proteins. use...

10.7554/elife.02030 article EN cc-by eLife 2014-05-01

ColabFold - Making protein folding accessible to all

OPENALEX - Publications

Milot Mirdita Konstantin Schütze Yoshitaka Moriwaki Lim Heo Sergey Ovchinnikov and 1 more

ColabFold offers accelerated protein structure and complex predictions by combining the fast homology search of MMseqs2 with AlphaFold2 or RoseTTAFold. ColabFold’s 40 - 60× faster optimized model use allows predicting close to a thousand structures per day on server one GPU. Coupled Google Colaboratory, becomes free accessible platform for folding. is open-source software available at github.com/sokrypton/ColabFold . Its novel environmental databases are colabfold.mmseqs.com Contact...

10.1101/2021.08.15.456425 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2021-08-15

Protein structure determination using metagenome sequence data

OPENALEX - Publications

Sergey Ovchinnikov Hahnbeom Park Neha Varghese Po‐Ssu Huang Georgios A. Pavlopoulos and 4 more

Filling in the protein fold picture Fewer than a third of 14,849 known families have at least one member with an experimentally determined structure. This leaves more 5000 no structural information. Protein modeling using residue-residue contacts inferred from evolutionary data has been successful unknown structures, but it requires large numbers aligned sequences. Ovchinnikov et al. augmented such sequence alignments metagenome (see Perspective by Söding). They number sequences required to...

10.1126/science.aah4043 article EN Science 2017-01-19

A structural biology community assessment of AlphaFold2 applications

OPENALEX - Publications

Mehmet Akdel Douglas E. V. Pires Eduard Porta‐Pardo Jürgen Jänes Arthur O. Zalevsky and 29 more

Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of cell. Recent developments in computational methods for protein structure predictions have reached accuracy experimentally determined models. Although this has been independently verified, implementation these across structural-biology applications remains to be tested. Here, we evaluate use AlphaFold2 (AF2) study characteristic structural elements; impact missense variants;...

10.1038/s41594-022-00849-w article EN cc-by Nature Structural & Molecular Biology 2022-11-01

Computed structures of core eukaryotic protein complexes

OPENALEX - Publications

Ian R. Humphreys Jimin Pei Minkyung Baek Aditya Krishnakumar Ivan Anishchenko and 25 more

Protein-protein interactions play critical roles in biology, but the structures of many eukaryotic protein complexes are unknown, and there likely not yet identified. We take advantage advances proteome-wide amino acid coevolution analysis deep-learning–based structure modeling to systematically identify build accurate models core within

10.1126/science.abm4805 article EN Science 2021-11-11

De novo protein design by deep network hallucination

OPENALEX - Publications

Ivan Anishchenko Samuel J. Pellock Tamuka M. Chidyausiku Theresa A. Ramelot Sergey Ovchinnikov and 10 more

10.1038/s41586-021-04184-w article EN Nature 2021-12-01

De novo design of a fluorescence-activating β-barrel

OPENALEX - Publications

Jiayi Dou Anastassia A. Vorobieva William Sheffler Lindsey Doyle Hahnbeom Park and 13 more

10.1038/s41586-018-0509-0 article EN Nature 2018-09-01

Scaffolding protein functional sites using deep learning

OPENALEX - Publications

Jue Wang Sidney Lisanza David Juergens Doug Tischer Joseph L. Watson and 19 more

The binding and catalytic functions of proteins are generally mediated by a small number functional residues held in place the overall protein structure. Here, we describe deep learning approaches for scaffolding such sites without needing to prespecify fold or secondary structure scaffold. first approach, "constrained hallucination," optimizes sequences that their predicted structures contain desired site. second "inpainting," starts from site fills additional sequence create viable...

10.1126/science.abn2100 article EN Science 2022-07-21

Protein interaction networks revealed by proteome coevolution

OPENALEX - Publications

Qian Cong Ivan Anishchenko Sergey Ovchinnikov David Baker

Residue-residue coevolution has been observed across a number of protein-protein interfaces, but the extent residue between protein families on whole-proteome scale not systematically studied. We investigate 5.4 million pairs proteins in

10.1126/science.aaw6718 article EN Science 2019-07-12

Large-scale determination of previously unsolved protein structures using evolutionary information

OPENALEX - Publications

Sergey Ovchinnikov Lisa N. Kinch Hahnbeom Park Yuxing Liao Jimin Pei and 4 more

The prediction of the structures proteins without detectable sequence similarity to any protein known structure remains an outstanding scientific challenge. Here we report significant progress in this area. We first describe de novo blind predictions unprecendented accuracy made for two large families recent CASP11 test methods by incorporating residue–residue co-evolution information Rosetta program. then use method generate models 58 121 prokaryotes which three-dimensional are not...

10.7554/elife.09248 article EN cc-by eLife 2015-09-03

Transformer protein language models are unsupervised structure learners

OPENALEX - Publications

Roshan Rao Joshua Meier Tom Sercu Sergey Ovchinnikov Alexander Rives

A bstract Unsupervised contact prediction is central to uncovering physical, structural, and functional constraints for protein structure determination design. For decades, the predominant approach has been infer evolutionary from a set of related sequences. In past year, language models have emerged as potential alternative, but performance fallen short state-of-the-art approaches in bioinformatics. this paper we demonstrate that Transformer attention maps learn contacts unsupervised...

10.1101/2020.12.15.422761 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2020-12-15

Predicting multiple conformations via sequence clustering and AlphaFold2

OPENALEX - Publications

Hannah K. Wayment-Steele Adedolapo Ojoawo Renee Otten Julia M Apitz Warintra Pitsawong and 4 more

AlphaFold2 (ref. 1) has revolutionized structural biology by accurately predicting single structures of proteins. However, a protein's biological function often depends on multiple conformational substates2, and disease-causing point mutations cause population changes within these substates3,4. We demonstrate that clustering multiple-sequence alignment sequence similarity enables to sample alternative states known metamorphic proteins with high confidence. Using this method, named...

10.1038/s41586-023-06832-9 article EN cc-by Nature 2023-11-13

Architectures of Lipid Transport Systems for the Bacterial Outer Membrane

OPENALEX - Publications

Damian C. Ekiert Gira Bhabha Georgia L. Isom Garrett Greenan Sergey Ovchinnikov and 3 more

10.1016/j.cell.2017.03.019 article EN publisher-specific-oa Cell 2017-04-01

Origins of coevolution between residues distant in protein 3D structures

OPENALEX - Publications

Ivan Anishchenko Sergey Ovchinnikov Hetunandan Kamisetty David Baker

Significance Coevolution-derived contact predictions are enabling accurate protein structure modeling. However, coevolving residues not always in contact, and this is a potential source of error such modeling efforts. To investigate the sources errors and, more generally, origins coevolution structures, we provide global overview contributions to “exceptions” general rule that close three-dimensional structures.

10.1073/pnas.1702664114 article EN Proceedings of the National Academy of Sciences 2017-08-07

Cryo-EM structure of the protein-conducting ERAD channel Hrd1 in complex with Hrd3

OPENALEX - Publications

Stefan Schoebel Wei Mi Alexander Stein Sergey Ovchinnikov Ryan E. Pavlovicz and 7 more

10.1038/nature23314 article EN Nature 2017-07-06

Structural basis of ER-associated protein degradation mediated by the Hrd1 ubiquitin ligase complex

OPENALEX - Publications

Xudong Wu Marc Siggel Sergey Ovchinnikov Wei Mi Vladimir Svetlov and 4 more

Misfolded luminal endoplasmic reticulum (ER) proteins undergo ER-associated degradation (ERAD-L): They are retrotranslocated into the cytosol, polyubiquitinated, and degraded by proteasome. ERAD-L is mediated Hrd1 complex (composed of Hrd1, Hrd3, Der1, Usa1, Yos9), but mechanism retrotranslocation remains mysterious. Here, we report a structure active complex, as determined cryo-electron microscopy analysis two subcomplexes. Hrd3 Yos9 jointly create binding site that recognizes glycosylated...

10.1126/science.aaz2449 article EN Science 2020-04-24

Mega-scale experimental analysis of protein folding stability in biology and design

OPENALEX - Publications

Kotaro Tsuboyama Justas Dauparas Jonathan H. Chen Élodie Laine Yasser Mohseni Behbahani and 4 more

Advances in DNA sequencing and machine learning are providing insights into protein sequences structures on an enormous scale1. However, the energetics driving folding invisible these remain largely unknown2. The hidden thermodynamics of can drive disease3,4, shape evolution5-7 guide engineering8-10, new approaches needed to reveal for every sequence structure. Here we present cDNA display proteolysis, a method measuring thermodynamic stability up 900,000 domains one-week experiment. From...

10.1038/s41586-023-06328-6 article EN cc-by Nature 2023-07-19

State-of-the-Art Estimation of Protein Model Accuracy Using AlphaFold

OPENALEX - Publications

James P. Roney Sergey Ovchinnikov

The problem of predicting a protein's 3D structure from its primary amino acid sequence is longstanding challenge in structural biology. Recently, approaches like alphafold have achieved remarkable performance on this task by combining deep learning techniques with coevolutionary data multiple alignments related protein sequences. use information critical to these models' accuracy, and without it their predictive drops considerably. In living cells, however, the fully determined biophysical...

10.1103/physrevlett.129.238101 article EN cc-by Physical Review Letters 2022-11-28

Protein sequence design by conformational landscape optimization

OPENALEX - Publications

Christoffer Norn Basile I. M. Wicky David Juergens Sirui Liu David E. Kim and 95 more

Significance Almost all proteins fold to their lowest free energy state, which is determined by amino acid sequence. Computational protein design has primarily focused on finding sequences that have very low in the target designed structure. However, what most relevant during folding not absolute of folded state but difference between and lowest-lying alternative states. We describe a deep learning approach captures aspects landscape, particular presence structures minima, show it can...

10.1073/pnas.2017228118 article EN cc-by-nc-nd Proceedings of the National Academy of Sciences 2021-03-12

Coming Soon ...