Halil Akin

ORCID: 0000-0003-1666-3223
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Machine Learning in Bioinformatics
  • Genomics and Phylogenetic Studies
  • RNA and protein synthesis mechanisms
  • Language and cultural evolution
  • Protein Structure and Dynamics

New York Consortium in Evolutionary Primatology
2025

Recent advances in machine learning have leveraged evolutionary information multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level structure from primary using a large language model. As models sequences are scaled up 15 billion parameters, an atomic-resolution picture emerges the learned representations. This results order-of-magnitude acceleration high-resolution prediction, which enables large-scale structural characterization...

10.1126/science.ade2574 article EN cc-by Science 2023-03-16

Abstract Artificial intelligence has the potential to open insight into structure of proteins at scale evolution. It only recently been possible extend protein prediction two hundred million cataloged proteins. Characterizing structures exponentially growing billions sequences revealed by large gene sequencing experiments would necessitate a break-through in speed folding. Here we show that direct inference from primary sequence using language model enables an order magnitude speed-up high...

10.1101/2022.07.20.500902 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2022-07-21

Abstract More than three billion years of evolution have produced an image biology encoded into the space natural proteins. Here we show that language models trained on tokens generated by can act as evolutionary simulators to generate functional proteins are far away from known We present ESM3, a frontier multimodal generative model reasons over sequence, structure, and function ESM3 follow complex prompts combining its modalities is highly responsive biological alignment. prompted...

10.1101/2024.07.01.600583 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2024-07-02

More than three billion years of evolution have produced an image biology encoded into the space natural proteins. Here we show that language models trained at scale on evolutionary data can generate functional proteins are far away from known We present ESM3, a frontier multimodal generative model reasons over sequence, structure, and function ESM3 follow complex prompts combining its modalities is highly responsive to alignment improve fidelity. prompted fluorescent Among generations...

10.1126/science.ads0018 article EN Science 2025-01-16
Coming Soon ...