NFDI4DS | UHH-SEMS - Publication Details

Improved protein structure refinement guided by deep learning based accuracy estimation

OPENALEX - Publications

Naozumi Hiranuma Hahnbeom Park Minkyung Baek Ivan Anishchenko Justas Dauparas and 1 more

Abstract We develop a deep learning framework (DeepAccNet) that estimates per-residue accuracy and residue-residue distance signed error in protein models and uses these predictions to guide Rosetta protein structure refinement. The network uses 3D convolutions to evaluate local atomic environments followed by 2D convolutions to provide their global contexts and outperforms other methods that similarly predict the accuracy of protein structure models. Overall accuracy predictions for X-ray and cryoEM structures in the PDB correlate with their resolution, and the network should be broadly useful for assessing the accuracy of both...

10.1038/s41467-021-21511-x article EN cc-by Nature Communications 2021-02-26

Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14

Ivan Anishchenko Minkyung Baek Hahnbeom Park Naozumi Hiranuma David E. Kim and 4 more

Abstract The trRosetta structure prediction method employs deep learning to generate predicted residue‐residue distance and orientation distributions from which 3D models are built. We sought to improve the method by incorporating as inputs (in addition to sequence information) both language model embeddings and template information weighted by sequence similarity to the target. We also developed a refinement pipeline that recombines models generated by template‐free and template utilizing versions of trRosetta guided by the DeepAccNet accuracy predictor. Both...

10.1002/prot.26194 article EN cc-by-nc Proteins Structure Function and Bioinformatics 2021-07-31

DeepProfile: Deep learning of cancer molecular profiles for precision medicine

Ayse B. Dincer Safiye Çelik Naozumi Hiranuma Su‐In Lee

Abstract We present the DeepProfile framework, which learns a variational autoencoder (VAE) network from thousands of publicly available gene expression samples and uses this network to encode a low-dimensional representation (LDR) to predict complex disease phenotypes. To our knowledge, DeepProfile is the first attempt to use deep learning to extract a feature representation from a vast quantity of unlabeled (i.e., lacking phenotype information) expression samples that are not incorporated into the prediction problem. We use DeepProfile to predict acute myeloid leukemia patients’ in vitro...

10.1101/278739 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2018-03-08

Modeling SARS‐CoV‐2 proteins in the CASP‐commons experiment

Andriy Kryshtafovych John Moult W.M. Billings Dennis Della Corte Krzysztof Fidelis and 80 more

Abstract Critical Assessment of Structure Prediction (CASP) is an organization aimed at advancing the state of the art in computing protein structure from sequence. In the spring of 2020, CASP launched a community project to compute the structures of the most structurally challenging proteins coded for in the SARS‐CoV‐2 genome. Forty‐seven research groups submitted over 3000 three‐dimensional models and 700 sets of accuracy estimates on 10 proteins. The resulting models were released to the public. CASP community members also worked together to provide...

10.1002/prot.26231 article EN publisher-specific-oa Proteins Structure Function and Bioinformatics 2021-08-31

Improved protein structure refinement guided by deep learning based accuracy estimation

Naozumi Hiranuma Hahnbeom Park Minkyung Baek Ivan Anishchanka Justas Dauparas and 1 more

Abstract We develop a deep learning framework (DeepAccNet) that estimates per-residue accuracy and residue-residue distance signed error in protein models and uses these predictions to guide Rosetta protein structure refinement. The network uses 3D convolutions to evaluate local atomic environments followed by 2D convolutions to provide their global contexts and outperforms other methods that similarly predict the accuracy of protein structure models. Overall accuracy predictions for X-ray and cryoEM structures in the PDB correlate with their resolution, and the network should be broadly useful for assessing the accuracy of both...

10.1101/2020.07.17.209643 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2020-07-19

Sexual ancestors generated an obligate asexual and globally dispersed clone within the model diatom species Thalassiosira pseudonana

Julie A. Koester Chris Berthiaume Naozumi Hiranuma Micaela S. Parker Vaughn Iverson and 3 more

Abstract Sexual reproduction roots the eukaryotic tree of life, although its loss occurs across diverse taxa. Asexual reproduction and clonal lineages persist in these taxa despite theoretical arguments suggesting that individual clones should be evolutionarily short-lived due to limited phenotypic diversity. Here, we present quantitative evidence that an obligate asexual lineage emerged from a sexual population of the marine diatom Thalassiosira pseudonana and rapidly expanded throughout the world’s oceans. Whole genome...

10.1038/s41598-018-28630-4 article EN cc-by Scientific Reports 2018-07-06

AIControl: replacing matched control experiments with machine learning improves ChIP-seq peak identification

Naozumi Hiranuma Scott Lundberg Su‐In Lee

ChIP-seq is a technique to determine binding locations of transcription factors, which remains a central challenge in molecular biology. Current practice is to use a 'control' dataset to remove background signals from an immunoprecipitation (IP) 'target' dataset. We introduce the AIControl framework, which eliminates the need to obtain a control dataset and instead identifies binding peaks by estimating the distributions of background signals from many publicly available control ChIP-seq datasets. We thereby avoid the cost of running control experiments while simultaneously increasing the accuracy of binding location...

10.1093/nar/gkz156 article EN cc-by-nc Nucleic Acids Research 2019-02-28

DeepATAC: A deep-learning method to predict regulatory factor binding activity from ATAC-seq signals

Naozumi Hiranuma Scott Lundberg Su‐In Lee

Abstract Determining the binding locations of regulatory factors, such as transcription factors and histone modifications, is essential to both basic biology research and many clinical applications. Obtaining such genome-wide location maps directly is often invasive and resource-intensive, so it is common to impute binding locations from DNA sequence or measures of chromatin accessibility. We introduce DeepATAC, a deep-learning approach for imputing binding locations that uses chromatin accessibility measured by ATAC-seq. DeepATAC significantly outperforms current...

10.1101/172767 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2017-08-06

CloudControl

Naozumi Hiranuma Scott Lundberg Su‐In Lee

Chromatin immunoprecipitation followed by high throughput sequencing (ChIP-seq) is a widely used method to determine the binding positions of various proteins on the genome in a population of cells. A typical ChIP-seq protocol involves two experiments: one designed to capture target signals ('target' experiment) and the other to capture background noise signals ('control' experiment). A peak calling algorithm then examines the difference between the target experiment data and the control experiment data to determine where the protein of interest binds along the genome. Our approach, named...

10.1145/2975167.2975187 article EN 2016-10-02