Kengo Sato

ORCID: 0000-0001-6744-7390
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • RNA and protein synthesis mechanisms
  • Genomics and Phylogenetic Studies
  • RNA modifications and cancer
  • RNA Research and Splicing
  • Machine Learning in Bioinformatics
  • Natural Language Processing Techniques
  • Cancer-related molecular mechanisms research
  • Topic Modeling
  • Genetic diversity and population structure
  • CRISPR and Genetic Engineering
  • Chromosomal and Genetic Variations
  • Computational Drug Discovery Methods
  • Gene expression and cancer classification
  • Advanced Steganography and Watermarking Techniques
  • Evolution and Genetic Dynamics
  • Metal Forming Simulation Techniques
  • Viral Infections and Immunology Research
  • Genomics and Chromatin Dynamics
  • Speech and dialogue systems
  • Molecular Biology Techniques and Applications
  • Bacteriophages and microbial interactions
  • Microbial Metabolic Engineering and Bioproduction
  • High-Velocity Impact and Material Behavior
  • Text and Document Classification Technologies
  • Protein Structure and Dynamics

Jichi Medical University
2024

Tokyo Denki University
2022-2023

Keio University
2012-2021

Tokushima University
2013-2014

Nara Institute of Science and Technology
2011-2012

The University of Tokyo
1997-2012

National Institute of Advanced Industrial Science and Technology
2008-2012

Kyoto University
2010-2011

Genome Institute of Singapore
2011

Chiba Institute of Technology
2011

Szymon M. Kiełbasa1, Raymond Wan2, Kengo Sato3, Paul Horton2 and Martin C. Frith2,4 Department of Computational Biology, Max Planck Institute for Molecular Genetics, Berlin D-14195, Germany; Biology Research Center, Tokyo 135-0064, Japan; Graduate School Frontier Sciences, University Tokyo, Chiba 277-8561, Japan

10.1101/gr.113985.110 article EN cc-by-nc Genome Research 2011-01-05

Pseudoknots found in secondary structures of a number functional RNAs play various roles biological processes. Recent methods for predicting RNA cover certain classes pseudoknotted structures, but only few them achieve satisfying predictions terms both speed and accuracy.We propose IPknot, novel computational method with pseudoknots based on maximizing expected accuracy predicted structure. IPknot decomposes structure into set pseudoknot-free substructures approximates base-pairing...

10.1093/bioinformatics/btr215 article EN cc-by-nc Bioinformatics 2011-06-14

Abstract Accurate predictions of RNA secondary structures can help uncover the roles functional non-coding RNAs. Although machine learning-based models have achieved high performance in terms prediction accuracy, overfitting is a common risk for such highly parameterized models. Here we show that be minimized when folding scores learnt using deep neural network are integrated together with Turner’s nearest-neighbor free energy parameters. Training model thermodynamic regularization ensures...

10.1038/s41467-021-21194-4 article EN cc-by Nature Communications 2021-02-11

The CentroidFold web server (http://www.ncrna.org/centroidfold/) is a application for RNA secondary structure prediction powered by one of the most accurate engine. accepts two kinds sequence data: single and multiple alignment sequences. It responses with result shown as popular base-pair notation graph representation. PDF version representation also available. For sequence, predicts common structure. Usage quite simple. You can paste (FASTA or plain text) (CLUSTAL-W format) into textarea...

10.1093/nar/gkp367 article EN Nucleic Acids Research 2009-05-12

Previous studies have suggested deep learning to be a highly effective approach for screening lead compounds new drugs. Several models been developed by addressing the use of various kinds fingerprints and graph convolution architectures. However, these methods are either advantageous or disadvantageous depending on whether they (1) can distinguish structural differences including chirality compounds, (2) automatically discover features.We another model compound classification. In this...

10.1186/s12859-018-2523-5 article EN cc-by BMC Bioinformatics 2018-12-01

Abstract Motivation: Recent studies have shown that the methods for predicting secondary structures of RNAs on basis posterior decoding base-pairing probabilities has an advantage with respect to prediction accuracy over conventionally utilized minimum free energy methods. However, there is room improvement in objective functions presented previous studies, which are maximized measures structures. Results: We propose novel estimators improve structure RNAs. The proposed maximize function...

10.1093/bioinformatics/btn601 article EN Bioinformatics 2008-12-18

The assembly of multiple genomes from mixed sequence reads is a bottleneck in metagenomic analysis. A single-genome program (assembler) not capable resolving metagenome sequences, so assemblers designed specifically for metagenomics have been developed. MetaVelvet an extension the assembler Velvet. It has proved to generate assemblies with higher N50 scores and quality than such as Velvet SOAPdenovo when applied frequently used this research community. One important open problem its low...

10.1093/dnares/dsu041 article EN cc-by-nc DNA Research 2014-11-27

Abstract RNA structural elements called pseudoknots are involved in various biological phenomena including ribosomal frameshifts. Because it is infeasible to construct an efficiently computable secondary structure model pseudoknots, prediction methods considering not yet widely available. We developed IPknot, which uses heuristics speed up computations, but has remained difficult apply long sequences, such as messenger and viral RNA, because requires cubic computational time with respect...

10.1093/bib/bbab395 article EN cc-by-nc Briefings in Bioinformatics 2021-09-08

Considerable attention has been focused on predicting RNA-RNA interaction since it is a key to identifying possible targets of non-coding small RNAs that regulate gene expression post-transcriptionally. A number computational studies have so far devoted joint secondary structures or binding sites under specific class interactions. In general, there trade-off between range type and efficiency prediction algorithm, thus efficient methods for comprehensive are still awaited.We present RactIP,...

10.1093/bioinformatics/btq372 article EN Bioinformatics 2010-09-04

Abstract Genetically modified nonhuman primates (NHP) are useful models for biomedical research. Gene editing technologies have enabled production of target-gene knock-out (KO) NHP models. Target-gene-KO/knock-in (KI) efficiency CRISPR/Cas9 has not been extensively investigated in marmosets. In this study, optimum conditions target gene modification efficacies CRISPR/mRNA and CRISPR/nuclease marmoset embryos were examined. was more effective than avoiding mosaic genetic alteration....

10.1038/s41598-019-49110-3 article EN cc-by Scientific Reports 2019-09-03

10.1016/j.csbj.2025.04.001 article EN cc-by Computational and Structural Biotechnology Journal 2025-04-01

Abstract Motivation: The computational identification of non-coding RNA regions on the genome is currently receiving much attention. However, it essentially harder than gene-finding problems for protein-coding because sequences do not have strong statistical signals. Since comparative sequence analysis effective detection, efficient methods are expected structural alignment sequences. Several been proposed to accomplish tasks sequences, and we found that one most important points estimate an...

10.1093/bioinformatics/bti1139 article EN Bioinformatics 2005-09-01

Motivation: Since the whole genome sequences of many species have been determined, computational prediction RNA secondary structures and identification those non-coding regions by comparative genomics become important. Therefore, more advanced alignment methods are required. Recently, an approach structural for has introduced to solve these problems. Pair hidden Markov models on tree (PHMMTSs) proposed Sakakibara efficient automata-theoretic structures, although PHMMTSs incapable handling...

10.1093/bioinformatics/bti385 article EN Bioinformatics 2005-03-22

Considerable attention has been focused on predicting the secondary structure for aligned RNA sequences since it is useful not only improving limiting accuracy of conventional prediction but also finding non-coding RNAs in genomic sequences. Although there exist many algorithms sequences, further improvement still awaited. In this article, toward accuracy, a theoretical classification state-of-the-art presented. The based viewpoint maximum expected (MEA), which successfully applied various...

10.1093/nar/gkq792 article EN cc-by-nc Nucleic Acids Research 2010-09-15

A popular approach for predicting RNA secondary structure is the thermodynamic nearest-neighbor model that finds a thermodynamically most stable with minimum free energy (MFE). For further improvement, an alternative based on machine learning techniques has been developed. The learning-based can employ fine-grained includes much richer feature representations ability to fit training data. Although achieved extremely high performance in prediction accuracy, possibility of risk overfitting...

10.1142/s0219720018400255 article EN Journal of Bioinformatics and Computational Biology 2018-10-31

Bacillus subtilis is the main component in fermentation of soybeans. To investigate genetics soybean-fermenting B. strains and its relationship with productivity extracellular poly-γ-glutamic acid (γPGA), we sequenced whole genome eight stains isolated from non-salted fermented soybean foods Southeast Asia. Assembled nucleotide sequences were compared those a natto (fermented food) starter strain BEST195 laboratory standard 168 that incapable γPGA production. Detected variants investigated...

10.1371/journal.pone.0141369 article EN cc-by PLoS ONE 2015-10-27

The importance of accurate and fast predictions multiple alignments for RNA sequences has increased due to recent findings about functional non-coding RNAs. Recent studies suggest that maximizing the expected accuracy will be useful many problems in bioinformatics.We designed a novel estimator structured RNAs, based on predictions. First, we define maximum (MEA) pairwise alignment sequences. This maximizes sum-of-pairs score (SPS) predicted under probability distribution given by...

10.1093/bioinformatics/btp580 article EN Bioinformatics 2009-10-06

Secondary structure prediction of RNA sequences is an important problem. There have been progresses in this area, but the accuracy from sequence still limited. In many cases, however, homologous are available with target whose secondary to be predicted.In article, we propose a new method for predictions individual by taking information their into account without assuming common entire sequences. The proposed based on posterior decoding techniques, which consider all suboptimal structures and...

10.1093/bioinformatics/btp228 article EN cc-by-nc Bioinformatics 2009-05-27

Abstract Motivation: It is well known that the accuracy of RNA secondary structure prediction from a single sequence limited, and thus comparative approach predicts common aligned sequences better choice if homologous with reliable alignments are available. However, correct information needed to produce sequences. To tackle this dilemma, we require fast accurate aligner takes structural into consideration yield alignments, which suitable for prediction. Results: We develop DAFS, novel...

10.1093/bioinformatics/bts612 article EN Bioinformatics 2012-10-11

De novo microbial genome sequencing reached a turning point with third-generation (TGS) platforms, and several genomes have been improved by TGS long reads. Bacillus subtilis natto is closely related to the laboratory standard strain B. Marburg 168, it has function in production of traditional Japanese fermented food “natto.” The BEST195 was previously sequenced short reads, but included some incomplete regions. We resequenced using PacBio RS sequencer, we successfully obtained complete...

10.1371/journal.pone.0109999 article EN cc-by PLoS ONE 2014-10-16

Abstract The first draft of the common marmoset ( Callithrix jacchus ) genome was published by Marmoset Genome Sequencing and Analysis Consortium. based on whole-genome shotgun sequencing current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions supercontigs relatively short contigs that are unmapped to chromosomes in genome. We performed resequencing deep with high-throughput technology. Several different sequence runs using Illumina...

10.1038/srep16894 article EN cc-by Scientific Reports 2015-11-20

We present a web-based tool set Rtips for fast and accurate prediction of RNA 2D complex structures. comprises two computational tools based on integer programming, IPknot predicting secondary structures with pseudoknots RactIP RNA-RNA interactions kissing hairpins. Both servers can run much faster than existing services the same purpose large data sets as well being at least comparable in accuracy. The web server along stand-alone programs is freely accessible http://rna.naist.jp/.

10.1093/nar/gks412 article EN cc-by-nc Nucleic Acids Research 2012-05-16

Abstract Background Prediction of biochemical (metabolic) pathways has a wide range applications, including the optimization drug candidates, and elucidation toxicity mechanisms. Recently, several methods have been developed for pathway prediction to derive goal compound from start compound. However, these require high computational costs, cannot perform comprehensive novel metabolic pathways. Our aim this study is develop de novo method reconstructions predictions unknown biosynthetic in...

10.1186/1471-2105-13-s17-s8 article EN cc-by BMC Bioinformatics 2012-12-01
Coming Soon ...