NFDI4DS | UHH-SEMS - Publication Details

Hirokazu Chiba

ORCID: 0000-0003-4062-8903

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5059383385

Research Areas

Genomics and Phylogenetic Studies
Biomedical Text Mining and Ontologies
Semantic Web and Ontologies
RNA and protein synthesis mechanisms
Bioinformatics and Genomic Networks
Graph Theory and Algorithms
Advanced Graph Neural Networks
Genomics and Rare Diseases
Machine Learning in Bioinformatics
Plant biochemistry and biosynthesis
Genomics and Chromatin Dynamics
Gene expression and cancer classification
Invertebrate Immune Response Mechanisms
Silk-based biomaterials and applications
Genetics, Bioinformatics, and Biomedical Research
Chromosomal and Genetic Variations
Silkworms and Sericulture Research
Scientific Computing and Data Management
Natural product bioactivities and synthesis
Advanced Database Systems and Queries
Genetic Associations and Epidemiology
Environmental DNA in Biodiversity Studies
Phytochemical compounds biological activities
Glycosylation and Glycoproteins Research
Research Data Management Practices

Research Organization of Information and Systems
2018-2025

The University of Tokyo
2008-2022

National Institutes of Natural Sciences
2012-2020

National Institute for Basic Biology
2014-2020

National Institute of Advanced Industrial Science and Technology
2010

Tokyo University of Science
2008

MBGD update 2015: microbial genome database for flexible ortholog analysis utilizing a diverse set of genomic data

OPENALEX - Publications

Ikuo Uchiyama Motohiro Mihara Hiroyo Nishide Hirokazu Chiba

The microbial genome database for comparative analysis (MBGD) (available at http://mbgd.genome.ad.jp/) is a comprehensive ortholog flexible of genomes, where the users are allowed to create an table among any specified set organisms. Because rapid increase in data owing next-generation sequencing technology, it becomes increasingly challenging maintain high-quality orthology relationships while allowing incorporate latest genomic available into analysis. many recently accumulating draft...

10.1093/nar/gku1152 article EN cc-by Nucleic Acids Research 2014-11-14

BioHackathon series in 2011 and 2012: penetration of ontology and linked data in life science domains

OPENALEX - Publications

Toshiaki Katayama Mark D. Wilkinson Kiyoko F. Aoki‐Kinoshita Shuichi Kawashima Yasunori Yamamoto and 80 more

The application of semantic technologies to the integration biological data and interoperability bioinformatics analysis visualization tools has been common theme a series annual BioHackathons hosted in Japan for past five years. Here we provide review activities outcomes from held 2011 Kyoto 2012 Toyama. In order efficiently implement life sciences, participants formed various sub-groups worked on following topics: Resource Description Framework (RDF) models specific domains, text mining...

10.1186/2041-1480-5-5 article EN cc-by Journal of Biomedical Semantics 2014-01-01

Ten Years of Collaborative Progress in the Quest for Orthologs

OPENALEX - Publications

Benjamin Linard Ingo Ebersberger Shawn E. McGlynn Natasha Glover Tomohiro Mochizuki and 52 more

Accurate determination of the evolutionary relationships between genes is a foundational challenge in biology. Homology-evolutionary relatedness-is many cases readily determined based on sequence similarity analysis. By contrast, whether or not two directly descended from common ancestor by speciation event (orthologs) duplication (paralogs) more challenging, yet provides critical information history gene. Since 2009, this task has been focus Quest for Orthologs (QFO) Consortium. The sixth...

10.1093/molbev/msab098 article EN cc-by Molecular Biology and Evolution 2021-04-01

MBGD update 2018: microbial genome database based on hierarchical orthology relations covering closely related and distantly related comparisons

OPENALEX - Publications

Ikuo Uchiyama Motohiro Mihara Hiroyo Nishide Hirokazu Chiba Masaki Kato

The Microbial Genome Database for Comparative Analysis (MBGD) is a database comparative genomics based on comprehensive orthology analysis of bacteria, archaea and unicellular eukaryotes. MBGD now contains 6318 genomes. To utilize the both closely related distantly genomes, previously provided two types ortholog tables: standard table containing one representative genome from each genus covering entire taxonomic range taxon specific tables taxon. However, this approach has drawback in that...

10.1093/nar/gky1054 article EN cc-by Nucleic Acids Research 2018-11-03

Expanding the concept of ID conversion in TogoID by introducing multi-semantic and label features

OPENALEX - Publications

Shuya Ikeda Kiyoko F. Aoki‐Kinoshita Hirokazu Chiba Susumu Goto Masae Hosoda and 8 more

TogoID ( https://togoid.dbcls.jp/ ) is an identifier (ID) conversion service designed to link IDs across diverse categories of life science databases. With its ability obtain related in different semantic relationships, a user-friendly web interface, and regular automatic data update system, has been valuable tool for bioinformatics. We have recently expanded TogoID's represent semantics between datasets, enabling it handle multiple relationships within dataset pairs. This enhancement...

10.1186/s13326-024-00322-1 article EN cc-by-nc-nd Journal of Biomedical Semantics 2025-01-08

MBGD: Microbial genome database for comparative analysis featuring enhanced functionality to characterize gene and genome functions through large-scale orthology analysis

OPENALEX - Publications

Ikuo Uchiyama Motohiro Mihara Hiroyo Nishide Hirokazu Chiba M. Takayanagi and 2 more

10.1016/j.jmb.2025.168957 article EN cc-by Journal of Molecular Biology 2025-01-01

MBGD update 2013: the microbial genome database for exploring the diversity of microbial world

OPENALEX - Publications

Ikuo Uchiyama Motohiro Mihara Hiroyo Nishide Hirokazu Chiba

The microbial genome database for comparative analysis (MBGD, available at http://mbgd.genome.ad.jp/) is a platform comparison based on orthology analysis. As its unique feature, MBGD allows users to conduct among any specified set of organisms; this flexibility adapt variety genomic study. Reflecting the huge diversity world, number projects now becomes several thousands. To efficiently explore entire data, provides summary pages pre-calculated ortholog tables various taxonomic groups. For...

10.1093/nar/gks1006 article EN cc-by-nc Nucleic Acids Research 2012-10-30

Gearing up to handle the mosaic nature of life in the quest for orthologs

OPENALEX - Publications

Sofia K. Forslund Cécile Pereira Salvador Capella-Gutiérrez Alan Sousa da Silva Adrian Altenhoff and 76 more

Abstract Summary: The Quest for Orthologs (QfO) is an open collaboration framework experts in comparative phylogenomics and related research areas who have interest highly accurate orthology predictions their applications. We here report highlights discussion points from the QfO meeting 2015 held Barcelona. Achievements recent years established a basis to support developments improved prediction explore new approaches. Central effort proper benchmarking of methods services, as well design...

10.1093/bioinformatics/btx542 article EN cc-by Bioinformatics 2017-08-29

The Orthology Ontology: development and applications

OPENALEX - Publications

Jesualdo Tomás Fernández‐Breis Hirokazu Chiba María Del Carmen Legaz-García Ikuo Uchiyama

Computational comparative analysis of multiple genomes provides valuable opportunities to biomedical research. In particular, orthology can play a central role in genomics; it guides establishing evolutionary relations among genes organisms and allows functional inference gene products. However, the wide variations current databases necessitate research toward shareability content that is generated by different tools stored structures. Exchanging with other communities requires making...

10.1186/s13326-016-0077-x article EN cc-by Journal of Biomedical Semantics 2016-06-04

TogoID: an exploratory ID converter to bridge biological datasets

OPENALEX - Publications

Shuya Ikeda Hiromasa Ono Tazro Ohta Hirokazu Chiba Yuki Naito and 6 more

Abstract Motivation Understanding life cannot be accomplished without making full use of biological data, which are scattered across databases diverse categories in sciences. To connect such data seamlessly, identifier (ID) conversion plays a key role. However, existing ID services have disadvantages, as covering only limited range databases, not keeping up with the updates original and outputs being hard to interpret context relations, especially when converting IDs multiple steps. Results...

10.1093/bioinformatics/btac491 article EN cc-by Bioinformatics 2022-07-08

Construction of an Ortholog Database Using the Semantic Web Technology for Integrative Analysis of Genomic Data

OPENALEX - Publications

Hirokazu Chiba Hiroyo Nishide Ikuo Uchiyama

Recently, various types of biological data, including genomic sequences, have been rapidly accumulating. To discover knowledge from such growing heterogeneous a flexible framework for data integration is necessary. Ortholog information central resource interlinking corresponding genes among different organisms, and the Semantic Web provides key technology data. We constructed an ortholog database using technology, aiming at numerous information. formalize structure in Web, we Ontology...

10.1371/journal.pone.0122802 article EN cc-by PLoS ONE 2015-04-13

Weak correlation between sequence conservation in promoter regions and in protein-coding regions of human-mouse orthologous gene pairs

OPENALEX - Publications

Hirokazu Chiba Riu Yamashita Kengo Kinoshita Kenta Nakai

Abstract Background Interspecies sequence comparison is a powerful tool to extract functional or evolutionary information from the genomes of organisms. A number studies have compared protein sequences promoter between mammals, which provided many insights into genomics. However, correlation conservation and remains controversial. Results We examined as well for 6,901 human mouse orthologous genes, observed very weak them. further investigated their relationship by decomposing it based on...

10.1186/1471-2164-9-152 article EN cc-by BMC Genomics 2008-04-02

Triterpene RDF: Developing a database of plant enzymes and transcription factors involved in triterpene biosynthesis using the Resource Description Framework

OPENALEX - Publications

Keita Tamura Hirokazu Chiba Hidemasa Bono

Plants produce structurally diverse triterpenes (triterpenoids and steroids). Their biosynthesis occurs from a common precursor, namely 2,3-oxidosqualene, followed by cyclization catalyzed oxidosqualene cyclases (OSCs) to yield various triterpene skeletons. Steroids, which are biosynthesized cycloartenol or lanosterol, essential primary metabolites in most plant species, along with lineage-specific steroids, such as steroidal glycoalkaloids found the Solanum species. Other skeletons...

10.5511/plantbiotechnology.24.0312c article EN Plant Biotechnology 2024-08-25

Improvement of domain-level ortholog clustering by optimizing domain-specific sum-of-pairs score

OPENALEX - Publications

Hirokazu Chiba Ikuo Uchiyama

Identification of ortholog groups is a crucial step in comparative analysis multiple genomes. Although several computational methods have been developed to create groups, most those do not evaluate orthology at the sub-gene level. In our method for domain-level clustering, DomClust, proteins are split into domains on basis alignment boundaries identified by all-against-all pairwise comparison, but it often fails determine appropriate boundaries. We improve classification using information....

10.1186/1471-2105-15-148 article EN cc-by BMC Bioinformatics 2014-05-17

SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases

OPENALEX - Publications

Hirokazu Chiba Ikuo Uchiyama

Toward improved interoperability of distributed biological databases, an increasing number datasets have been published in the standardized Resource Description Framework (RDF). Although powerful SPARQL Protocol and RDF Query Language (SPARQL) provides a basis for exploiting writing code is burdensome users including bioinformaticians. Thus, easy-to-use interface necessary. We developed SPANG, client that has unique features querying datasets. SPANG dynamically generates typical queries...

10.1186/s12859-017-1531-1 article EN cc-by BMC Bioinformatics 2017-02-08

TogoGenome/TogoStanza: modularized Semantic Web genome database

OPENALEX - Publications

Toshiaki Katayama Shuichi Kawashima Shinobu Okamoto Yuki Moriya Hirokazu Chiba and 4 more

TogoGenome is a genome database that purely based on the Semantic Web technology, which enables integration of heterogeneous data and flexible semantic searches. All information stored as Resource Description Framework (RDF) data, reporting web pages are generated fly using SPARQL Protocol RDF Query Language (SPARQL) queries. provides semantic-faceted search system by gene functional annotation, taxonomy, phenotypes environment relevant ontologies. also serves an interface to conduct...

10.1093/database/bay132 article EN cc-by Database 2018-11-29

Mapping RDF Graphs to Property Graphs

OPENALEX - Publications

Shota Matsumoto Ryota Yamanaka Hirokazu Chiba

Increasing amounts of scientific and social data are published in the Resource Description Framework (RDF). Although RDF can be queried using SPARQL language, even SPARQL-based operation has a limitation implementing traversal or analytical algorithms. Recently, variety graph database implementations dedicated to analyses on property model have emerged. However, not interoperable. Here, we developed framework based Graph Mapping Language (G2GML) for mapping graphs make most accumulated data....

10.48550/arxiv.1812.01801 preprint EN other-oa arXiv (Cornell University) 2018-01-01

BioHackathon 2015: Semantics of data for life sciences and reproducible research

OPENALEX - Publications

Rutger Vos Toshiaki Katayama Hiroyuki Mishima Shin Kawano Shuichi Kawashima and 72 more

<ns3:p>We report on the activities of 2015 edition BioHackathon, an annual event that brings together researchers and developers from around world to develop tools technologies promote reusability biological data. We discuss issues surrounding representation, publication, integration, mining reuse data metadata across a wide range biomedical types relevance for life sciences, including chemistry, genotypes phenotypes, orthology phylogeny, proteomics, genomics, glycomics, metabolomics....

10.12688/f1000research.18236.1 preprint EN cc-by F1000Research 2020-02-24

Exploring Disease Model Mouse Using Knowledge Graphs: Combining Gene Expression, Orthology, and Disease Datasets

OPENALEX - Publications

Tatsuya Kushida Tarcisio Mendes de Farias Ana C. Sima Christophe Dessimoz Hirokazu Chiba and 2 more

Abstract Background The RIKEN BRC develops and maintains the BioResource MetaDatabase to help users explore appropriate target bioresources for their experiments prepare precise high-quality data infrastructures. Swiss Institute of Bioinformatics two RDF datasets across multi species study gene expression orthology: Bgee Orthologous MAtrix (OMA, an orthology database). Methods This integrates knowledge graph with Resource Description Framework (RDF) from Bgee, a database, OMA, DisGeNET,...

10.1101/2023.08.30.555283 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2023-08-31

Property Graph Exchange Format

OPENALEX - Publications

Hirokazu Chiba Ryota Yamanaka Shota Matsumoto

Recently, a variety of database implementations adopting the property graph model have emerged. However, interoperable management data on these is challenging due to differences in models and formats. Here, we redefine incorporating existing propose serialization formats for graphs. The independent specific provides basis data. proposed not only general but also intuitive, thus it useful creating maintaining To demonstrate practical use our serialization, implemented converters from into...

10.48550/arxiv.1907.03936 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Triterpene RDF: Developing a database of plant enzymes and transcription factors involved in triterpene biosynthesis using the Resource Description Framework

OPENALEX - Publications

Keita Tamura Hirokazu Chiba Hidemasa Bono

Abstract Plants produce structurally diverse triterpenes (triterpenoids and steroids). Their biosynthesis occurs from a common precursor, namely 2,3-oxidosqualene, followed by cyclization catalyzed oxidosqualene cyclases (OSCs) to yield various triterpene skeletons. Steroids, which are biosynthesized cycloartenol or lanosterol, essential primary metabolites in most plant species, along with lineage-specific steroids, such as steroidal glycoalkaloids found the Solanum species. Other skeletons...

10.1101/2024.01.08.574260 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2024-01-09

BioHackJP 2023 Report R1: Mapping human genome variations to their mouse counterparts for identifying disease model mouse strains

OPENALEX - Publications

N Mitsuhashi Hirokazu Chiba Yuki Moriya Toyoyuki Takada

In disease model mouse strains used for human studies, information on genomic variations is essential elucidating the relationship between haplotypes and susceptibility. To select a appropriately, it crucial to identify variants with same effect as disease-causing in humans. BioHackathon Japan J2023, we focused nucleotide involved amino acid substitutions. We developed an API that matches from MoG+ database within gene regions defined by HGNC identifiers or symbols. After Hackathon, will map...

10.37044/osf.io/8kuzr preprint EN 2024-01-20

Coming Soon ...