NFDI4DS | UHH-SEMS - Publication Details

Satoshi Matsuoka

ORCID: 0000-0003-1910-8532

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100634486

Research Areas

Parallel Computing and Optimization Techniques
Distributed and Parallel Computing Systems
Advanced Data Storage Technologies
Cloud Computing and Resource Management
Distributed systems and fault tolerance
Scientific Computing and Data Management
Interconnection Networks and Systems
Optical Network Technologies
Graph Theory and Algorithms
Advanced Combustion Engine Technologies
Caching and Content Delivery
Semiconductor Lasers and Optical Devices
Embedded Systems Design Techniques
Photonic and Optical Devices
Advanced Photonic Communication Systems
Algorithms and Data Compression
Solid State Laser Technologies
Advanced Optical Network Technologies
Peer-to-Peer Network Technologies
Laser Design and Applications
Vehicle emissions and performance
Advanced Graph Neural Networks
Electromagnetic Scattering and Analysis
Advanced Neural Network Applications
Software System Performance and Reliability

RIKEN Center for Computational Science
2018-2025

Tokyo Institute of Technology
2013-2023

RIKEN
2023

Association for Computing Machinery
2019-2020

University of Tennessee at Knoxville
2011-2018

National Institute of Advanced Industrial Science and Technology
2004-2016

Computing Center
2004-2016

Tokyo University of Technology
2016

Japan Science and Technology Agency
2008-2014

Institut national de recherche en informatique et en automatique
2013

Spectrum-efficient and scalable elastic optical path network: architecture, benefits, and enabling technologies

OPENALEX - Publications

Masahiko Jinno H. Takara Bartłomiej Kozicki Yukio Tsukishima Yuji Sone and 1 more

The sustained growth of data traffic volume calls for an introduction efficient and scalable transport platform links 100 Gb/s beyond in the future optical network. In this article, after briefly reviewing existing major technology options, we propose a novel, spectrum- efficient, network architecture called SLICE. SLICE enables sub-wavelength, superwavelength, multiple-rate accommodation highly spectrum-efficient manner, thereby providing fractional bandwidth service. Dynamic variation...

10.1109/mcom.2009.5307468 article EN IEEE Communications Magazine 2009-11-01

The International Exascale Software Project roadmap

OPENALEX - Publications

Jack Dongarra Pete Beckman Terry Moore Patrick Aerts Giovanni Aloisio and 60 more

Over the last 20 years, open-source community has provided more and software on which world’s high-performance computing systems depend for performance productivity. The invested millions of dollars years effort to build key components. However, although investments in these separate elements have been tremendously valuable, a great deal productivity also lost because lack planning, coordination, integration technologies necessary make them work together smoothly efficiently, both within...

10.1177/1094342010391989 article EN The International Journal of High Performance Computing Applications 2011-01-06

FTI

OPENALEX - Publications

Leonardo Bautista-Gomez Seiji Tsuboi Dimitri Komatitsch Franck Cappello Naoya Maruyama and 1 more

Large scientific applications deployed on current petascale systems expend a significant amount of their execution time dumping checkpoint files to remote storage. New fault tolerant techniques will be critical efficiently exploit post-petascale systems. In this work, we propose low-overhead high-frequency multi-level technique in which integrate highly-reliable topology-aware Reed-Solomon encoding three-level scheme. We hide the using one Fault-Tolerance dedicated thread per node. implement...

10.1145/2063384.2063427 preprint EN 2011-11-08

Big data and extreme-scale computing

OPENALEX - Publications

Mark Asch Terry Moore Rosa M. Badía Micah Beck Pete Beckman and 34 more

Over the past four years, Big Data and Exascale Computing (BDEC) project organized a series of five international workshops that aimed to explore ways in which new forms data-centric discovery introduced by ongoing revolution high-end data analysis (HDA) might be integrated with established, simulation-centric paradigm high-performance computing (HPC) community. Based on those meetings, we argue rapid proliferation digital generators, unprecedented growth volume diversity they generate,...

10.1177/1094342018778123 article EN The International Journal of High Performance Computing Applications 2018-07-01

Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL

OPENALEX - Publications

Hamid Reza Zohouri Artur Podobas Satoshi Matsuoka

Recent developments in High Level Synthesis tools have attracted software programmers to accelerate their high-performance computing applications on FPGAs. Even though it has been shown that FPGAs can compete with GPUs terms of performance for stencil computation, most previous work achieve this by avoiding spatial blocking and restricting input dimensions relative FPGA on-chip memory. In we create a accelerator using Intel SDK OpenCL achieves high without having such restrictions. We...

10.1145/3174243.3174248 preprint EN 2018-02-15

OPENALEX - Publications

Yoshio Tanaka Hidemoto Nakada Satoshi Sekiguchi Toyotaro Suzumura Satoshi Matsuoka

10.1023/a:1024083511032 article EN Journal of Grid Computing 2003-01-01

Grid Datafarm Architecture for Petascale Data Intensive Computing

OPENALEX - Publications

Osamu Tatebe Y. Morita Satoshi Matsuoka Noriyuki Soda Satoshi Sekiguchi

The Grid Datafarm (Gfarm) architecture is designed for global petascale data-intensive computing. It provides a parallel filesystem with online storage, scalable I/O bandwidth, and processing, it can exploit local in grid of clusters tens thousands nodes. Gfarm APIs commands provide single image manipulate metadata consistently. Fault tolerance load balancing are automatically managed by file duplication or recomputation using command history log. Preliminary performance evaluation has shown...

10.1109/ccgrid.2002.1017117 article EN 2003-06-25

Demonstration of novel spectrum-efficient elastic optical path network with per-channel variable capacity of 40 Gb/s to over 400 Gb/s

OPENALEX - Publications

Masahiko Jinno H. Takara Bartłomiej Kozicki Yukio Tsukishima Toshihide Yoshimatsu and 6 more

We demonstrated, for the first time, a novel spectrum-efficient elastic optical path network 100 Gb/s services and beyond, based on flexible rate transceivers variable-bandwidth wavelength crossconnects.

10.1109/ecoc.2008.4729581 article EN 2008-01-01

Exploration of Lossy Compression for Application-Level Checkpoint/Restart

OPENALEX - Publications

Naoto Sasaki Kento Sato Toshio Endo Satoshi Matsuoka

The scale of high performance computing (HPC) systems is exponentially growing, potentially causing prohibitive shrinkage mean time between failures (MTBF) while the overall increase in I/O parallel file will be far behind scale. As such, there have been various attempts to decrease checkpoint overhead, one which employ compression techniques files. While most existing focus on lossless compression, their rates and thus effectiveness remain rather limited. Instead, we propose a loss...

10.1109/ipdps.2015.67 article EN 2015-05-01

Scaling Word2Vec on Big Corpus

OPENALEX - Publications

Bofang Li Aleksandr Drozd Yuhe Guo Tao Liu Satoshi Matsuoka and 1 more

Word embedding has been well accepted as an important feature in the area of natural language processing (NLP). Specifically, Word2Vec model learns high-quality word embeddings and is widely used various NLP tasks. The training sequential on a CPU due to strong dependencies between word–context pairs. In this paper, we target scale GPU cluster. To do this, one main challenge reducing inside large batch. We heuristically design variation Word2Vec, which ensures that each pair contains...

10.1007/s41019-019-0096-6 article EN cc-by Data Science and Engineering 2019-06-01

Overview of a performance evaluation system for global computing scheduling algorithms

OPENALEX - Publications

Atsuko Takefusa Satoshi Matsuoka Hidemoto Nakada Kento Aida Umpei Nagashima

While there have been several proposals of high-performance global computing systems, scheduling schemes for the systems not well investigated. The reason is difficulties evaluation by large-scale benchmarks with reproducible results. Our Bricks performance system allows analysis and comparison various in a typical setting. can simulate behaviors especially behavior networks resource algorithms. Moreover, partitioned into components such that only its constituents be replaced to different...

10.1109/hpdc.1999.805287 article EN 2003-01-20

Formation and Oxidation Processes of Soot Particulates in a D. I. Diesel Engine — An Experimental Study via the Two-Color Method

OPENALEX - Publications

Yukio Matsui Takeyuki Kamimoto Satoshi Matsuoka

Formation et oxydation des particules de suie dans un moteur diesel a injection directe. Etude experimentale par la methode deux couleurs

10.4271/820464 article FR SAE technical papers on CD-ROM/SAE technical paper series 1982-02-01

Hybrid Map Task Scheduling for GPU-Based Heterogeneous Clusters

OPENALEX - Publications

Koichi Shirahata Hitoshi Sato Satoshi Matsuoka

MapReduce is a programming model that enables efficient massive data processing in large-scale computing environments such as supercomputers and clouds. Such computers employ GPUs to enjoy its good peak performance high memory bandwidth. Since the of each job depending on running application characteristics underlying environments, scheduling tasks onto CPU cores GPU devices for execution difficult. To address this problem, we have proposed hybrid technique GPU-based computer clusters, which...

10.1109/cloudcom.2010.55 article EN 2010-11-01

An efficient, model-based CPU-GPU heterogeneous FFT library

OPENALEX - Publications

Yasuhito Ogata Toshio Endo Naoya Maruyama Satoshi Matsuoka

General-Purpose computing on Graphics Processing Units (GPGPU) is becoming popular in HPC because of its high peak performance. However, spite the potential performance improvements as well recent promising results scientific applications, real not necessarily higher than that current high-performance CPUs, especially with trends towards increasing number cores a single die. This GPU can be severely limited by such restrictions memory size and bandwidth programming using graphics-specific...

10.1109/ipdps.2008.4536163 article EN Proceedings - IEEE International Parallel and Distributed Processing Symposium 2008-04-01

The International Exascale Software Project: a Call To Cooperative Action By the Global High-Performance Community

OPENALEX - Publications

Jack Dongarra Pete Beckman Patrick Aerts Frank Cappello Thomas Lippert and 6 more

Over the last 20 years, open-source community has provided more and software on which world’s high-performance computing systems depend for performance productivity. The invested millions of dollars years effort to build key components. Although investments in these separate elements have been tremendously valuable, a great deal productivity also lost because lack planning, coordination, integration technologies necessary make them work together smoothly efficiently, both within individual...

10.1177/1094342009347714 article EN The International Journal of High Performance Computing Applications 2009-10-12

Design and modeling of a non-blocking checkpointing system

OPENALEX - Publications

Kento Sato Kathryn Mohror Adam Moody Todd Gamblin Bronis R. de Supinski and 2 more

As the capability and component count of systems increase, MTBF decreases. Typically, applications tolerate failures with checkpoint/restart to a parallel file system (PFS). While simple, this approach can suffer from contention for PFS resources. Multi-level checkpointing is promising solution. However, while multi-level successful on today's machines, it not expected be sufficient exascale class which are predicted have orders magnitude larger memory sizes failure rates. Our solution...

10.5555/2388996.2389022 article EN IEEE International Conference on High Performance Computing, Data, and Analytics 2012-11-10

Distributed Diskless Checkpoint for Large Scale Systems

OPENALEX - Publications

Leonardo Bautista-Gomez Naoya Maruyama Franck Cappello Satoshi Matsuoka

In high performance computing (HPC), the applications are periodically check pointed to stable storage increase success rate of long executions. Nowadays, overhead imposed by disk-based checkpoint is about 20% execution time and in next years it will be more than 50% if frequency increases as fault increases. Diskless has been introduced a solution avoid IO bottleneck checkpoint. However, encoding time, dedicated resources (the spares) memory diskless significant obstacles against its...

10.1109/ccgrid.2010.40 article EN 2010-01-01

Design and modeling of a non-blocking checkpointing system

OPENALEX - Publications

Kento Sato Naoya Maruyama Kathryn Mohror Adam Moody Todd Gamblin and 2 more

10.1109/sc.2012.46 article EN International Conference for High Performance Computing, Networking, Storage and Analysis 2012-11-01

A study of deadline scheduling for client-server systems on the Computational Grid

OPENALEX - Publications

Atsuko Takefusa Henri Casanova Satoshi Matsuoka Fran Berman

The Computational Grid is a promising platform for the deployment of various high-performance computing applications. A number projects have addressed idea software as service on network. These systems usually implement client-server architectures with many servers running distributed resources and commonly been referred to network-enabled (NES). An important question that scheduling in this multi-client multi-server scenario. Note context most requests are computationally intensive they...

10.1109/hpdc.2001.945208 article EN 2002-11-13

Virtual Clusters on the Fly - Fast, Scalable, and Flexible Installation

OPENALEX - Publications

Hideo Nishimura Naoya Maruyama Satoshi Matsuoka

One of the advantages in virtualized computing clusters compared to traditional shared HPC environments is their ability accommodate user-specific system customization. However, past attempts providing virtual are not scalable with increasing number VMs, nor do they allow fine-grained customization assuming that preconfigured VM images always available on grid. We propose a new cluster installation technique achieves efficiency and scalability, yet simultaneously customizability. It allows...

10.1109/ccgrid.2007.121 article EN 2007-05-01

Synthesis of Clostridium cellulovorans minicellulosomes by intercellular complementation

OPENALEX - Publications

Takamitsu Arai Satoshi Matsuoka Hee‐Yeon Cho Hideaki Yukawa Masayuki Inui and 2 more

The ability of two strains bacteria to cooperate in the synthesis an enzyme complex (a minicellulosome) was examined. Three Bacillus subtilis were constructed express Clostridium cellulovorans genes engB, xynB, and minicbpA. MiniCbpA, EngB, XynB synthesized secreted into medium by B. subtilis. When with minicbpA engB or xynB cocultured, minicellulosomes synthesized, consisting one case miniCbpA EngB second XynB. Both showed their respective enzymatic activities. We call this phenomenon...

10.1073/pnas.0610740104 article EN Proceedings of the National Academy of Sciences 2007-01-24

Coming Soon ...