Satoshi Matsuoka

ORCID: 0000-0003-1910-8532
Research Areas
  • Parallel Computing and Optimization Techniques
  • Distributed and Parallel Computing Systems
  • Advanced Data Storage Technologies
  • Cloud Computing and Resource Management
  • Distributed systems and fault tolerance
  • Scientific Computing and Data Management
  • Interconnection Networks and Systems
  • Optical Network Technologies
  • Graph Theory and Algorithms
  • Advanced Combustion Engine Technologies
  • Caching and Content Delivery
  • Semiconductor Lasers and Optical Devices
  • Embedded Systems Design Techniques
  • Photonic and Optical Devices
  • Advanced Photonic Communication Systems
  • Algorithms and Data Compression
  • Solid State Laser Technologies
  • Advanced Optical Network Technologies
  • Peer-to-Peer Network Technologies
  • Laser Design and Applications
  • Vehicle emissions and performance
  • Advanced Graph Neural Networks
  • Electromagnetic Scattering and Analysis
  • Advanced Neural Network Applications
  • Software System Performance and Reliability

RIKEN Center for Computational Science
2018-2025

Tokyo Institute of Technology
2013-2023

RIKEN
2023

Association for Computing Machinery
2019-2020

University of Tennessee at Knoxville
2011-2018

National Institute of Advanced Industrial Science and Technology
2004-2016

Computing Center
2004-2016

Tokyo University of Technology
2016

Japan Science and Technology Agency
2008-2014

Institut national de recherche en informatique et en automatique
2013

The sustained growth of data traffic volume calls for the introduction of an efficient and scalable transport platform for links of 100 Gb/s and beyond in the future optical network. In this article, after briefly reviewing existing major technology options, we propose a novel, spectrum-efficient network architecture called SLICE. SLICE enables sub-wavelength, superwavelength, and multiple-rate accommodation in a highly spectrum-efficient manner, thereby providing fractional bandwidth service. Dynamic variation...

10.1109/mcom.2009.5307468 article EN IEEE Communications Magazine 2009-11-01

Over the last 20 years, the open-source community has provided more and more software on which the world’s high-performance computing systems depend for performance and productivity. The community has invested millions of dollars and years of effort to build key components. However, although the investments in these separate elements have been tremendously valuable, a great deal of productivity has also been lost because of the lack of planning, coordination, and integration of the technologies necessary to make them work together smoothly and efficiently, both within...

10.1177/1094342010391989 article EN The International Journal of High Performance Computing Applications 2011-01-06

Large scientific applications deployed on current petascale systems expend a significant amount of their execution time dumping checkpoint files to remote storage. New fault-tolerant techniques will be critical to efficiently exploit post-petascale systems. In this work, we propose a low-overhead, high-frequency multi-level checkpoint technique in which we integrate a highly reliable topology-aware Reed-Solomon encoding into a three-level checkpoint scheme. We hide the encoding time using one fault-tolerance dedicated thread per node. We implement...

10.1145/2063384.2063427 preprint EN 2011-11-08

Over the past four years, the Big Data and Exascale Computing (BDEC) project organized a series of five international workshops that aimed to explore the ways in which the new forms of data-centric discovery introduced by the ongoing revolution in high-end data analysis (HDA) might be integrated with the established, simulation-centric paradigm of the high-performance computing (HPC) community. Based on those meetings, we argue that the rapid proliferation of digital data generators, the unprecedented growth in the volume and diversity of the data they generate,...

10.1177/1094342018778123 article EN The International Journal of High Performance Computing Applications 2018-07-01

Recent developments in High Level Synthesis tools have attracted software programmers to accelerate their high-performance computing applications on FPGAs. Even though it has been shown that FPGAs can compete with GPUs in terms of performance for stencil computation, most previous work achieves this by avoiding spatial blocking and restricting input dimensions relative to FPGA on-chip memory. In this work, we create a stencil accelerator using Intel FPGA SDK for OpenCL that achieves high performance without having such restrictions. We...

10.1145/3174243.3174248 preprint EN 2018-02-15

10.1023/a:1024083511032 article EN Journal of Grid Computing 2003-01-01

The Grid Datafarm (Gfarm) architecture is designed for global petascale data-intensive computing. It provides a parallel filesystem with online storage, scalable I/O bandwidth, and scalable parallel processing, and it can exploit local I/O in a grid of clusters with tens of thousands of nodes. Gfarm APIs and commands provide a single filesystem image and manipulate metadata consistently. Fault tolerance and load balancing are automatically managed by file duplication or recomputation using a command history log. Preliminary performance evaluation has shown...

10.1109/ccgrid.2002.1017117 article EN 2003-06-25

We demonstrated, for the first time, a novel spectrum-efficient elastic optical path network for 100 Gb/s services and beyond, based on flexible-rate transceivers and variable-bandwidth wavelength crossconnects.

10.1109/ecoc.2008.4729581 article EN 2008-01-01

The scale of high performance computing (HPC) systems is growing exponentially, potentially causing a prohibitive shrinkage of the mean time between failures (MTBF), while the overall increase in the I/O performance of parallel file systems will be far behind the increase in scale. As such, there have been various attempts to decrease checkpoint overhead, one of which is to employ compression techniques on the checkpoint files. While most existing techniques focus on lossless compression, their compression rates and thus effectiveness remain rather limited. Instead, we propose a loss...

10.1109/ipdps.2015.67 article EN 2015-05-01

Word embedding has been well accepted as an important feature in the area of natural language processing (NLP). Specifically, the Word2Vec model learns high-quality word embeddings and is widely used in various NLP tasks. The training of Word2Vec is sequential on a CPU due to strong dependencies between word–context pairs. In this paper, we aim to scale Word2Vec on a GPU cluster. To do this, one main challenge is reducing dependencies inside a large training batch. We heuristically design a variation of Word2Vec, which ensures that each word–context pair contains...

10.1007/s41019-019-0096-6 article EN cc-by Data Science and Engineering 2019-06-01

While there have been several proposals of high-performance global computing systems, scheduling schemes for the systems have not been well investigated. The reason is the difficulty of evaluation by large-scale benchmarks with reproducible results. Our Bricks performance evaluation system allows analysis and comparison of various scheduling schemes in a typical high-performance global computing setting. Bricks can simulate various behaviors of global computing systems, especially the behavior of networks and resource scheduling algorithms. Moreover, Bricks is partitioned into components such that not only can its constituents be replaced to simulate different...

10.1109/hpdc.1999.805287 article EN 2003-01-20

Formation and oxidation of soot particles in a direct-injection diesel engine: an experimental study using the two-color method.

10.4271/820464 article FR SAE technical papers on CD-ROM/SAE technical paper series 1982-02-01

MapReduce is a programming model that enables efficient massive data processing in large-scale computing environments such as supercomputers and clouds. Such large-scale computers employ GPUs for their good peak performance and high memory bandwidth. Since the performance of each job depends on the characteristics of the running application and the underlying computing environments, scheduling MapReduce tasks onto CPU cores and GPU devices for efficient execution is difficult. To address this problem, we have proposed a hybrid scheduling technique for GPU-based computer clusters, which...

10.1109/cloudcom.2010.55 article EN 2010-11-01

General-Purpose computing on Graphics Processing Units (GPGPU) is becoming popular in HPC because of its high peak performance. However, in spite of the potential performance improvements as well as recent promising results in scientific applications, its real performance is not necessarily higher than that of current high-performance CPUs, especially with recent trends towards increasing the number of cores on a single die. This is because GPU performance can be severely limited by such restrictions as memory size and bandwidth and programming using graphics-specific...

10.1109/ipdps.2008.4536163 article EN Proceedings - IEEE International Parallel and Distributed Processing Symposium 2008-04-01

Over the last 20 years, the open-source community has provided more and more software on which the world’s high-performance computing systems depend for performance and productivity. The community has invested millions of dollars and years of effort to build key components. Although the investments in these separate elements have been tremendously valuable, a great deal of productivity has also been lost because of the lack of planning, coordination, and integration of the technologies necessary to make them work together smoothly and efficiently, both within individual...

10.1177/1094342009347714 article EN The International Journal of High Performance Computing Applications 2009-10-12

As the capability and component count of systems increase, the MTBF decreases. Typically, applications tolerate failures with checkpoint/restart to a parallel file system (PFS). While simple, this approach can suffer from contention for PFS resources. Multi-level checkpointing is a promising solution. However, while multi-level checkpointing is successful on today's machines, it is not expected to be sufficient for exascale-class machines, which are predicted to have orders of magnitude larger memory sizes and failure rates. Our solution...

10.5555/2388996.2389022 article EN IEEE International Conference on High Performance Computing, Data, and Analytics 2012-11-10

In high performance computing (HPC), applications are periodically checkpointed to stable storage to increase the success rate of long executions. Nowadays, the overhead imposed by disk-based checkpointing is about 20% of execution time, and in the coming years it will be more than 50% if the checkpoint frequency increases as the fault frequency increases. Diskless checkpointing has been introduced as a solution to avoid the I/O bottleneck of disk-based checkpointing. However, the encoding time, the dedicated resources (the spares), and the memory overhead imposed by diskless checkpointing are significant obstacles against its...

10.1109/ccgrid.2010.40 article EN 2010-01-01

As the capability and component count of systems increase, the MTBF decreases. Typically, applications tolerate failures with checkpoint/restart to a parallel file system (PFS). While simple, this approach can suffer from contention for PFS resources. Multi-level checkpointing is a promising solution. However, while multi-level checkpointing is successful on today's machines, it is not expected to be sufficient for exascale-class machines, which are predicted to have orders of magnitude larger memory sizes and failure rates. Our solution...

10.1109/sc.2012.46 article EN International Conference for High Performance Computing, Networking, Storage and Analysis 2012-11-01

The Computational Grid is a promising platform for the deployment of various high-performance computing applications. A number of projects have addressed the idea of software as a service on the network. These systems usually implement client-server architectures with many servers running on distributed Grid resources and have commonly been referred to as network-enabled servers (NES). An important question is that of scheduling in this multi-client multi-server scenario. Note that in this context most requests are computationally intensive as they...

10.1109/hpdc.2001.945208 article EN 2002-11-13

One of the advantages of virtualized computing clusters compared to traditional shared HPC environments is their ability to accommodate user-specific system customization. However, past attempts at providing virtual clusters are not scalable with an increasing number of VMs, nor do they allow fine-grained customization, assuming that preconfigured VM images are always available on the grid. We propose a new virtual cluster installation technique that achieves efficiency and scalability, yet simultaneously achieves fine-grained customizability. It allows...

10.1109/ccgrid.2007.121 article EN 2007-05-01

The ability of two strains of bacteria to cooperate in the synthesis of an enzyme complex (a minicellulosome) was examined. Three strains of Bacillus subtilis were constructed to express Clostridium cellulovorans genes engB, xynB, and minicbpA. MiniCbpA, EngB, and XynB were synthesized and secreted into the medium by B. subtilis. When the strains with minicbpA and engB or xynB were cocultured, minicellulosomes were synthesized, consisting in one case of miniCbpA and EngB and in the second case of miniCbpA and XynB. Both minicellulosomes showed their respective enzymatic activities. We call this phenomenon...

10.1073/pnas.0610740104 article EN Proceedings of the National Academy of Sciences 2007-01-24