Eishi Arima

ORCID: 0009-0002-7043-4288
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Parallel Computing and Optimization Techniques
  • Advanced Data Storage Technologies
  • Advanced Memory and Neural Computing
  • Cloud Computing and Resource Management
  • Distributed and Parallel Computing Systems
  • Interconnection Networks and Systems
  • Embedded Systems Design Techniques
  • Ferroelectric and Negative Capacitance Devices
  • Low-power high-performance VLSI design
  • Distributed systems and fault tolerance
  • Caching and Content Delivery
  • Catalytic Processes in Materials Science
  • IoT and Edge/Fog Computing
  • Magnetic properties of thin films
  • Neural Networks and Applications
  • Ammonia Synthesis and Nitrogen Reduction
  • Catalysis and Hydrodesulfurization Studies
  • Optimization and Search Problems

Technical University of Munich
2022-2024

The University of Tokyo
2013-2021

Ion Technology Center (Japan)
2020

Two performance gaps in the memory hierarchy, between CPU cache and main memory, mass storage, will become increasingly severe bottlenecks for computing-system performance. Although it is necessary to increase capacity fill these gaps, power also increases when conventional volatile memories are used. A new nonvolatile this purpose has been anticipated. Storage class used second gap. Many candidates exist: ReRAM, PRAM, 3D-cross point type with resistive change RAM. However, last level (LLC)...

10.1109/isscc.2016.7417942 article EN 2022 IEEE International Solid- State Circuits Conference (ISSCC) 2016-01-01

The supercomputer "Fugaku", which recently ranked number one on multiple supercomputing lists, including the Top500 in June 2020, has various power control features, such as (1) an eco mode that utilizes only of two floating-point pipelines while decreasing supply to chip; (2) a boost increases clock frequency; and (3) core retention function turns unused cores into low-power state. By orchestrating these power-performance features considering characteristics currently running applications,...

10.1109/cluster49012.2020.00069 article EN 2020-09-01

Future HPC systems, including post-exascale supercomputers, will face severe problems such as the slowing-down of Moore's law and limitation power supply. To achieve desired system performance improvement while counteracting these issues, hardware design optimization is a key factor. In this paper, we investigate future directions SIMD-based processor architectures by using A64FX chip customized version power/performance/area simulators, i.e., Gem5 McPAT. More specifically, based on chip,...

10.1109/coolchips52128.2021.9410320 article EN 2021-04-14

This paper presents GreenCourier, a novel scheduling framework that enables the runtime of serverless functions across geographically distributed regions based on their carbon efficiencies. Our incorporates an intelligent strategy for Kubernetes and supports Knative as platform. To obtain real-time information different geographical regions, our multiple marginal emissions sources such WattTime Carbon-aware SDK. We comprehensively evaluate performance using Google Engine production function...

10.1145/3631295.3631396 preprint EN 2023-11-24

Implementing last level caches (LLCs) with STT-MRAM is a promising approach for designing energy efficient microprocessors due to high density and low leakage power of its memory cells. However, peripheral circuits an cache still suffer from because large leaky transistors are required drive write current element. To overcome this problem, we propose new management scheme called Immediate Sleep (IS). IS immediately turns off subarray if the next access predicted be not critical in...

10.1109/iccd.2015.7357096 article EN 2015-10-01

CPU-GPU heterogeneous systems are now commonly used in HPC (High-Performance Computing). However, improving the utilization and energy-efficiency of such is still one most critical issues. As single program typically cannot fully utilize all resources within a node/chip, co-scheduling (or co-locating) multiple programs with complementary resource requirements promising solution. Meanwhile, as power consumption has become first-class design constraint for systems, techniques should be...

10.1145/3547276.3548630 preprint EN 2022-08-29

This paper describes a proposal of non-volatile cache architecture utilizing novel DRAM / MRAM cell-level hybrid structured memory (D-MRAM) that enables effective power reduction for high performance mobile SoCs without area overhead. Here, the key point to reduce active is intermittent refresh process DRAM-mode. D-MRAM has advantage static consumptions compared conventional SRAM, because there are no leakage paths in cell and it not needed supply voltage its cells when used as MRAM-mode....

10.7873/date.2013.363 article EN Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 2013-01-01

This paper describes a proposal of non-volatile cache architecture utilizing novel DRAM / MRAM cell-level hybrid structured memory (D-MRAM) that enables effective power reduction for high performance mobile SoCs without area overhead. Here, the key point to reduce active is intermittent refresh process DRAM-mode. D-MRAM has advantage static consumptions compared conventional SRAM, because there are no leakage paths in cell and it not needed supply voltage its cells when used as MRAM-mode....

10.5555/2485288.2485716 article EN Design, Automation, and Test in Europe 2013-03-18

This paper describes state-of-the-art STT-MRAM, which can drastically save energy consumption dissipated in cache memory system compared with conventional SRAM-based ones. also presents how to build hierarchy both the state-of-art STT-MRAM and SRAM reduce consumption. The key point is "break-even-time aware design" based on normally-off operation. For further power reduction, an intelligent management technique for STT-MRAM-based discussed.

10.1109/isocc.2015.7401759 article EN 2015-11-01

GPU-based heterogeneous architectures are now commonly used in HPC clusters. Due to their architectural simplicity specialized for data-level parallelism, GPUs can offer much higher computational throughput and memory bandwidth than CPUs the same generation do. However, as available resources have increased exponentially over past decades, it has become increasingly difficult a single program fully utilize them. As consequence, industry started supporting several resource partitioning...

10.1109/cluster52292.2023.00023 preprint EN 2023-10-31

In modern microprocessors, lower level cache memories are usually implemented as unified caches where different classes of cachelines such data, instructions, and Page Table Entries (PTEs) coexist. Particularly, frequent PTE accesses following after TLB missies can happen on systems, which is driven by the increasing demands applications for larger working set size, this trend naturally leads to significant conflicts among these kinds cachelines.This paper targets emerging conflict problem...

10.1109/dsd51259.2020.00027 article EN 2020-08-01

Tackling climate change by reducing and eventually eliminating carbon emissions is a significant milestone on the path toward establishing an environmentally sustainable society. As we transition into exascale era, marked increasing demand scale of HPC resources, community must embrace challenge from designing operating modern systems. In this position paper, describe challenges highlight different opportunities that can aid sites in footprint

10.48550/arxiv.2309.13473 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Tackling climate change by reducing and eventually eliminating carbon emissions is a significant milestone on the path toward establishing an environmentally sustainable society. As we transition into exascale era, marked increasing demand scale of HPC resources, community must embrace challenge from designing operating modern systems. In this position paper, describe challenges highlight different opportunities that can aid sites in footprint

10.1145/3624062.3624271 article EN 2023-11-10
Co-Chairs Bridges Ron Brightwell Patrick McCormick Martin Schulz Eishi Arima and 95 more Maya Gokhale David Boehme Kenneth B. Kent D. R. Cadena Kurt Brian Ferreira Amanda Randles Scott Levy Engin Arslan Michael Bader Costas Bekas Huilong Chen Rafael Ferreira da Silva Johannes Lagguth Hatem Simula Piotr Kaust Richard Membarth Gabriele Mencagli Shirley Moore Antonio J. Peña Sivasankaran Rajamanickam Suzanne M. Shontz Francesco Silvestri Shaden Smith Hari Sundar Nathan R. Tallent Ramachandran Vaidyanathan Tobias Weinzierl Mattan Erez Sudheer Chunduri Guilherme Cox Alexandros Daglis Sven Karlsson E. Kim John Kim Jagadish Kotra Frank Mueller Vassilis Papaefstathiou Gilles Pokham Steve Reihnhart Minsoo Rhu Alex Kaist Kentaro Rico Osman Sano Jeremy Wilkie Seyed Majid Zahedi Jishen Zhao Tianhao Zheng Albert Google Dorian Arnold Ali Anwar Michaela Becchi Aurélien Bouteiller Anthony Danalis Judit Giménez Taylor Groves Amina Guermouche Samuel K. Gutiérrez Laurent Lefèvre Dong Li Abid Malik Olga Pearce Judy Qiu Ioan Raicu Iván Rodero Seetheram Seelam Sameer Shende Alexandru Uta Carlos A. Varela Patrick Widener Jia Zou Marı́a S. Pérez Gagan Agrawal Gabriel Antoniu Philip Carns Toni Cortez Alexandru Costan Ian Foster Pilar González‐Férez Jian Huang Shadi Ibrahim Michael Kühn Adrien Lèbre Pierre Matri Suzanne Mcintosh Sai Narasimhamurthy Youssef S. G. Nashed Lukas Rupprecht Alberto Sánchez Michael Schoettner Heinrich-Heine Universtiät Düsseldorf Robert Sisneros Domenic Talia Jon Woodring Simon David Hammond Kevin Huck

10.1109/cluster.2019.8891050 article 2019-09-01
Coming Soon ...