NFDI4DS | UHH-SEMS - Publication Details

7.2 4Mb STT-MRAM-based cache with memory-access-aware power optimization and write-verify-write / read-modify-write scheme

OPENALEX - Publications

Hiroki Noguchi Kazutaka Ikegami Satoshi Takaya Eishi Arima K. Kushida and 8 more

Two performance gaps in the memory hierarchy, between CPU cache and main memory, mass storage, will become increasingly severe bottlenecks for computing-system performance. Although it is necessary to increase capacity fill these gaps, power also increases when conventional volatile memories are used. A new nonvolatile this purpose has been anticipated. Storage class used second gap. Many candidates exist: ReRAM, PRAM, 3D-cross point type with resistive change RAM. However, last level (LLC)...

10.1109/isscc.2016.7417942 article EN 2022 IEEE International Solid- State Circuits Conference (ISSCC) 2016-01-01

Evaluation of Power Management Control on the Supercomputer Fugaku

OPENALEX - Publications

Yuetsu Kodama Tetsuya Odajima Eishi Arima Mitsuhisa Sato

The supercomputer "Fugaku", which recently ranked number one on multiple supercomputing lists, including the Top500 in June 2020, has various power control features, such as (1) an eco mode that utilizes only of two floating-point pipelines while decreasing supply to chip; (2) a boost increases clock frequency; and (3) core retention function turns unused cores into low-power state. By orchestrating these power-performance features considering characteristics currently running applications,...

10.1109/cluster49012.2020.00069 article EN 2020-09-01

Power/Performance/Area Evaluations for Next-Generation HPC Processors using the A64FX Chip

OPENALEX - Publications

Eishi Arima Yuetsu Kodama Tetsuya Odajima Miwako Tsuji Mitsuhisa Sato

Future HPC systems, including post-exascale supercomputers, will face severe problems such as the slowing-down of Moore's law and limitation power supply. To achieve desired system performance improvement while counteracting these issues, hardware design optimization is a key factor. In this paper, we investigate future directions SIMD-based processor architectures by using A64FX chip customized version power/performance/area simulators, i.e., Gem5 McPAT. More specifically, based on chip,...

10.1109/coolchips52128.2021.9410320 article EN 2021-04-14

GreenCourier

OPENALEX - Publications

Mohak Chadha Thandayuthapani Subramanian Eishi Arima Michael Gerndt Martin Schulz and 1 more

This paper presents GreenCourier, a novel scheduling framework that enables the runtime of serverless functions across geographically distributed regions based on their carbon efficiencies. Our incorporates an intelligent strategy for Kubernetes and supports Knative as platform. To obtain real-time information different geographical regions, our multiple marginal emissions sources such WattTime Carbon-aware SDK. We comprehensively evaluate performance using Google Engine production function...

10.1145/3631295.3631396 preprint EN 2023-11-24

Immediate sleep: Reducing energy impact of peripheral circuits in STT-MRAM caches

OPENALEX - Publications

Eishi Arima Hiroki Noguchi Takashi Nakada Shinobu Miwa Susumu Takeda and 2 more

Implementing last level caches (LLCs) with STT-MRAM is a promising approach for designing energy efficient microprocessors due to high density and low leakage power of its memory cells. However, peripheral circuits an cache still suffer from because large leaky transistors are required drive write current element. To overcome this problem, we propose new management scheme called Immediate Sleep (IS). IS immediately turns off subarray if the next access predicted be not critical in...

10.1109/iccd.2015.7357096 article EN 2015-10-01

Optimizing Hardware Resource Partitioning and Job Allocations on Modern GPUs under Power Caps

OPENALEX - Publications

Eishi Arima Minjoon Kang Issa Saba Josef Weidendorfer Carsten Trinitis and 1 more

CPU-GPU heterogeneous systems are now commonly used in HPC (High-Performance Computing). However, improving the utilization and energy-efficiency of such is still one most critical issues. As single program typically cannot fully utilize all resources within a node/chip, co-scheduling (or co-locating) multiple programs with complementary resource requirements promising solution. Meanwhile, as power consumption has become first-class design constraint for systems, techniques should be...

10.1145/3547276.3548630 preprint EN 2022-08-29

D-MRAM Cache: Enhancing Energy Efficiency with 3T-1MTJ DRAM / MRAM Hybrid Memory

OPENALEX - Publications

Hiroki Noguchi Kumiko Nomura Keiko Abe Shinobu Fujita Eishi Arima and 4 more

This paper describes a proposal of non-volatile cache architecture utilizing novel DRAM / MRAM cell-level hybrid structured memory (D-MRAM) that enables effective power reduction for high performance mobile SoCs without area overhead. Here, the key point to reduce active is intermittent refresh process DRAM-mode. D-MRAM has advantage static consumptions compared conventional SRAM, because there are no leakage paths in cell and it not needed supply voltage its cells when used as MRAM-mode....

10.7873/date.2013.363 article EN Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 2013-01-01

D-MRAM cache: enhancing energy efficiency with 3T-1MTJ DRAM/MRAM hybrid memory

OPENALEX - Publications

Hiroki Noguchi Kumiko Nomura Keiko Abe Shinobu Fujita Eishi Arima and 4 more

This paper describes a proposal of non-volatile cache architecture utilizing novel DRAM / MRAM cell-level hybrid structured memory (D-MRAM) that enables effective power reduction for high performance mobile SoCs without area overhead. Here, the key point to reduce active is intermittent refresh process DRAM-mode. D-MRAM has advantage static consumptions compared conventional SRAM, because there are no leakage paths in cell and it not needed supply voltage its cells when used as MRAM-mode....

10.5555/2485288.2485716 article EN Design, Automation, and Test in Europe 2013-03-18

Low-power cache memory with state-of-the-art STT-MRAM for high-performance processors

OPENALEX - Publications

Susumu Takeda Hiroki Noguchi Kumiko Nomura Shinobu Fujita Shinobu Miwa and 3 more

This paper describes state-of-the-art STT-MRAM, which can drastically save energy consumption dissipated in cache memory system compared with conventional SRAM-based ones. also presents how to build hierarchy both the state-of-art STT-MRAM and SRAM reduce consumption. The key point is "break-even-time aware design" based on normally-off operation. For further power reduction, an intelligent management technique for STT-MRAM-based discussed.

10.1109/isocc.2015.7401759 article EN 2015-11-01

Reinforcement Learning-Driven Co-Scheduling and Diverse Resource Assignments on NUMA Systems

OPENALEX - Publications

Urvij Saroliya Eishi Arima Dai Liu Martin Schulz

10.1109/iccd63220.2024.00034 article EN 2024-11-18

Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach

OPENALEX - Publications

Urvij Saroliya Eishi Arima Dai Liu Martin Schulz

GPU-based heterogeneous architectures are now commonly used in HPC clusters. Due to their architectural simplicity specialized for data-level parallelism, GPUs can offer much higher computational throughput and memory bandwidth than CPUs the same generation do. However, as available resources have increased exponentially over past decades, it has become increasingly difficult a single program fully utilize them. As consequence, industry started supporting several resource partitioning...

10.1109/cluster52292.2023.00023 preprint EN 2023-10-31

Classification-Based Unified Cache Replacement via Partitioned Victim Address History

OPENALEX - Publications

Eishi Arima

In modern microprocessors, lower level cache memories are usually implemented as unified caches where different classes of cachelines such data, instructions, and Page Table Entries (PTEs) coexist. Particularly, frequent PTE accesses following after TLB missies can happen on systems, which is driven by the increasing demands applications for larger working set size, this trend naturally leads to significant conflicts among these kinds cachelines.This paper targets emerging conflict problem...

10.1109/dsd51259.2020.00027 article EN 2020-08-01

Sustainability in HPC: Vision and Opportunities

OPENALEX - Publications

Mohak Chadha Eishi Arima Amir Raoofy Michael Gerndt Martin Schulz

Tackling climate change by reducing and eventually eliminating carbon emissions is a significant milestone on the path toward establishing an environmentally sustainable society. As we transition into exascale era, marked increasing demand scale of HPC resources, community must embrace challenge from designing operating modern systems. In this position paper, describe challenges highlight different opportunities that can aid sites in footprint

10.48550/arxiv.2309.13473 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Sustainability in HPC: Vision and Opportunities

OPENALEX - Publications

Mohak Chadha Eishi Arima Amir Raoofy Michael Gerndt Martin Schulz

Tackling climate change by reducing and eventually eliminating carbon emissions is a significant milestone on the path toward establishing an environmentally sustainable society. As we transition into exascale era, marked increasing demand scale of HPC resources, community must embrace challenge from designing operating modern systems. In this position paper, describe challenges highlight different opportunities that can aid sites in footprint

10.1145/3624062.3624271 article EN 2023-11-10

CLUSTER 2019 Committees

OPENALEX - Publications

Co-Chairs Bridges Ron Brightwell Patrick McCormick Martin Schulz Eishi Arima and 95 more

10.1109/cluster.2019.8891050 article 2019-09-01

Toward Dynamic Orchestration of Data/Power/Process Management for Hybrid Memory Based Systems

OPENALEX - Publications

Eishi Arima Carsten Trinitis Martin Schulz

10.18420/fgbs2021h-03 article EN 2021-01-01