NFDI4DS | UHH-SEMS - Publication Details

Christian Jacobi

ORCID: 0000-0003-0522-1630

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5040039726

Research Areas

Parallel Computing and Optimization Techniques
Embedded Systems Design Techniques
Advanced Data Storage Technologies
Formal Methods in Verification
Interconnection Networks and Systems
Low-power high-performance VLSI design
Numerical Methods and Algorithms
Logic, programming, and type systems
Radiation Effects in Electronics
Real-Time Systems Scheduling
Distributed and Parallel Computing Systems
Digital Filter Design and Implementation
Advanced Authentication Protocols Security
Advanced Neural Network Applications
VLSI and Analog Circuit Testing
Algorithms and Data Compression
Distributed systems and fault tolerance
Advanced Software Engineering Methodologies
User Authentication and Security Systems
Security and Verification in Computing
Analog and Mixed-Signal Circuit Design
Cloud Computing and Remote Desktop Technologies
Scientific Computing and Data Management
Architecture and Computational Design
IoT and Edge/Fog Computing

IBM (United States)
2005-2022

Intel (United States)
2022

Xilinx (United States)
2022

The University of Tokyo
2022

Takeda (United States)
2022

University of Massachusetts Amherst
2022

NEC (Japan)
2022

Poughkeepsie Public Library District
2015-2018

IEEE Computer Society
2013

IBM (Germany)
2002-2012

Transactional Memory Architecture and Implementation for IBM System Z

OPENALEX - Publications

Christian Jacobi T. J. Slegel D. F. Greiner

We present the introduction of transactional memory into next generation IBM System z CPU. first describe instruction-set architecture features, including requirements for enterprise-class software RAS. then implementation in zEnterprise EC12 (zEC12) microprocessor generation, focusing on how can be embedded existing cache design and multiprocessor shared-memory infrastructure. explain practical reasons behind our choices. The zEC12 system is available since September 2012.

10.1109/micro.2012.12 article EN 2012-12-01

Design and microarchitecture of the IBM System z10 microprocessor

OPENALEX - Publications

Chung-Lung Shum F.Y. Busaba S. Dao-Trong G. Gerwig Christian Jacobi and 5 more

The IBM System z10™ microprocessor is currently the fastest running 64-bit CISC (complex instruction set computer) microprocessor. This operates at 4.4 GHz and provides up to two times performance improvement compared with its predecessor, z9® In addition ultrahigh-frequency pipeline, offers such enhancements as a sophisticated branch-prediction structure, large second-level private cache, data-prefetch engine, hardwired decimal floating-point arithmetic unit. z10 also implements new...

10.1147/jrd.2009.5388586 article EN IBM Journal of Research and Development 2009-01-01

IBM POWER6 accelerators: VMX and DFU

OPENALEX - Publications

L. Eisen J. W. Ward H.-W. Tast N. Mading J. Leenstra and 5 more

The IBM POWER6™ microprocessor core includes two accelerators for increasing performance of specific workloads. vector multimedia extension (VMX) provides a acceleration graphic and scientific It single instructions that work on multiple data elements. separate 128-bit into different components are operated concurrently. decimal floating-point unit (DFU) commercial workloads, more specifically, financial transactions. new number system performs implicit rounding to radix points, feature...

10.1147/rd.516.0663 article EN IBM Journal of Research and Development 2007-11-01

Putting it all together – Formal verification of the VAMP

OPENALEX - Publications

Sven Beyer Christian Jacobi Daniel Kröning Dirk Leinenbach Wolfgang J. Paul

10.1007/s10009-006-0204-6 article EN International Journal on Software Tools for Technology Transfer 2006-05-01

Data Compression Accelerator on IBM POWER9 and z15 Processors : Industrial Product

OPENALEX - Publications

Bülent Abali B. Blaner John Reilly M. Klein Ashutosh Mishra and 7 more

Lossless data compression is highly desirable in enterprise and cloud environments for storage memory cost savings improved utilization I/O network. While the value provided by recognized, its application practice often limited because it's a processor intensive operation resulting low throughput high elapsed time intense workloads.The IBM POWER9 z15 systems overcome shortcomings of existing approaches including novel on-chip integrated accelerator. The accelerator reduces cycles, traffic,...

10.1109/isca45697.2020.00012 article EN 2020-05-01

A Fully Pipelined Single-Precision Floating-Point Unit in the Synergistic Processor Element of a CELL Processor

OPENALEX - Publications

Hwa-Joon Oh Silvia Melitta Mueller Christian Jacobi B.W. Michael Hiroaki Nishikawa and 5 more

The floating-point unit (FPU) in the synergistic processor element (SPE) of a CELL is fully pipelined 4-way single-instruction multiple-data (SIMD) designed to accelerate media and data streaming with 128-bit operands. It supports 32-bit single-precision 16-bit integer operands two different latencies, six-cycle seven-cycle, 11 FO4 delay per stage. FPU optimizes performance critical multiply-add operations. Since exact rounding, exceptions, de-norm number handling are not important...

10.1109/jssc.2006.870924 article EN IEEE Journal of Solid-State Circuits 2006-04-01

Evaluating Coverage of Error Detection Logic for Soft Errors using Formal Methods

OPENALEX - Publications

Udo Krautz Matthias Pflanz Christian Jacobi H.-W. Tast Kai F. Weber and 1 more

In this paper we describe a methodology to measure exactly the quality of fault-tolerant designs by combining fault-injection in high level design (HLD) descriptions with formal verification approach. We utilize BDD based symbolic simulation determine coverage online error-detection and -correction logic. an easily portable approach, which can be applied wide variety multi-GHz industrial

10.1109/date.2006.244062 article EN 2006-01-01

The Vector Floating-Point Unit in a Synergistic Processor Element of a CELL Processor

OPENALEX - Publications

Silvia Melitta Mueller Christian Jacobi Hwa-Joon Oh Khoa Dac Tran S. Cottier and 7 more

The floating-point unit in the synergistic processor element of 1st generation multi-core CELL is described. FPU supports 4-way SIMD single precision and integer operations 2-way double operations. design required a high-frequency, low latency, power area efficiency with primary application to multimedia streaming workloads, such as 3D graphics. has 3 different latencies, optimizing performance critical FMA operations, which are executed 6-cycle latency at an 11FO4 cycle time. includes...

10.1109/arith.2005.45 article EN 2005-07-27

Using threads in interactive systems

OPENALEX - Publications

Carl Hauser Christian Jacobi Marvin Theimer Brent Welch Mark Weiser

We describe the results of examining two large research and commercial systems for ways that they use threads. used three methods: analysis macroscopic thread statistics, microsecond spacing between events, reading implementation code. identify ten different paradigms usage: defer work, general pumps, slack processes, sleepers, one-shots, deadlock avoidance, rejuvenation, serializers, encapsulated fork exploiting parallelism. While some, like are well known, others have not been previously...

10.1145/168619.168627 article EN 1993-01-01

IBM zEnterprise 196 microprocessor and cache subsystem

OPENALEX - Publications

F.Y. Busaba Mike Blake Brian Curran M. Fee Christian Jacobi and 3 more

The IBM zEnterprise® 196 (z196) system, announced in the second quarter of 2010, is latest generation System z® mainframe. system designed with a new microprocessor and memory subsystems, which distinguishes it from its z10® predecessor. has up to 40% improvement performance for traditional z/OS® workloads carries 60% more capacity when compared z10 subsystem four levels cache hierarchy (L1 through L4) constructs L3 L4 caches embedded DRAM silicon technology, achieves approximately three...

10.1147/jrd.2011.2173962 article EN IBM Journal of Research and Development 2012-01-01

Automatic Formal Verification of Fused-Multiply-Add FPUs

OPENALEX - Publications

Christian Jacobi Kai F. Weber Viresh Paruthi J. Baumgartner

In this paper we describe a fully-automated methodology for formal verification of fused-multiply-add floating point units (FPU). Our verifies an implementation FPU against simple reference model derived from the processor's architectural specification, which may include all aspects IEEE specification including denormal operands and exceptions. strategy uses combination BDD- SAT-based symbolic simulation. To make task tractable, use case-splitting, multiplier isolation, automatic reduction...

10.1109/date.2005.75 article EN Design, Automation, and Test in Europe 2005-04-01

IBM zEC12: The Third-Generation High-Frequency Mainframe Microprocessor

OPENALEX - Publications

C. Kevin Shum F.Y. Busaba Christian Jacobi

The zEnterprise EC12 is the latest generation of IBM'S System Z Enterprise Class mainframe servers. microprocessor operates at an ultra-high frequency 5.5 GHz and incorporates many pipeline-optimization instruction-processing techniques. It also supports innovative instruction set-architecture extensions for future software exploitation to acquire performance gains. this article highlights various factors inside zEC12 achieving best possible computing performance.

10.1109/mm.2013.9 article EN IEEE Micro 2013-02-13

The IBM z13 multithreaded microprocessor

OPENALEX - Publications

Brian Curran Christian Jacobi James Bonanno D. A. Schroter Kevyn Alexander and 2 more

The IBM z13™ system is the latest generation of z Systems™ mainframes. z13 microprocessor improves upon zEnterprise® EC12 (zEC12) processor with two vector execution units, higher instruction parallelism, and a simultaneous multithreaded (SMT) architecture that supports concurrent threads. These advances yield performance gains in legacy online transaction processing business analytics workloads. This features an eight-core chip, robust cache hierarchy, large multiprocessor design optimized...

10.1147/jrd.2015.2418591 article EN IBM Journal of Research and Development 2015-07-01

IBM z14™: 14nm microprocessor for the next-generation mainframe

OPENALEX - Publications

Christopher Berry J. Warnock John Isakson John Badar Brian Bell and 20 more

The IBM Z microprocessor in the z14 system has been redesigned to improve performance, capacity, and security [1] over previous z13 [2]. contains up 24 central processor (CP) 4 controller (SC) chips. Each CP, shown die photo A (Fig. 2.2.7), operates at 5.2GHz is comprised of 10 cores, 2 PCIe Gen3 interfaces, an IO bus (GX), 128MB L3 embedded DRAM (eDRAM) cache, X-BUS interfaces connecting other CP chips one SC chip, a redundant array independent memory (RAIM) interface. core on chip 4MB...

10.1109/isscc.2018.8310171 article EN 2022 IEEE International Solid- State Circuits Conference (ISSCC) 2018-02-01

AI accelerator on IBM Telum processor

OPENALEX - Publications

C. Lichtenau Alper Buyuktosunoglu Ramon Bertran Peter Figuli Christian Jacobi and 5 more

IBM Telum is the next generation processor chip for Z and LinuxONE systems. The design focused on enterprise class workloads it achieves over 40% per socket performance growth compared to z15. first server-class with a dedicated on-chip AI accelerator that enables clients gain real time insights from their data as getting processed.

10.1145/3470496.3533042 article EN 2022-05-31

Experiences creating a portable cedar

OPENALEX - Publications

Russell R. Atkinson Alan Demers Carl Hauser Christian Jacobi Peter B. Kessler and 1 more

Cedar is the name for both a language and an environment in use Computer Science Laboratory at Xerox PARC since 1980. The superset of Mesa, major additions being garbage collection runtime types. Neither nor was originally intended to be portable, many years ran only on D-machines few other locations Xerox. We recently re-implemented make it portable across different architectures. Our strategy was, first, machine-dependent C code as intermediate language, second, create language-independent...

10.1145/73141.74847 article EN Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation 1989-06-21

IBM z14: Processor Characterization and Power Management for High-Reliability Mainframe Systems

OPENALEX - Publications

Christopher Berry David Wolpert Christos Vezrytzis Richard Rizzolo Seán Carey and 13 more

The IBM z14 is the latest update in storied history of mainframes. Reliability, availability, security, and scalability are foundation mainframe line. System reliability availability targets excess 10 years, requiring rigorous chip characterization processes. In this paper, we discuss some many processes used to ensure that lifetime. An additional part power management (PM). 5.2-GHz high-power design central processor requires advanced on-die PM capabilities adapt intensive instruction...

10.1109/jssc.2018.2873582 article EN IEEE Journal of Solid-State Circuits 2018-11-13

Design of the IBM z14 microprocessor

OPENALEX - Publications

Christian Jacobi Anthony Saporito M. Recktenwald A. Tsai U. Mayer and 8 more

The latest-generation IBM Z processor provides enhanced performance and compute capacity compared to its z13 predecessor. This paper describes some of the major improvements that include an additional perceptron branch predictor, a completely redesigned translation engine is tightly integrated into core pipeline, level-1 cache directory lookaside buffer design. Outside central processing unit (CPU), sizes have increased on each level, chip now contains 10 CPUs. system topology has been...

10.1147/jrd.2018.2798718 article EN IBM Journal of Research and Development 2018-01-26

Coming Soon ...