NFDI4DS | UHH-SEMS - Publication Details

Enabling autonomic behavior in systems software with hot swapping

OPENALEX - Publications

Jonathan Appavoo Kevin Hui Craig A. N. Soules Robert W. Wisniewski Dárica Machado da Silva and 10 more

Autonomic computing systems are designed to be self-diagnosing and self-healing, such that they detect performance correctness problems, identify their causes, apply the appropriate remedy. These abilities can improve performance, uptime, security, while simultaneously reducing effort skills required of system administrators. One way support these is by allowing monitoring code, diagnostic function implementations dynamically inserted removed in live systems. This "hot swapping" avoids...

10.1147/sj.421.0060 article EN IBM Systems Journal 2003-01-01

On the benefits and pitfalls of extending a statically typed language JIT compiler for dynamic scripting languages

OPENALEX - Publications

José G. Castaños David Edelsohn Kazuaki Ishizaki Priya Nagpurkar Toshio Nakatani and 2 more

Whenever the need to compile a new dynamically typed language arises, an appealing option is repurpose existing statically Just-In-Time (JIT) compiler (repurposed JIT compiler). Existing repurposed compilers (RJIT compilers), however, have not yet delivered hoped-for performance boosts. The of JVM languages, for instance, often lags behind standard interpreter implementations. Even more customized solutions that extend internals target compete poorly with those designed specifically...

10.1145/2384616.2384631 article EN 2012-10-19

Bandwidth Optimized Parallel Algorithms for Sparse Matrix-Matrix Multiplication using Propagation Blocking

OPENALEX - Publications

Zhixiang Gu José E. Moreira David Edelsohn Ariful Azad

Sparse matrix-matrix multiplication (SpGEMM) is a widely used kernel in various graph, scientific computing and machine learning algorithms. It well known that SpGEMM memory-bound operation, its peak performance expected to be bound by the memory bandwidth. Yet, existing algorithms fail saturate bandwidth, resulting suboptimal under Roofline model. In this paper, we characterize based on their access patterns develop practical lower upper bounds for performance. We then an algorithm outer...

10.1145/3350755.3400216 preprint EN 2020-07-06

Adding dynamically-typed language support to a statically-typed language compiler

OPENALEX - Publications

Kazuaki Ishizaki Takeshi Ogasawara José G. Castaños Priya Nagpurkar David Edelsohn and 1 more

Applications written in dynamically typed scripting languages are increasingly popular for Web software development. Even on the server side, programmers using such as Ruby and Python to build complex applications quickly. As number complexity of language grows, optimizing their performance is becoming important. Some best performing compilers optimizers developed entirely from scratch target a specific language. This approach not scalable, given variety languages, effort involved developing...

10.1145/2151024.2151047 article EN 2012-03-03

Corrugations in galactic discs generated by Magellanic-type perturbers

OPENALEX - Publications

David Edelsohn Bruce G. Elmegreen

The small perpendicular distortions in a large disc galaxy, such as the Milky Way, that are caused by an orbiting intermediate-mass companion Large Magellanic Cloud (LMC) have been modelled with parallel computer implementation of three-dimensinal N-body particle treecode. model demonstrates mass fraction 7.5 per cent Galaxy and orbital inclination 45° can generate height velocity perturbations inner primary galaxy order several hundred pc ~10 km −1, respectively, relative to unperturbed...

10.1093/mnras/287.4.947 article EN Monthly Notices of the Royal Astronomical Society 1997-06-01

Computer models of the Sagittarius dwarf interaction with the Milky Way

OPENALEX - Publications

David Edelsohn Bruce G. Elmegreen

The interaction between the dwarf galaxy in Sagittarius and Milky Way Galaxy has been modelled with a parallel computer implementation of an N-body treecode. Models are made that reproduce observed position, size, velocity, proper motion velocity gradient its likely pre-disc encounter, other models studied which just passed through disc. Several observable differences these cases found. In pre-collision case, is bound to it disc previously 1.7 × 108 yr ago anticentre direction. It will cross...

10.1093/mnras/290.1.7 article EN Monthly Notices of the Royal Astronomical Society 1997-09-01

IBM POWER9 system software

OPENALEX - Publications

Joefon Jann Paul Mackerras J. M. Ludden Michael Gschwind W. Ouren and 3 more

The IBM POWER9 architecture offers a substantial set of novel and performance-improvement features that are made available to both scale-up scale-out applications via system software. These provide significant performance improvements for cognitive, cloud, virtualization workloads, many which use dynamic scripting languages. In this paper, we describe some the key features.

10.1147/jrd.2018.2846959 article EN IBM Journal of Research and Development 2018-06-22

A matrix math facility for Power ISA(TM) processors

OPENALEX - Publications

José E. Moreira Kit Barton Steven Battle Peter Bergner Ramon Bertran and 20 more

Power ISA(TM) Version 3.1 has introduced a new family of matrix math instructions, collectively known as the Matrix-Multiply Assist (MMA) facility. The instructions in this facility implement numerical linear algebra operations on small matrices and are meant to accelerate computation-intensive kernels, such multiplication, convolution discrete Fourier transform. These have led power- area-efficient implementation high throughput engine future POWER10 processor. Performance per core is 4...

10.48550/arxiv.2104.03142 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Supporting hot-swappable components for system software

OPENALEX - Publications

Kim Chang Hui Jonathan Appavoo Robert W. Wisniewski Marc Auslander David Edelsohn and 4 more

Summary form only given. A hot-swappable component is one that can be replaced with a new or different implementation while the system running and actively using component. For example, of TCP/IP protocol stack, when hot-swappable, (perhaps to handle denial-of-service attacks improve performance), without disturbing existing network connections. The capability swap components offers number potential advantages such as: online upgrades for high availability systems, improved performance due...

10.1109/hotos.2001.990086 article EN 2005-08-24

Adding dynamically-typed language support to a statically-typed language compiler

OPENALEX - Publications

Kazuaki Ishizaki Takeshi Ogasawara José G. Castaños Priya Nagpurkar David Edelsohn and 1 more

Applications written in dynamically typed scripting languages are increasingly popular for Web software development. Even on the server side, programmers using such as Ruby and Python to build complex applications quickly. As number complexity of language grows, optimizing their performance is becoming important. Some best performing compilers optimizers developed entirely from scratch target a specific language. This approach not scalable, given variety languages, effort involved developing...

10.1145/2365864.2151047 article EN ACM SIGPLAN Notices 2012-03-03

On the benefits and pitfalls of extending a statically typed language JIT compiler for dynamic scripting languages

OPENALEX - Publications

José G. Castaños David Edelsohn Kazuaki Ishizaki Priya Nagpurkar Toshio Nakatani and 2 more

Whenever the need to compile a new dynamically typed language arises, an appealing option is repurpose existing statically Just-In-Time (JIT) compiler (repurposed JIT compiler). Existing repurposed compilers (RJIT compilers), however, have not yet delivered hoped-for performance boosts. The of JVM languages, for instance, often lags behind standard interpreter implementations. Even more customized solutions that extend internals target compete poorly with those designed specifically...

10.1145/2398857.2384631 article EN ACM SIGPLAN Notices 2012-10-19

IRAS LRS Spectra of Comets Tempel 1 and Tempel 2

OPENALEX - Publications

D. K. Lynch J. A. Hackwell David Edelsohn F. Lahuis Pjotr R. Roelfsema and 3 more

10.1006/icar.1995.1054 article EN Icarus 1995-03-01

Contributions to the GNU Compiler Collection

OPENALEX - Publications

David Edelsohn Wolfgang Gellerich Mostafa Hagog Dorit Naishlos Mircea Namolaru and 4 more

The GCC (GNU Compiler Collection) project of the Free Software Foundation has resulted in one most widespread compilers use today that is capable generating code for a variety platforms. Since 1987, many volunteers from academia and private sector have been working to continuously improve functionality quality GCC. Some compiler's key components were, continue be, developed at IBM Research laboratories. We review several IBM's contributions compiler, including generator zSeries® processor...

10.1147/sj.442.0259 article EN IBM Systems Journal 2005-01-01

HIERARCHICAL TREE-STRUCTURES AS ADAPTIVE MESHES

OPENALEX - Publications

David Edelsohn

New adaptive mesh refinement algorithms provide an opportunity to utilize the same hierarchical tree-structures developed for multipole-based particle simulations in grid-based of both continuum and problems. Representing a multipole method simulation with this structure provides natural formalism which unite these two classes solvers. This paper discusses how methods exploit basic principle locality evident many systems, such as those governed by Poisson's Equation, introduces issues...

10.1142/s0129183193000707 article EN International Journal of Modern Physics C 1993-10-01

FlexSEE

OPENALEX - Publications

José E. Moreira Jessica H. Tseng Manoj Kumar Eknath Ekanadham Joefon Jann and 4 more

In this paper, we present a comprehensive security architecture, Flexible Secure Execution Environment (FlexSEE), for confidential computing in modern cloud environments. FlexSEE does not require the trust of system software on compute server and guarantees that user data is visible only non-privileged mode to designated program trusted by owner hardware, thus protecting from an untrusted hypervisor, OS, or other users' applications, server.

10.1145/3587135.3592170 article EN 2023-05-09

A generalized expression optimization hook for C++ on a high-performance architectures

OPENALEX - Publications

David Edelsohn

C++ has gained broad acceptance as an object-oriented evolutionary extension to the C language, but it severely constrains methods for operating on class objects by forcing all data manipulation through interface which assumes that basic operations can be implemented they are written: unary or binary operators. allows great flexibility in creation of complex structures perform same functionality built-in types many other languages unfortunately does not allow equivalent level feasibility so...

10.1109/shpcc.1994.296668 article EN 2002-12-17

Bandwidth-Optimized Parallel Algorithms for Sparse Matrix-Matrix Multiplication using Propagation Blocking

OPENALEX - Publications

Zhixiang Gu José E. Moreira David Edelsohn Ariful Azad

Sparse matrix-matrix multiplication (SpGEMM) is a widely used kernel in various graph, scientific computing and machine learning algorithms. It well known that SpGEMM memory-bound operation, its peak performance expected to be bound by the memory bandwidth. Yet, existing algorithms fail saturate bandwidth, resulting suboptimal under Roofline model. In this paper we characterize based on their access patterns develop practical lower upper bounds for performance. We then an algorithm outer...

10.48550/arxiv.2002.11302 preprint EN other-oa arXiv (Cornell University) 2020-01-01