Alexandros Nikolaos Ziogas

ORCID: 0000-0002-4328-9751
Research Areas
  • Parallel Computing and Optimization Techniques
  • Advancements in Semiconductor Devices and Circuit Design
  • Semiconductor materials and devices
  • Ferroelectric and Negative Capacitance Devices
  • Advanced Neural Network Applications
  • Low-power high-performance VLSI design
  • Advanced Memory and Neural Computing
  • Advanced Data Storage Technologies
  • Cloud Computing and Resource Management
  • Computational Physics and Python Applications
  • Matrix Theory and Algorithms
  • Graph Theory and Algorithms
  • Distributed and Parallel Computing Systems
  • Tensor decomposition and applications
  • Numerical Methods and Algorithms
  • Neuroscience and Neural Engineering
  • Software Testing and Debugging Techniques
  • Machine Learning in Materials Science
  • Stochastic Gradient Optimization Techniques
  • Flood Risk Assessment and Management
  • Hydrology and Watershed Management Studies
  • Adversarial Robustness in Machine Learning
  • Advanced Graph Neural Networks
  • Quantum Computing Algorithms and Architecture
  • Radiation Effects in Electronics

ETH Zurich
2019-2025

The ubiquity of accelerators in high-performance computing has driven programming complexity beyond the skill set of the average domain scientist. To maintain performance portability in the future, it is imperative to decouple architecture-specific programming paradigms from the underlying scientific computations. We present the Stateful DataFlow multiGraph (SDFG), a data-centric intermediate representation that enables separating program definition from its optimization. By combining fine-grained data dependencies with...

10.1145/3295500.3356173 article EN 2019-11-07
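The separation the abstract describes (program definition vs. optimization) can be illustrated with a toy, hypothetical dataflow representation; this is a sketch of the idea only, not the actual SDFG/DaCe API. The program is captured as a graph of maps with explicit data dependencies, and an optimization pass (map fusion) rewrites the graph without touching the program's definition.

```python
# Toy data-centric IR sketch (hypothetical; not the real SDFG/DaCe API).
# A program is a list of "map" nodes with explicit data dependencies;
# an optimization pass fuses producer/consumer maps sharing an array.
from dataclasses import dataclass

@dataclass
class MapNode:
    name: str
    reads: set
    writes: set
    fn: callable  # element-wise body

def fuse(nodes):
    """Fuse adjacent maps when the first's output feeds the second."""
    fused, i = [], 0
    while i < len(nodes):
        a = nodes[i]
        if i + 1 < len(nodes) and a.writes & nodes[i + 1].reads:
            b = nodes[i + 1]  # producer/consumer via a transient array
            fused.append(MapNode(
                name=f"{a.name}+{b.name}",
                reads=a.reads | (b.reads - a.writes),
                writes=b.writes,
                fn=lambda x, f=a.fn, g=b.fn: g(f(x)),
            ))
            i += 2
        else:
            fused.append(a)
            i += 1
    return fused

# Program definition: y = (x * 2) + 1, written as two maps over array "t".
prog = [
    MapNode("scale", {"x"}, {"t"}, lambda v: v * 2),
    MapNode("shift", {"t"}, {"y"}, lambda v: v + 1),
]
opt = fuse(prog)         # the optimization changes the graph, not the program
result = [opt[0].fn(v) for v in [1, 2, 3]]
print(len(opt), result)  # one fused map; y = [3, 5, 7]
```

The point mirrors the abstract: the same program definition (`prog`) admits different schedules, and transformations operate purely on the data-movement graph.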

Biological neural networks do not only include long-term memory and weight multiplication capabilities, as commonly assumed in artificial networks, but also more complex functions such as short-term memory, plasticity, and meta-plasticity, all collocated within each synapse. Here, we demonstrate memristive nano-devices based on SrTiO3 that inherently emulate these synaptic functions. These memristors operate in a non-filamentary, low-conductance regime, which enables stable and energy-efficient...

10.1038/s41467-024-51093-3 article EN cc-by-nc-nd Nature Communications 2024-08-13

We introduce Deep500: the first customizable benchmarking infrastructure that enables fair comparison of a plethora of deep learning frameworks, algorithms, libraries, and techniques. The key idea behind Deep500 is its modular design, where deep learning is factorized into four distinct levels: operators, network processing, training, and distributed training. Our evaluation illustrates that Deep500 is customizable (enables combining different codes) and fair (uses carefully selected metrics). Moreover, it is fast (incurs negligible overheads) and verifiable...

10.1109/ipdps.2019.00018 article EN 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2019-05-01
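The modular, metrics-driven idea behind such benchmarking infrastructure can be sketched in a few lines; the `Operator` class and `benchmark` function below are hypothetical illustrations, not Deep500's actual interfaces.

```python
# Minimal sketch of level-based benchmarking (hypothetical interfaces,
# not Deep500's actual classes): components under test are pluggable,
# and the harness reports a carefully chosen metric for fairness.
import time
import statistics

class Operator:
    """Lowest-level component: a single compute kernel under test."""
    def __init__(self, name, fn):
        self.name, self.fn = name, fn

def benchmark(op, data, repeats=5):
    """Time repeated runs and report the median wall time."""
    times = []
    for _ in range(repeats):
        start = time.perf_counter()
        op.fn(data)
        times.append(time.perf_counter() - start)
    return statistics.median(times)  # median resists warm-up outliers

square = Operator("square", lambda xs: [x * x for x in xs])
t = benchmark(square, list(range(10_000)))
print(f"{square.name}: {t:.6f} s (median of 5 runs)")
```

Because any object with the same interface can be swapped in, different frameworks' implementations of the same operator can be compared under identical measurement rules.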

On 18 September 2020, the Karditsa prefecture of the Thessaly region (Greece) experienced a catastrophic flood as a consequence of the IANOS hurricane. This intense phenomenon was characterized by rainfall records ranging from 220 mm up to 530 mm in a time interval of 15 h. Extended public infrastructure was damaged and thousands of houses and commercial properties were flooded, while four casualties were recorded. The aim of this study is to provide forensic research on the reconstruction of the event in the vicinity of the city. First, we performed...

10.3390/hydrology9050093 article EN cc-by Hydrology 2022-05-20

The computational efficiency of a state-of-the-art ab initio quantum transport (QT) solver, capable of revealing the coupled electro-thermal properties of atomically resolved nano-transistors, has been improved by up to two orders of magnitude through a data-centric reorganization of the application. This approach yields coarse- and fine-grained data-movement characteristics that can be used for performance and communication modeling, communication avoidance, and dataflow transformations. The resulting code has been tuned for top-6 hybrid...

10.1145/3295500.3357156 preprint EN 2019-11-07

Graph neural networks (GNNs) have shown promise in learning the ground-state electronic properties of materials, subverting ab initio density functional theory (DFT) calculations when the underlying lattices can be represented as small and/or repeatable unit cells (i.e., molecules and periodic crystals). Realistic systems are, however, non-ideal and generally characterized by higher structural complexity. As such, they require large (10+ Angstrom) cells with thousands of atoms to be accurately described. At these...

10.48550/arxiv.2501.19110 preprint EN arXiv (Cornell University) 2025-01-31
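The core operation such GNN surrogates perform in place of a full DFT calculation is message passing over the atomic graph. Below is a deliberately simple mean-aggregation step for illustration only; it is an assumed textbook-style update, not the architecture from this paper.

```python
# Toy message-passing step (illustrative only; not the paper's model):
# each node's feature vector is updated from the mean of its neighbors,
# the basic building block of GNN surrogates for electronic structure.

def message_pass(features, edges):
    """One round of mean-neighbor aggregation on an undirected graph."""
    n = len(features)
    neighbors = [[] for _ in range(n)]
    for u, v in edges:
        neighbors[u].append(v)
        neighbors[v].append(u)
    updated = []
    for i in range(n):
        if neighbors[i]:
            agg = [sum(features[j][k] for j in neighbors[i]) / len(neighbors[i])
                   for k in range(len(features[i]))]
        else:
            agg = features[i]
        # simple residual-style mix of self and neighborhood features
        updated.append([(a + b) / 2 for a, b in zip(features[i], agg)])
    return updated

feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
out = message_pass(feats, [(0, 1), (1, 2)])
print(out[1])  # node 1 mixes its own features with the mean of nodes 0 and 2
```

The scaling challenge in the abstract follows directly: with thousands of atoms per cell, every such round touches the whole graph, so memory and communication, not arithmetic, dominate.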

Python, already one of the most popular languages for scientific computing, has made significant inroads in High Performance Computing (HPC). At the center of Python's ecosystem is NumPy, an efficient implementation of the multi-dimensional array (tensor) structure, together with basic arithmetic and linear algebra. Compared to traditional HPC languages, the relatively low performance of Python and NumPy has spawned research into compilers and frameworks that decouple the compact representation from the underlying implementation...

10.1145/3447818.3460360 article EN 2021-06-03
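The "compact representation" the abstract refers to is visible in a tiny example: one NumPy slicing expression replaces an explicit loop nest, leaving the implementation free to be optimized, or recompiled by an HPC framework, behind the array API. A minimal sketch:

```python
# A 3-point moving average written two ways: an explicit Python loop
# and the compact NumPy form. Same result; the vectorized version is
# what data-centric compilers can map to fast architecture-specific code.
import numpy as np

def stencil_loop(a):
    out = np.empty(len(a) - 2)
    for i in range(1, len(a) - 1):
        out[i - 1] = (a[i - 1] + a[i] + a[i + 1]) / 3.0
    return out

def stencil_vectorized(a):
    return (a[:-2] + a[1:-1] + a[2:]) / 3.0  # same computation, no loop

a = np.arange(10, dtype=np.float64)
assert np.allclose(stencil_loop(a), stencil_vectorized(a))
print(stencil_vectorized(a)[:3])  # [1. 2. 3.]
```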

Python has become the de facto language for scientific computing. Programming in Python is highly productive, mainly due to its rich science-oriented software ecosystem built around the NumPy module. As a result, the demand for Python support in High Performance Computing (HPC) has skyrocketed. However, the Python language itself does not necessarily offer high performance. In this work, we present a workflow that retains Python's productivity while achieving portable performance across different architectures. The workflow's key features are...

10.1145/3458817.3476176 article EN 2021-10-21

Determining I/O lower bounds is a crucial step in obtaining communication-efficient parallel algorithms, both across the memory hierarchy and between processors. Current approaches either study specific algorithms individually, disallow programmatic motifs such as recomputation, or produce asymptotic bounds that exclude important constants. We propose a novel approach for precise I/O lower bounds on a general class of programs, which we call Simple Overlap Access Programs (SOAP). SOAP analysis covers a wide variety of...

10.1145/3409964.3461796 preprint EN 2021-06-30

Matrix factorizations are among the most important building blocks of scientific computing. State-of-the-art libraries, however, are not communication-optimal, underutilizing current parallel architectures. We present novel algorithms for Cholesky and LU factorization that utilize an asymptotically communication-optimal 2.5D decomposition. We first establish a theoretical framework for deriving I/O lower bounds for linear algebra kernels, and then use its insights to derive the schedules, both communicating N^3/(P*sqrt(M)) elements...

10.1145/3458817.3476167 preprint EN 2021-10-21
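To make the N^3/(P*sqrt(M)) figure concrete, here is a back-of-the-envelope evaluation of that per-processor communication volume against the roughly N^2/sqrt(P) words moved by a classic 2D schedule. The parameter values are illustrative assumptions, not benchmark results from the paper.

```python
# Back-of-the-envelope evaluation of the abstract's communication bound,
# N^3 / (P * sqrt(M)) words per processor, against a classic 2D baseline
# of roughly N^2 / sqrt(P) (illustrative numbers, not measured results).
import math

N = 2**15   # matrix dimension
P = 1024    # number of processors
M = 2**26   # words of fast memory per processor

bound_25d = N**3 / (P * math.sqrt(M))
baseline_2d = N**2 / math.sqrt(P)

print(f"2.5D-style bound: {bound_25d:.3e} words/proc")
print(f"2D baseline:      {baseline_2d:.3e} words/proc")
print(f"reduction factor: {baseline_2d / bound_25d:.1f}x")  # 8.0x here
```

The comparison shows why extra memory helps: the bound shrinks with sqrt(M), so replicating data across spare memory (the "2.5D" idea) trades capacity for communication.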

The ubiquity of accelerators in high-performance computing has driven programming complexity beyond the skill set of the average domain scientist. To maintain performance portability in the future, it is imperative to decouple architecture-specific programming paradigms from the underlying scientific computations. We present the Stateful DataFlow multiGraph (SDFG), a data-centric intermediate representation that enables separating program definition from its optimization. By combining fine-grained data dependencies with...

10.48550/arxiv.1902.10345 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Dense linear algebra kernels are fundamental components of many scientific computing applications. In this work we present a novel method for deriving parallel I/O lower bounds for a broad family of programs. Based on the X-Partitioning abstraction, our method explicitly captures inter-statement dependencies. Applying our analysis to LU factorization, we derive COnfLUX, an LU algorithm with the parallel I/O cost of N^3/(P*sqrt(M)) communicated elements per processor, only 1/3x over the established lower bound. We evaluate COnfLUX on various problem...

10.1145/3437801.3441590 article EN 2021-02-17

We propose a novel approach to iterated sparse matrix-dense matrix multiplication, a fundamental computational kernel in scientific computing and graph neural network training. In cases where matrix sizes exceed the memory of a single compute node, data transfer becomes a bottleneck. An approach based on dense matrix multiplication algorithms leads to sub-optimal scalability and fails to exploit the sparsity of the problem. To address these challenges, we propose decomposing the sparse matrix into a small number of highly structured matrices called arrow matrices, which are connected...

10.1145/3627535.3638496 preprint EN 2024-02-20
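The "arrow" structure can be sketched quickly. As an assumption for illustration (the paper's exact definition may differ), take nonzeros confined to the first block-row, the first block-column, and the diagonal; a sparse matrix-dense matrix product then only touches those entries.

```python
# Sketch of an "arrow" sparsity pattern (an illustrative assumption:
# nonzeros in the first b rows, first b columns, and the diagonal)
# and a sparse matrix-dense matrix product that exploits it.

def arrow_pattern(n, b):
    """Index pairs (i, j) of an n x n arrow matrix with arm width b."""
    return {(i, j) for i in range(n) for j in range(n)
            if i < b or j < b or i == j}

def spmm(nnz, values, dense):
    """Multiply a sparse matrix (dict of (i, j) -> value) by a dense one."""
    n, m = len(dense), len(dense[0])
    out = [[0.0] * m for _ in range(n)]
    for (i, j) in nnz:
        v = values[(i, j)]
        for k in range(m):
            out[i][k] += v * dense[j][k]
    return out

n, b = 6, 2
pattern = arrow_pattern(n, b)
values = {ij: 1.0 for ij in pattern}
identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
result = spmm(pattern, values, identity)
print(len(pattern), result[5])  # 24 nonzeros; row 5 touches cols 0, 1, 5
```

Because each processor only needs the dense rows matching an arm or the diagonal, the structure bounds how much of the dense matrix must be communicated per step.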

Biological neural networks do not only include long-term memory and weight multiplication capabilities, as commonly assumed in artificial networks, but also more complex functions such as short-term memory, plasticity, and meta-plasticity, all collocated within each synapse. Here, we demonstrate memristive nano-devices based on SrTiO3 that inherently emulate these synaptic functions. These memristors operate in a non-filamentary, low-conductance regime, which enables stable and energy-efficient...

10.48550/arxiv.2402.16628 preprint EN arXiv (Cornell University) 2024-02-26

The continuous scaling of metal-oxide-semiconductor field-effect transistors (MOSFETs) has led to device geometries where charged carriers are increasingly confined to ever smaller channel cross sections. This development is associated with reduced screening of long-range Coulomb interactions. To accurately predict the behavior of such ultra-scaled devices, electron-electron (e-e) interactions must be explicitly incorporated in their quantum transport simulation. In this paper, we present an...

10.48550/arxiv.2412.12986 preprint EN arXiv (Cornell University) 2024-12-17

Flood modelling is among the most challenging scientific tasks because it covers a wide area of complex physical phenomena associated with highly uncertain and non-linear processes, where the development of physically interpretive solutions usually suffers from a lack of recorded data [...]

10.3390/hydrology10050112 article EN cc-by Hydrology 2023-05-15

Graph attention models (A-GNNs), a type of Graph Neural Network (GNN), have been shown to be more powerful than simpler convolutional GNNs (C-GNNs). However, A-GNNs are more complex to program and difficult to scale. To address this, we develop a novel mathematical formulation, based on tensors that group all the feature vectors, targeting both training and inference of A-GNNs. The formulation enables straightforward adoption of communication-minimizing routines, and it fosters optimizations such as vectorization,...

10.1145/3581784.3607067 article EN 2023-10-30
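The step that makes A-GNNs costlier than C-GNNs is computing a per-edge attention weight via a softmax over each node's neighborhood. A minimal sketch of that step (illustrative only; not the paper's tensor formulation):

```python
# Toy graph-attention weight computation: a numerically stable softmax
# over one node's incoming edge scores. In an A-GNN this runs for every
# node's neighborhood, which is what complicates scaling vs. C-GNNs.
import math

def attention_weights(scores_per_neighbor):
    """Softmax over one node's incoming edge scores."""
    m = max(scores_per_neighbor)           # subtract max for stability
    exps = [math.exp(s - m) for s in scores_per_neighbor]
    total = sum(exps)
    return [e / total for e in exps]

w = attention_weights([1.0, 1.0, 1.0])
print(w)  # uniform attention over three identical neighbors
```

Grouping all such per-neighborhood softmaxes into dense tensor operations, as the abstract describes, is what enables vectorization and communication-minimizing schedules.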