NFDI4DS | UHH-SEMS - Publication Details

Application Acceleration on FPGAs with OmpSs@FPGA

OPENALEX - Publications

Jaume Bosch Xubin Tan Antonio Filgueras Miquel Vidal Marc Mateu and 5 more

OmpSs@FPGA is the flavor of OmpSs that allows offloading application functionality to FPGAs. Similarly OpenMP, it based on compiler directives. While OpenMP specification also includes support for heterogeneous execution, we use and as prototype implementation develop new ideas OpenMP. implements tasking model with runtime automatically exploit all SMP FPGA resources available in execution platform. In this paper, present ecosystem, Mercurium Nanos++ system. We show how applications are...

10.1109/fpt.2018.00021 article EN 2018-12-01

OmpSs@Zynq all-programmable SoC ecosystem

OPENALEX - Publications

Antonio Filgueras Eduard Gil Daniel Jiménez-González Carlos Álvarez Xavier Martorell and 3 more

OmpSs is an OpenMP-like directive-based programming model that includes heterogeneous execution (MIC, GPU, SMP, etc.) and runtime task dependencies management. Indeed, has largely influenced the recently appeared OpenMP 4.0 specification. Zynq All-Programmable SoC combines features of a SMP FPGA benefits DLP, ILP TLP parallelisms in order to efficiently exploit new technology improvements chip resource capacities. In this paper, we focus on programmability support, presenting successful...

10.1145/2554688.2554777 article EN 2014-02-18

OmpSs@FPGA framework for high performance FPGA computing

OPENALEX - Publications

Juan Miguel de Haro Ruiz Jaume Bosch Antonio Filgueras Miquel Vidal Daniel Jiménez-González and 4 more

This article presents the new features of OmpSs@FPGA framework. OmpSs is a data-flow programming model that supports task nesting and dependencies to target asynchronous parallelism heterogeneity. extension addressed specifically FPGAs. environment built on top Mercurium source compiler Nanos++ runtime system. To address FPGA specifics implements several related as local variable caching, wide memory accesses or accelerator replication. In addition, part has been ported hardware. Driven by...

10.1109/tc.2021.3086106 article EN IEEE Transactions on Computers 2021-01-01

The AXIOM Software Layers

OPENALEX - Publications

Carlos Álvarez Eduard Ayguadé Javier Bueno Antonio Filgueras Daniel Jiménez-González and 15 more

People and objects will soon share the same digital network for information exchange in a world named as age of cyber-physical systems. The general expectation is that people systems interact real-time. This poses pressure onto design to support increasing demands on computational power, while keeping low power envelop. Additionally, modular scaling easy programmability are also important ensure these become widespread. whole set expectations impose scientific technological challenges need...

10.1109/dsd.2015.52 article EN 2015-08-01

The AXIOM project (Agile, eXtensible, fast I/O Module)

OPENALEX - Publications

Dimitris Theodoropoulos Dionisios Pnevmatikatos Carlos Álvarez Eduard Ayguadé Javier Bueno and 11 more

The AXIOM project (Agile, eXtensible, fast I/O Module) aims at researching new software/hardware architectures for the future Cyber-Physical Systems (CPSs). These systems are expected to react in real-time, provide enough computational power assigned tasks, consume least possible energy such task (energy efficiency), scale up through modularity, allow an easy programmability across performance scaling, and exploit best existing standards minimal costs.

10.1109/samos.2015.7363684 article EN 2015-07-01

The AXIOM platform for next-generation cyber physical systems

OPENALEX - Publications

Dimitris Theodoropoulos Somnath Mazumdar Eduard Ayguadé Nicola Bettin Javier Bueno and 14 more

10.1016/j.micpro.2017.05.018 article EN Microprocessors and Microsystems 2017-06-03

The AXIOM software layers

OPENALEX - Publications

Carlos Álvarez Eduard Ayguadé Jaume Bosch Javier Bueno Artem Cherkashin and 20 more

People and objects will soon share the same digital network for information exchange in a world named as age of cyber-physical systems. The general expectation is that people systems interact real-time. This poses pressure onto design to support increasing demands on computational power, while keeping low power envelop. Additionally, modular scaling easy programmability are also important ensure these become widespread. whole set expectations impose scientific technological challenges need...

10.1016/j.micpro.2016.07.002 article EN cc-by-nc-nd Microprocessors and Microsystems 2016-07-09

Coarse-Grain Performance Estimator for Heterogeneous Parallel Computing Architectures like Zynq All-Programmable SoC

OPENALEX - Publications

Daniel Jiménez-González Carlos Álvarez Antonio Filgueras Xavier Martorell Jan Langer and 2 more

Heterogeneous computing is emerging as a mandatory requirement for power-efficient system design. With this aim, modern heterogeneous platforms like Zynq All-Programmable SoC, that integrates ARM-based SMP and programmable logic, have been designed. However, those introduce large design cycles consisting on hardware/software partitioning, decisions granularity number of hardware accelerators, integration, bitstream generation, etc. This paper presents performance parallel estimation systems...

10.48550/arxiv.1508.06830 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach

OPENALEX - Publications

Giovanni Agosta Marco Aldinucci Carlos Álvarez R. Ammendola Yasir Arfat and 51 more

10.1016/j.micpro.2022.104679 article EN Microprocessors and Microsystems 2022-09-23

Heterogeneous tasking on SMP/FPGA SoCs: The case of OmpSs and the Zynq

OPENALEX - Publications

Antonio Filgueras Eduard Gil Carlos Álvarez Daniel Jiménez-González Xavier Martorell and 2 more

OmpSs is a directive-based programming model that uses OpenMP-like directives, allow to execute the tasks annotated on both SMPs and as FPGA kernels modern SoC processors, like Xilinx Zynq platform. includes support for accelerators (MIC, GPUs, FPGAs) task dependencies, OpenMP 4.0 will support. In this paper we present our approach of FPGAs SoC, current status implementation, its analysis performance evaluation.

10.1109/vlsi-soc.2013.6673293 article EN 2013-10-01

Exploiting Parallelism on GPUs and FPGAs with OmpSs

OPENALEX - Publications

Jaume Bosch Antonio Filgueras Miquel Vidal Daniel Jiménez-González Carlos Álvarez and 1 more

This paper presents the OmpSs approach to deal with heterogeneous programming on GPU and FPGA accelerators. The model is based Mercurium compiler Nanos++ runtime. Applications are annotated directives specifying task-based parallelism. transforms code exploit parallelism in SMP host cores, also spawn work CUDA/OpenCL devices, For programmer needs only insert annotations provide kernel function be compiled by native compiler. In case of FPGAs, uses High-Level Synthesis tools from vendors...

10.1145/3152821.3152880 article EN 2017-09-09

The AXIOM Project: IoT on Heterogeneous Embedded Platforms

OPENALEX - Publications

Antonio Filgueras Miquel Vidal Marc Mateu Daniel Jiménez-González Carlos Álvarez and 12 more

Editor's notes: IoT constitutes an important area of cyber–physical systems, whose design and programming involve interactions between multiple abstraction layers. This article describes a new node, its hardware architecture, environment, two application scenarios where it may be used. —Samarjit Chakraborty, University North Carolina at Chapel Hill

10.1109/mdat.2019.2952335 article EN IEEE Design and Test 2019-11-11

AXIOM: A Hardware-Software Platform for Cyber Physical Systems

OPENALEX - Publications

Somnath Mazumdar Eduard Ayguadé Nicola Bettin Javier Bueno Sara Ermini and 10 more

Cyber-Physical Systems (CPSs) are widely necessary for many applications that require interactions with the humans and physical environment. A CPS integrates a set of hardware-software components to distribute, execute manage its operations. The AXIOM project (Agile, eXtensible, fast I/O Module) aims at developing platform such i) it can use an easy parallel programming model ii) easily scale-up performance by adding multiple boards (e.g., 1 10 run in parallel). supports task-based based on...

10.1109/dsd.2016.80 article EN 2016-08-01

Breaking master-slave model between host and FPGAs

OPENALEX - Publications

Jaume Bosch Miquel Vidal Antonio Filgueras Carlos Álvarez Daniel Jiménez-González and 2 more

This paper proposes to enhance current task-based programming models by breaking their master-slave approach between the main processor and its hardware accelerators. As a proof-of-concept, it presents an extension of [email protected] toolchain that allows tasks offloaded into FPGA create synchronize nested on own without involving host. Those spawned may target host execute code not suitable for FPGA, like system calls or I/O operations; other kernel accelerators inside same FPGA. In...

10.1145/3332466.3374545 article EN 2020-02-19

The TEXTAROSSA Project: Cool all the Way Down to the Hardware

OPENALEX - Publications

Antonio Filgueras Giovanni Agosta Marco Aldinucci Carlos Álvarez Pasqua D’Ambra and 36 more

10.1109/dsd64264.2024.00076 article EN 2022 25th Euromicro Conference on Digital System Design (DSD) 2024-08-28

Improving Performance of HPC Kernels on FPGAs Using High-Level Resource Management

OPENALEX - Publications

Antonio Filgueras Miquel Vidal Daniel Jiménez-González Carlos Álvarez Xavier Martorell

In state-of-the-art FPGA, especially in chiplet-based devices, place and route has become an important challenge due to increase device size complexity. the same way, off-chip memory resources have grown number of modules. Making efficient use them a difficult task.

10.1109/fccm57271.2023.00041 article EN 2023-05-01

TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale

OPENALEX - Publications

Giovanni Agosta Daniele Cattaneo William Fornaciari Andrea Galimberti Giuseppe Massari and 46 more

To achieve high performance and energy efficiency on near-future exascale computing systems, three key technology gaps needs to be bridged. These include: thermal control; extreme computation via HW acceleration new arithmetics; methods tools for seamless integration of reconfigurable accelerators in heterogeneous HPC multi-node platforms. TEXTAROSSA aims at tackling this gap through a co-design approach solutions, supported by the extension SW IPs, programming models derived from European research.

10.1109/dsd53832.2021.00051 article EN 2022 25th Euromicro Conference on Digital System Design (DSD) 2021-09-01

High Performance Computing PP-Distance Algorithms to Generate X-ray Spectra from 3D Models

OPENALEX - Publications

César González Simone Balocco Jaume Bosch Juan Miguel de Haro Ruiz Maurizio Paolini and 3 more

X-ray crystallography is a powerful method that has significantly contributed to our understanding of the biological function proteins and other molecules. This relies on production crystals that, however, are usually bottleneck in process. For some molecules, no crystallization been achieved or insufficient were obtained. Some systems do not crystallize at all, such as nanoparticles which, because their dimensions, cannot be treated by usual crystallographic methods. To solve this, whole...

10.3390/ijms231911408 article EN International Journal of Molecular Sciences 2022-09-27

FPGA Framework Improvements for HPC Applications

OPENALEX - Publications

Antonio Filgueras Miquel Vidal Daniel Jiménez-González Carlos Álvarez Xavier Martorell

In modern FPGA devices, place and route has become an increasingly difficult task due to increase in resources device complexity. This results exponential of implementation possibilities. Such a huge search space causes tools have hard time providing good solution. is even more challenging chiplet-based devices their topology. the same way, off-chip memory grown both size number modules. These are presented user as raw interfaces requiring manage how accelerator kernels access make effective...

10.1109/icfpt59805.2023.00048 article EN 2023-12-12