NFDI4DS | UHH-SEMS - Publication Details

Critical review of on-board capacity estimation techniques for lithium-ion batteries in electric and hybrid electric vehicles

OPENALEX - Publications

Alexander Farmann Wladislaw Waag Andrea Marongiu Dirk Uwe Sauer

10.1016/j.jpowsour.2015.01.129 article EN Journal of Power Sources 2015-01-22

Differential voltage analysis as a tool for analyzing inhomogeneous aging: A case study for LiFePO4|Graphite cylindrical cells

OPENALEX - Publications

Meinert Lewerenz Andrea Marongiu Alexander Warnecke Dirk Uwe Sauer

10.1016/j.jpowsour.2017.09.059 article EN Journal of Power Sources 2017-09-29

Influence of the vehicle-to-grid strategy on the aging behavior of lithium battery electric vehicles

OPENALEX - Publications

Andrea Marongiu Marco Roscher Dirk Uwe Sauer

10.1016/j.apenergy.2014.06.063 article EN Applied Energy 2014-07-16

PULP: A parallel ultra low power platform for next generation IoT applications

OPENALEX - Publications

Davide Rossi Francesco Conti Andrea Marongiu Antonio Pullini Igor Loi and 5 more

This article consists of a collection slides from the authors' conference presentation.

10.1109/hotchips.2015.7477325 article EN 2015-08-01

A transprecision floating-point platform for ultra-low power computing

OPENALEX - Publications

Giuseppe Tagliavini Stefan Mach Davide Rossi Andrea Marongiu Luca Benini

In modern low-power embedded platforms, the execution of floating-point (FP) operations emerges as a major contributor to energy consumption compute-intensive applications with large dynamic range. Experimental evidence shows that 50% consumed by core and its data memory is related FP computations. The adoption formats requiring lower number bits an interesting opportunity reduce consumption, since it allows simplify arithmetic circuitry bandwidth required transfer between registers enabling...

10.23919/date.2018.8342167 article EN Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 2018-03-01

A critical overview of definitions and determination techniques of the internal resistance using lithium-ion, lead-acid, nickel metal-hydride batteries and electrochemical double-layer capacitors as examples

OPENALEX - Publications

Grzegorz Piłatowicz Andrea Marongiu Julia Drillkens Philipp Sinhuber Dirk Uwe Sauer

10.1016/j.jpowsour.2015.07.073 article EN Journal of Power Sources 2015-07-31

Comprehensive study of the influence of aging on the hysteresis behavior of a lithium iron phosphate cathode-based lithium ion battery – An experimental investigation of the hysteresis

OPENALEX - Publications

Andrea Marongiu Felix Gerd Wilhelm Nußbaum Wladislaw Waag Maitane Garmendia Dirk Uwe Sauer

10.1016/j.apenergy.2016.02.086 article EN Applied Energy 2016-03-28

Dissecting the CUDA scheduling hierarchy: a Performance and Predictability Perspective

OPENALEX - Publications

Ignacio Sanudo Olmedo Nicola Capodieci Jorge Martínez Andrea Marongiu Marko Bertogna

Over the last few years, ever-increasing use of Graphic Processing Units (GPUs) in safety-related domains has opened up many research problems real-time community. The closed and proprietary nature scheduling mechanisms deployed NVIDIA GPUs, for instance, represents a major obstacle deriving proper schedulability analysis latency-sensitive applications. Existing literature addresses these issues by either (i) providing simplified models heterogeneous CPUGPU systems their associated policies,...

10.1109/rtas48715.2020.000-5 article EN 2020-04-01

On-board capacity estimation of lithium iron phosphate batteries by means of half-cell curves

OPENALEX - Publications

Andrea Marongiu Nsombo Nlandi Yao Rong Dirk Uwe Sauer

10.1016/j.jpowsour.2016.05.041 article EN Journal of Power Sources 2016-05-26

An OpenMP Compiler for Efficient Use of Distributed Scratchpad Memory in MPSoCs

OPENALEX - Publications

Andrea Marongiu Luca Benini

Most of today's state-of-the-art processors for mobile and embedded systems feature on-chip scratchpad memories. To efficiently exploit the advantages low-latency high-bandwidth memory modules in hierarchy, there is need programming models and/or language features that expose such architectural details. On other hand, effectively exploiting limited space requires programmer to devise an efficient partitioning distributed placement shared data at application level. In this paper, we propose a...

10.1109/tc.2010.199 article EN IEEE Transactions on Computers 2010-10-19

Timing characterization of OpenMP4 tasking model

OPENALEX - Publications

María A. Serrano Alessandra Melani Roberto Vargas Andrea Marongiu Marko Bertogna and 1 more

OpenMP is increasingly being supported by the newest high-end embedded many-core processors. Despite lack of any notion real-time execution, latest specification (v4.0) introduces a tasking model that resembles way applications are modeled and designed, i.e., as set periodic task graphs. This makes OpenMP4 convenient candidate to be adopted in future systems. However, incorporates well features guarantee backward compatibility with previous versions limit its practical usability The most...

10.1109/cases.2015.7324556 article EN 2015-10-01

Timing characterization of OpenMP4 tasking model

OPENALEX - Publications

María A. Serrano Alessandra Melani Roberto Vargas Andrea Marongiu Marko Bertogna and 1 more

OpenMP is increasingly being supported by the newest high-end embedded many-core processors. Despite lack of any notion real-time execution, latest specification (v4.0) introduces a tasking model that resembles way applications are modeled and designed, i.e., as set periodic task graphs. This makes OpenMP4 convenient candidate to be adopted in future systems. However, incorporates well features guarantee backward compatibility with previous versions limit its practical usability The most...

10.5555/2830689.2830709 article EN Compilers, Architecture, and Synthesis for Embedded Systems 2015-10-04

Calendar aging of lithium-ion cells with high‑nickel cathodes: On the influence of storage methods

OPENALEX - Publications

Timo Rüwald Andrea Marongiu Dominik Schulte Dirk Uwe Sauer

10.1016/j.est.2025.116412 article EN Journal of Energy Storage 2025-04-09

VirtualSoC: A Full-System Simulation Environment for Massively Parallel Heterogeneous System-on-Chip

OPENALEX - Publications

Daniele Bortolotti Christian Pinto Andrea Marongiu Martino Ruggiero Luca Benini

Driven by flexibility, performance and cost constraints of demanding modern applications, heterogeneous System-on-Chip (SoC) is the dominant design paradigm in embedded system computing domain. SoC architecture heterogeneity clearly provide a wider power/performance scaling, combining high power efficient general-purpose cores along with massively parallel many-core-based accelerators. Besides complex hardware, generally these kinds platforms host also an advanced software ecosystem,...

10.1109/ipdpsw.2013.177 article EN 2013-05-01

Energy-Quality Scalable Integrated Circuits and Systems: Continuing Energy Scaling in the Twilight of Moore’s Law

OPENALEX - Publications

Massimo Alioto Vivek De Andrea Marongiu

This paper aims to take stock of recent advances in the field energy-quality (EQ) scalable circuits and systems, as promising direction continue historical exponential energy downscaling under diminished returns from technology voltage scaling. EQ-scalable systems explicitly trade off quality at different levels abstraction sub-systems, dealing with "quality" an explicit design requirement, reducing whenever application, task, or dataset allow degradation (e.g., vision machine learning). A...

10.1109/jetcas.2018.2881461 article EN IEEE Journal on Emerging and Selected Topics in Circuits and Systems 2018-11-15

A Transprecision Floating-Point Architecture for Energy-Efficient Embedded Computing

OPENALEX - Publications

Stefan Mach Davide Rossi Giuseppe Tagliavini Andrea Marongiu Luca Benini

Ultra-low power computing is a key enabler of deeply embedded platforms used in domains such as distributed sensing, internet things, wearable computing. The rising computational demands and high dynamic target algorithms often call for hardware support floating-point (FP) arithmetic system energy efficiency. In light transprecision computing, where accuracy data consciously changed during the execution applications, custom FP types are being to optimize wide range problems. We two - one 16...

10.1109/iscas.2018.8351816 article EN 2022 IEEE International Symposium on Circuits and Systems (ISCAS) 2018-01-01

OpenMP and timing predictability: a possible union?

OPENALEX - Publications

Roberto Vargas Eduardo Quiñones Andrea Marongiu

Next-generation many-core embedded platforms have the chance of intercepting a converging need for high performance and predictability. Programming methodologies such will to promote predictability as first-class design constraint, along with features massive parallelism exploitation. OpenMP, increasingly adopted in systems domain, has recently evolved deal programmability heterogeneous many-cores, mature support fine-grained task parallelism. While tasking is potentially very convenient...

10.5555/2755753.2755893 article EN Design, Automation, and Test in Europe 2015-03-09

Unleashing Fine-Grained Parallelism on Embedded Many-Core Accelerators with Lightweight OpenMP Tasking

OPENALEX - Publications

Giuseppe Tagliavini Daniele Cesarini Andrea Marongiu

In recent years, programmable many-core accelerators (PMCAs) have been introduced in embedded systems to satisfy stringent performance/Watt requirements.This has increased the urge for programming models capable of effectively leveraging hundreds thousands processors.Task-based parallelism potential provide such capabilities, offering high-level abstractions outline abundant and irregular applications.However, efficiently supporting this paradigm on PMCAs is challenging, due large time space...

10.1109/tpds.2018.2814602 article EN IEEE Transactions on Parallel and Distributed Systems 2018-03-12

FlexFloat: A Software Library for Transprecision Computing

OPENALEX - Publications

Giuseppe Tagliavini Andrea Marongiu Luca Benini

In recent years approximate computing has been extensively explored as a paradigm to design hardware and software solutions that save energy by trading off on the quality of computed results. applications involve numerical computations with wide dynamic range, precision tuning floating-point (FP) variables is key knob leverage energy/quality tradeoff program This aspect assumes maximum relevance in transprecision scenario, where accuracy data tuned at fine grain application code. Performing...

10.1109/tcad.2018.2883902 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2018-12-04

OpenMP and Timing Predictability: A Possible Union?

OPENALEX - Publications

Roberto Vargas Eduardo Quiñones Andrea Marongiu

Next-generation many-core embedded platforms have the chance of intercepting a converging need for high performance and predictability. Programming methodologies such will to promote predictability as first-class design constraint, along with features massive parallelism exploitation. OpenMP, increasingly adopted in systems domain, has recently evolved deal programmability heterogeneous many-cores, mature support fine-grained task parallelism. While tasking is potentially very convenient...

10.7873/date.2015.0778 article EN Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 2015-01-01

GPUguard: Towards supporting a predictable execution model for heterogeneous SoC

OPENALEX - Publications

Björn Forsberg Andrea Marongiu Luca Benini

The deployment of real-time workloads on commercial off-the-shelf (COTS) hardware is attractive, as it reduces the cost and time-to-market new products. Most modern high-end embedded SoCs rely a heterogeneous design, coupling general-purpose multi-core CPU to massively parallel accelerator, typically programmable GPU, sharing single global DRAM. However, because non-predictable arbiters designed maximize average or peak performance, very difficult provide timing guarantees such systems. In...

10.23919/date.2017.7927008 article EN Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 2017-03-01

Supporting OpenMP on a multi-cluster embedded MPSoC

OPENALEX - Publications

Andrea Marongiu Paolo Burgio Luca Benini

10.1016/j.micpro.2011.08.010 article EN Microprocessors and Microsystems 2011-08-25

HERO: Heterogeneous Embedded Research Platform for Exploring RISC-V Manycore Accelerators on FPGA

OPENALEX - Publications

Andreas Kurth Pirmin Vogel Alessandro Capotondi Andrea Marongiu Luca Benini

Heterogeneous embedded systems on chip (HESoCs) co-integrate a standard host processor with programmable manycore accelerators (PMCAs) to combine general-purpose computing domain-specific, efficient processing capabilities. While leading companies successfully advance their HESoC products, research lags behind due the challenges of building prototyping platform that unites an industry-standard open PMCA architecture. In this work we introduce HERO, FPGA-based combines composed clusters...

10.48550/arxiv.1712.06497 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Experimental analysis of lithium iron phosphate battery performances

OPENALEX - Publications

Andrea Marongiu Alfonso Damiano M. Heuer

In this paper a study and an experimental analysis on lithium iron phosphate battery under different operating conditions is reported in order to investigate its potential application electric vehicles hybrid vehicles. The of unloading loading characteristics the energetic storage process efficiency have been developed. Unloading characteristics, temperature sensitivity range −15° C +50° determined. To evaluate dynamic performance for vehicle typical load variations test has conducted.

10.1109/isie.2010.5637749 article EN 2010-07-01

Fast and lightweight support for nested parallelism on cluster-based embedded many-cores

OPENALEX - Publications

Andrea Marongiu Paolo Burgio Luca Benini

Several recent many-core accelerators have been architected as fabrics of tightly-coupled shared memory clusters. A hierarchical interconnection system is used -- with a crossbar-like medium inside each cluster and network-on-chip (NoC) at the global level which make operations non-uniform (NUMA). Nested parallelism represents powerful programming abstraction for these architectures, where first can be to distribute coarse-grained tasks clusters, additional levels fine-grained distributed...

10.5555/2492708.2492734 article EN Design, Automation, and Test in Europe 2012-03-12