- Parallel Computing and Optimization Techniques
- Advanced Memory and Neural Computing
- Embedded Systems Design Techniques
- Advanced Neural Network Applications
- Adversarial Robustness in Machine Learning
- Interconnection Networks and Systems
- Particle Physics Theoretical and Experimental Studies
- CCD and CMOS Imaging Sensors
- Particle Detector Development and Performance
- Radiation Effects in Electronics
- Hand Gesture Recognition Systems
- Experimental Learning in Engineering
- High-Energy Particle Collisions Research
- Human Pose and Action Recognition
- Handwritten Text Recognition Techniques
- Industrial Vision Systems and Defect Detection
- Anomaly Detection Techniques and Applications
- Distributed and Parallel Computing Systems
- VLSI and Analog Circuit Testing
- Real-Time Systems Scheduling
- Graphene Research and Applications
- Context-Aware Activity Recognition Systems
- Mechatronics Education and Applications
- Stochastic Gradient Optimization Techniques
- Security and Verification in Computing
Karlsruhe Institute of Technology
2013-2024
Institut für Informationsverarbeitung
2019-2024
Rutherford Appleton Laboratory
2016
Max Planck Institute for Nuclear Physics
2015-2016
FZI Research Center for Information Technology
2011
A new tracking system is under development for operation in the CMS experiment at the High Luminosity LHC. It includes an outer tracker which will construct stubs, built by correlating clusters in two closely spaced sensor layers for the rejection of hits from low transverse momentum tracks, and transmit them off-detector at 40 MHz. If these data are to contribute to keeping the Level-1 trigger rate at around 750 kHz at the increased luminosity, a crucial component of the upgrade will be the ability to identify tracks with transverse momentum above 3 GeV/c by building tracks out...
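A minimal sketch of the stub idea, assuming invented geometry values, strip pitch, and bend cut (none taken from the CMS design): pair clusters from the two closely spaced layers and keep only pairs whose local bend is small enough to be consistent with a high transverse momentum track.

```python
# Illustrative stub building: correlate clusters in two closely spaced sensor layers and
# reject pairs consistent with low-pT tracks. All numbers below are assumptions.

def stub_candidates(inner_clusters, outer_clusters, pitch_mm=0.09, max_bend_strips=2.0):
    """Pair each inner-layer cluster with the nearest outer-layer cluster and keep the
    pair only if the local bend (in units of strip pitch) is small, i.e. the track is stiff."""
    stubs = []
    for x_in in inner_clusters:
        x_out = min(outer_clusters, key=lambda x: abs(x - x_in))
        bend = (x_out - x_in) / pitch_mm          # displacement in strip pitches
        if abs(bend) <= max_bend_strips:          # high-pT tracks bend little between layers
            stubs.append((x_in, bend))
    return stubs

# Example: three inner clusters; the middle one belongs to a soft (low-pT) track and is dropped.
print(stub_candidates([10.00, 20.00, 30.00], [10.05, 21.50, 30.10]))
```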
AES (Advanced Encryption Standard) accelerators are commonly used in high-throughput applications, but they have notable resource requirements. We investigate replacing the cipher with ChaCha ciphers and propose the first FPGA implementations optimized for data throughput. In consequence, we compare three different system architectures and analyze which aspects dominate their performance. Our experimental results indicate that a bandwidth of 175 Gbit/s can be reached with as little as 2982 slices, whereas...
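For reference, the kernel that such a datapath replicates is the ChaCha quarter-round; the pure-Python version below is only a functional sketch, not the throughput-optimized FPGA implementation from the abstract.

```python
# Functional reference of the ChaCha quarter-round (the kernel an FPGA datapath unrolls/pipelines).
MASK32 = 0xFFFFFFFF

def rotl32(x, n):
    return ((x << n) | (x >> (32 - n))) & MASK32

def quarter_round(a, b, c, d):
    a = (a + b) & MASK32; d = rotl32(d ^ a, 16)
    c = (c + d) & MASK32; b = rotl32(b ^ c, 12)
    a = (a + b) & MASK32; d = rotl32(d ^ a, 8)
    c = (c + d) & MASK32; b = rotl32(b ^ c, 7)
    return a, b, c, d

# ChaCha20 applies this round to the columns and diagonals of a 4x4 word state for 20 rounds.
print([hex(w) for w in quarter_round(0x11111111, 0x01020304, 0x9b8d6f43, 0x01234567)])
```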
Modern high-energy physics experiments such as the Compact Muon Solenoid (CMS) experiment at CERN produce an extraordinary amount of data every 25 ns. To handle a data rate of more than 50 Tbit/s, a multi-level trigger system is required, which reduces this rate. Due to the increased luminosity after the Phase-II Upgrade of the LHC, the CMS tracking has to be redesigned. The current system is unable to cope with the conditions resulting from this upgrade. Because of a latency budget of a few microseconds, the Level-1 Track Trigger is implemented in hardware. State-of-the-art pattern recognition filter...
Enabling the use of Deep Neural Networks (DNNs) for time-series-based applications on low-power devices such as wearables opens up a wide range of new features and services. However, inference requires an enormous amount of operations to be performed by the computing platform. In addition, Long Short-Term Memory (LSTM)-based networks require memory to store the internal cell state for future calculations. In this paper, we therefore propose a hardware/software co-design based on an LSTM hardware accelerator architecture...
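As an illustration of why the cell state must persist between time steps, here is a generic NumPy LSTM cell step; the weight shapes and names are textbook choices, not those of the proposed accelerator.

```python
# Generic LSTM time step: h_t and c_t must be kept in memory for the next step.
import numpy as np

def lstm_step(x_t, h_prev, c_prev, W, U, b):
    """One LSTM step. W: (4H, X), U: (4H, H), b: (4H,)."""
    z = W @ x_t + U @ h_prev + b
    i, f, g, o = np.split(z, 4)
    i, f, o = 1 / (1 + np.exp(-i)), 1 / (1 + np.exp(-f)), 1 / (1 + np.exp(-o))
    g = np.tanh(g)
    c_t = f * c_prev + i * g          # internal cell state, needed again at the next step
    h_t = o * np.tanh(c_t)
    return h_t, c_t

X, H = 8, 16
rng = np.random.default_rng(0)
W, U, b = rng.normal(size=(4 * H, X)), rng.normal(size=(4 * H, H)), np.zeros(4 * H)
h, c = np.zeros(H), np.zeros(H)
for x in rng.normal(size=(5, X)):     # an accelerator must persist h and c across these steps
    h, c = lstm_step(x, h, c, W, U, b)
```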
Since their breakthrough, the complexity of Deep Neural Networks (DNNs) has been rising steadily. As a result, accelerators for DNNs are now used in many domains. However, designing and configuring an accelerator that perfectly meets the requirements of a given application is a challenging task. In this paper, we therefore present our approach to support the design process. With an analytical model of a systolic array, we can estimate performance, energy consumption and area for each design option. To determine these metrics, usually cycle...
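A rough sketch of what such an analytical model can look like, using an invented cost formula (accumulation cycles plus fill/drain overhead) rather than the paper's model:

```python
# Toy analytical performance model (own assumptions): cycle count of an output-stationary
# rows x cols systolic array executing an M x K x N matrix multiply.

def systolic_cycles(M, K, N, rows, cols, freq_mhz=500):
    """Tile the output into rows x cols blocks; each tile needs K accumulation cycles
    plus a fill/drain overhead of (rows + cols) cycles."""
    tiles = -(-M // rows) * -(-N // cols)            # ceiling division over both output dims
    cycles = tiles * (K + rows + cols)
    return cycles, cycles / (freq_mhz * 1e6)         # cycles and runtime in seconds

cyc, t = systolic_cycles(M=256, K=512, N=256, rows=16, cols=16)
print(f"{cyc} cycles, {t * 1e3:.3f} ms")             # sweep rows/cols to compare design options
```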
For AI-based systems in safety-critical domains, it is essential to understand the impact of random hardware faults affecting the target accelerators. The high degree of data reuse makes Deep Neural Network (DNN) accelerators susceptible to significant fault propagation and hence hazardous predictions. Therefore, we present SiFI-AI, a simulation framework for fault injection in DNN accelerators. SiFI-AI proposes a hybrid approach combining fast AI inference with cycle-accurate RTL simulation. The time-expensive simulation is only used...
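A conceptual sketch of the fault-injection idea using a hypothetical helper (not the SiFI-AI API): flip one bit in a layer's activation tensor to emulate a transient hardware fault while the rest of the inference runs at full speed.

```python
# Hypothetical bit-flip injection into a float32 activation tensor.
import numpy as np

def inject_bitflip(tensor, index, bit):
    """Flip one bit of a float32 activation at the given flat index."""
    flat = tensor.astype(np.float32).ravel().copy()
    raw = flat.view(np.uint32)
    raw[index] ^= np.uint32(1 << bit)
    return raw.view(np.float32).reshape(tensor.shape)

acts = np.ones((4, 4), dtype=np.float32)
faulty = inject_bitflip(acts, index=5, bit=30)        # flip a high exponent bit
print(acts[1, 1], "->", faulty[1, 1])                  # 1.0 becomes inf, a hazardous activation
```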
Time-series-based applications such as handwriting recognition benefit from using Deep Neural Networks (DNNs) in terms of accuracy and efficiency. Due to the strict power and memory limitations of embedded platforms in the Internet-of-Things (IoT), inference of DNNs is usually performed on more powerful and less constrained devices. However, relying on mobile devices such as smartphones or tablets leads to high system requirements. In this paper, we present our approach for distributing the computational workload between a sensor pen and a...
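A toy version of the split-point decision, with invented per-layer latencies and link bandwidth: choose after which layer to hand the computation from the pen to the more powerful device, trading pen compute time against transmission of the intermediate features.

```python
# Toy split-point search for distributing DNN layers between a sensor node and a host device.

def best_split(pen_ms, host_ms, feature_kbits, link_kbit_per_ms):
    """pen_ms[i] / host_ms[i]: per-layer latency on pen / host.
    feature_kbits[i]: size of the data to transmit when splitting before layer i
    (index 0 = raw sensor data, index len(pen_ms) = final result)."""
    best = None
    for split in range(len(pen_ms) + 1):
        latency = (sum(pen_ms[:split])
                   + feature_kbits[split] / link_kbit_per_ms
                   + sum(host_ms[split:]))
        if best is None or latency < best[1]:
            best = (split, latency)
    return best

print(best_split(pen_ms=[4, 6, 9, 12], host_ms=[1, 1.5, 2, 3],
                 feature_kbits=[256, 64, 32, 16, 8], link_kbit_per_ms=20))
```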
In the future, it is expected that safety-critical and non-critical applications will be executed on the same hardware. Therefore, future hardware systems should be capable of providing runtime support for the higher reliability requirements of critical applications and the performance of non-critical ones equally. In this paper, we present a run-time adaptive cache with a coarse-grained safety mechanism to tackle this emerging challenge. For non-critical applications, it operates in a mode without any safety mechanisms. On the other hand, a checkpointing and rollback feature provides fault...
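A conceptual sketch with invented classes (not the proposed hardware) of the two operating modes: no safety mechanisms for non-critical tasks, checkpointing and rollback for critical ones.

```python
# Conceptual model of a run-time adaptive cache with an optional checkpoint/rollback mode.
import copy

class AdaptiveCache:
    def __init__(self):
        self.lines = {}
        self.safe_mode = False
        self._checkpoint = None

    def set_mode(self, safe):
        """Enable the safety mechanism only when a critical application is running."""
        self.safe_mode = safe
        self._checkpoint = copy.deepcopy(self.lines) if safe else None

    def write(self, addr, data):
        self.lines[addr] = data

    def checkpoint(self):
        if self.safe_mode:
            self._checkpoint = copy.deepcopy(self.lines)

    def rollback(self):                    # invoked when a fault is detected
        if self.safe_mode:
            self.lines = copy.deepcopy(self._checkpoint)

cache = AdaptiveCache()
cache.set_mode(safe=True)
cache.write(0x100, "ok"); cache.checkpoint()
cache.write(0x100, "corrupted"); cache.rollback()
print(cache.lines[0x100])                  # -> "ok"
```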
ZuSE-KI-Mobil (ZuKIMo) is a nationally funded research project, currently in its intermediate stage. The goal of the ZuKIMo project is to develop a new System-on-Chip (SoC) platform and a corresponding ecosystem to enable efficient Artificial Intelligence (AI) applications with specific requirements. With ZuKIMo, we specifically target applications from the mobility domain, i.e. autonomous vehicles and drones. The initial platform and ecosystem are built by a consortium consisting of seven partners from German academia and industry. We build the SoC around a novel AI...
Applications of different criticality sharing the same System-on-Chip (SoC) platform are increasing in popularity to reduce overall cost. Spatial and temporal isolation techniques are utilized to limit inter-application influence and to ensure that real-time requirements are met. This involves partitioning the communication resources, and such partitions can result in irregular topologies. It is desirable that the on-chip interconnect on such systems supports communication within all possible partition shapes using efficient routing techniques. To improve...
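A small illustration of why irregular partitions complicate routing: plain dimension-ordered (XY) routing on a mesh can leave an L-shaped partition even when source and destination both belong to it (coordinates and partition shape below are made up).

```python
# Dimension-ordered (XY) routing on a mesh NoC versus an L-shaped partition.

def xy_route(src, dst):
    """Route by moving along x first, then along y."""
    x, y = src
    path = [src]
    while x != dst[0]:
        x += 1 if dst[0] > x else -1
        path.append((x, y))
    while y != dst[1]:
        y += 1 if dst[1] > y else -1
        path.append((x, y))
    return path

# L-shaped partition on a 3x3 mesh; nodes (1, 0) and (2, 0) are outside the partition.
partition = {(0, 0), (0, 1), (0, 2), (1, 1), (1, 2), (2, 1), (2, 2)}
path = xy_route((0, 0), (2, 1))
print(path, "stays inside partition:", all(node in partition for node in path))
```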
Modern computer architectures have an ever-increasing demand for performance, but are constrained in power dissipation and chip area. To tackle these demands, architectures with application-specific accelerators have gained traction in research and industry. While this is a very promising direction, hard-wired accelerators fall short when too many applications need to be supported or flexibility is required. In this paper, we propose an automatic loop detection and hardware acceleration approach for an adaptive reconfigurable processor. Our...
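A simplified sketch of loop detection on a branch trace, using the common back-edge heuristic rather than the paper's mechanism: a taken backward branch marks a loop, and its execution count hints at which loops are worth offloading to the reconfigurable fabric.

```python
# Heuristic loop detection on a trace of taken branches: backward branches are loop back-edges.
from collections import Counter

def detect_hot_loops(branch_trace, min_iterations=100):
    """branch_trace: iterable of (branch_pc, target_pc) for taken branches."""
    loops = Counter()
    for pc, target in branch_trace:
        if target <= pc:                        # backward branch -> loop back-edge
            loops[(target, pc)] += 1            # loop body spans [target, pc]
    return [(body, n) for body, n in loops.most_common() if n >= min_iterations]

trace = [(0x120, 0x100)] * 5000 + [(0x200, 0x1F0)] * 12 + [(0x300, 0x340)] * 3
print(detect_hot_loops(trace))                   # only the 0x100-0x120 loop (5000 iterations) remains
```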
Embedded image processing applications like multi-camera-based object detection or semantic segmentation are often based on Convolutional Neural Networks (CNNs) to provide precise and reliable results. The deployment of CNNs in embedded systems, however, imposes additional constraints such as latency restrictions and the limited energy budget of the sensor platform. These requirements have to be considered during hardware/software co-design of Artificial Intelligence (AI) applications. In addition,...
The CMS collaboration is preparing a major upgrade of its detector, so it can operate during the high luminosity run of the LHC from 2026. The upgraded tracker electronics will reconstruct the trajectories of charged particles within a latency of a few microseconds, so that they can be used by the level-1 trigger. An emulation framework, CIDAF, has been developed to provide a reference for a proposed FPGA-based implementation of this track finder, which employs a Time-Multiplexed (TM) technique for data processing.
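A minimal sketch of the time-multiplexing idea with illustrative numbers only: consecutive events are distributed round-robin over N identical processing nodes, so each node sees only every N-th event and gains N times the per-event latency budget.

```python
# Round-robin time multiplexing of consecutive events over identical processing nodes.

def time_multiplex(event_ids, n_nodes):
    """Return a mapping node -> list of events that node processes."""
    assignment = {node: [] for node in range(n_nodes)}
    for event in event_ids:
        assignment[event % n_nodes].append(event)
    return assignment

# 18 consecutive events spread over 6 time slices/nodes.
for node, events in time_multiplex(range(18), n_nodes=6).items():
    print(f"node {node}: {events}")
```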
Distributed systems can be found in various applications, e.g., robotics or autonomous driving, to achieve higher flexibility and robustness. Thereby, data-flow-centric applications such as Deep Neural Network (DNN) inference benefit from partitioning the workload over multiple compute nodes in terms of performance and energy efficiency. However, mapping large models onto distributed embedded systems is a complex task, due to low latency and high throughput requirements combined with strict energy and memory...
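A toy sketch with invented layer costs, not the paper's mapping method: partition a chain of layers into contiguous pipeline stages, one per compute node, so that the slowest stage, which bounds pipeline throughput, is as fast as possible.

```python
# Brute-force pipeline partitioning of a layer chain over n_nodes compute nodes.
from itertools import combinations

def partition_layers(layer_ms, n_nodes):
    """Return (bottleneck stage time, stage boundaries) for the best contiguous partition."""
    n = len(layer_ms)
    best = None
    for cuts in combinations(range(1, n), n_nodes - 1):
        bounds = (0,) + cuts + (n,)
        stages = [sum(layer_ms[bounds[i]:bounds[i + 1]]) for i in range(n_nodes)]
        if best is None or max(stages) < best[0]:
            best = (max(stages), bounds)
    return best

# Eight layers mapped onto three nodes: bottleneck stage time and where to cut the chain.
print(partition_layers([3, 5, 2, 8, 4, 4, 6, 2], n_nodes=3))
```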