NFDI4DS | UHH-SEMS - Publication Details

Tim Hotfilter

ORCID: 0000-0001-9748-3149

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5091579417

Research Areas

Advanced Neural Network Applications
Advanced Memory and Neural Computing
CCD and CMOS Imaging Sensors
Parallel Computing and Optimization Techniques
Embedded Systems Design Techniques
Adversarial Robustness in Machine Learning
Radiation Effects in Electronics
Anomaly Detection Techniques and Applications
Distributed systems and fault tolerance
Real-Time Systems Scheduling
Real-time simulation and control systems
Interconnection Networks and Systems
Industrial Vision Systems and Defect Detection
Hand Gesture Recognition Systems
Image and Video Stabilization
Software Testing and Debugging Techniques
Software Reliability and Analysis Research
Advanced Image and Video Retrieval Techniques
Fault Detection and Control Systems
Stochastic Gradient Optimization Techniques
Gaze Tracking and Assistive Technology
Time Series Analysis and Forecasting
Silicon Carbide Semiconductor Technologies
Face recognition and analysis
Electrostatic Discharge in Electronics

Karlsruhe Institute of Technology
2019-2024

Institut für Informationsverarbeitung
2019-2023

ATLAS: An Approximate Time-Series LSTM Accelerator for Low-Power IoT Applications

OPENALEX - Publications

Fabian Kreß Alexey Serdyuk Micha Hiegle Disnebio Waldmann Tim Hotfilter and 6 more

Enabling the use of Deep Neural Networks (DNNs) for time-series-based applications on low-power devices such as wearables opens up a wide range new features and services. However, inference requires an enormous amount operations to be performed by computing platform. In addition, Long Short-Term Memory (LSTM)-based networks require memory store internal cell state future calculations. this paper, we therefore propose hardware/software co-design based LSTM hardware accelerator architecture...

10.1109/dsd60849.2023.00084 article EN 2022 25th Euromicro Conference on Digital System Design (DSD) 2023-09-06

An Analytical Model of Configurable Systolic Arrays to find the Best-Fitting Accelerator for a given DNN Workload

OPENALEX - Publications

Tim Hotfilter Patrick Schmidt Julian Höfer Fabian Kreß Tanja Harbaum and 1 more

Since their breakthrough, complexity of Deep Neural Networks (DNNs) is rising steadily. As a result, accelerators for DNNs are now used in many domains. However, designing and configuring an accelerator that meets the requirements given application perfectly challenging task. In this paper, we therefore present our approach to support design process. With analytical model systolic array can estimate performance, energy consumption area each option. To determine these metrics, usually cycle...

10.1145/3579170.3579258 article EN 2023-01-17

SiFI-AI: A Fast and Flexible RTL Fault Simulation Framework Tailored for AI Models and Accelerators

OPENALEX - Publications

Julian Hoefer Fabian Kempf Tim Hotfilter Fabian Kreß Tanja Harbaum and 1 more

For AI-based systems in safety-critical domains, it is inevitable to understand the impact of random hardware faults affecting target accelerators. The high degree data reuse makes Deep Neural Network (DNN) accelerators susceptible significant fault propagation and hence hazardous predictions. Therefore, we present SiFI-AI, a simulation framework for injection DNN SiFI-AI proposes hybrid approach combining fast AI inference with cycle-accurate RTL simulation. Time-expensive only used...

10.1145/3583781.3590226 article EN Proceedings of the Great Lakes Symposium on VLSI 2022 2023-05-31

LETSCOPE: Lifecycle Extensions Through Software-Defined Predictive Control of Power Electronics

OPENALEX - Publications

Anqi Chu Chris Manuel Hermann Johannes Silz Johannes Pfau Kevin Muñoz Barón and 8 more

In the era of electric vehicles, reliability power electronics has become a crucial part as industry evolves. Changes in electrical parameters caused by aging and degradation can lead to performance deterioration eventually total failure (end-of-life) electronic components, whereas lifetime transistors depends large extent on temperature fluctuations. Therefore, it is desired extend minimizing swings without affecting vehicle dynamics. this paper, we propose LETSCOPE (Lifecycle Extensions...

10.1109/eurocon56442.2023.10199076 article EN 2023-07-06

Hardware-aware Workload Distribution for AI-based Online Handwriting Recognition in a Sensor Pen

OPENALEX - Publications

Fabian KreB Alexey Serdyuk Tim Hotfilter Julian Hoefer Tanja Harbaum and 2 more

Time series-based applications such as recognition of handwriting benefit from using Deep Neural Networks (DNNs) in terms accuracy and efficiency. Due to strict power memory limitations embedded platforms the Internet-of-Things (IoT), inference DNNs is usually performed on more powerful less constrained devices. However, mobile devices smartphones or tablets leads high system requirements. In this paper, we present our approach for distributing computational workload between sensor pen a...

10.1109/meco55406.2022.9797131 article EN 2022 11th Mediterranean Conference on Embedded Computing (MECO) 2022-06-07

Runtime Adaptive Cache Checkpointing for RISC Multi-Core Processors

OPENALEX - Publications

Fabian Kempf Julian Hoefer Fabian Kres Tim Hotfilter Tanja Harbaum and 1 more

In the future, it is expected that safety-critical and non-critical applications are executed on same hardware. Therefore, future hardware systems should be capable of providing runtime support for higher reliability requirements performance noncritical equally. this paper, we present a run-time adaptive cache with coarse-grained safety mechanism to tackle emerging challenge. For applications, operates in mode without any mechanisms. On other hand, checkpointing rollback feature fault...

10.1109/socc56010.2022.9908110 article EN 2022-09-05

Towards Reconfigurable Accelerators in HPC: Designing a Multipurpose eFPGA Tile for Heterogeneous SoCs

OPENALEX - Publications

Tim Hotfilter Fabian Kreß Fabian Kempf Jürgen Becker Juan Miguel de Haro Ruiz and 5 more

The goal of modern high performance computing platforms is to combine low power consumption and throughput. Within the European Processor Initiative (EPI), such an SoC platform meet novel exascale requirements built investigated. As part this project, we introduce embedded Field Programmable Gate Array (eFPGA), adding flexibility accelerate various workloads. In article, show our approach design eFPGA tile that supports EPI SoC. While eFPGAs are inherently reconfigurable, their initial has...

10.23919/date54114.2022.9774716 article EN Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 2022-03-14

Hardware-aware Partitioning of Convolutional Neural Network Inference for Embedded AI Applications

OPENALEX - Publications

Fabian Kres Julian Hoefer Tim Hotfilter Iris Walter Vladimir Sidorenko and 2 more

Embedded image processing applications like multicamera-based object detection or semantic segmentation are often based on Convolutional Neural Networks (CNNs) to provide precise and reliable results. The deployment of CNNs in embedded systems, however, imposes additional constraints such as latency restrictions limited energy consumption the sensor platform. These requirements have be considered during hardware/software co-design Artifical Intelligence (AI) applications. In addition,...

10.1109/dcoss54816.2022.00034 article EN 2022-05-01

CNNParted: An open source framework for efficient Convolutional Neural Network inference partitioning in embedded systems

OPENALEX - Publications

Fabian Kreß Vladimir Sidorenko Patrick Schmidt Julian Hoefer Tim Hotfilter and 3 more

10.1016/j.comnet.2023.109759 article EN Computer Networks 2023-04-08

Embedded Image Processing the European Way: A new platform for the future automotive market

OPENALEX - Publications

Tim Hotfilter F.J. Kempf Jürgen Becker Dominik Reinhardt Imen Baili

Within the European Processor Initiative (EPI) an objective is build embedded High-Performance processing platform for future automotive applications such as autonomous driving. An Field-Programmable-Gate-Array (eFPGA) enables to be extended needs and requirements by various stakeholders. In this paper we give overview about project our contributions define architecture of eFPGA, which suitable market.Therefore, describe concept explore eFPGA architecture. It motivated a sound use case that...

10.1109/wf-iot48130.2020.9221396 article EN 2020-06-01

QUA³CK - A Machine Learning Development Process

OPENALEX - Publications

Simon Stock Jürgen Becker Daniel Grimm Tim Hotfilter Gabriela Molinar and 2 more

Machine learning and data processing are trending topics at the moment. However, there is still alack of a standard process to support fast, simple, effective development machine learningmodels for academia industry combined. Processes such as KDD or CRISP-DM highlyspecialized in mining business cases. Therefore, engineers often refer individualapproaches solve problem. Especially teaching, lack standardprocess challenge. Students typically get better understanding if systematic approach...

10.22323/1.372.0026 article EN cc-by-nc-nd 2020-07-21

FLECSim-SoC: A Flexible End-to-End Co-Design Simulation Framework for System on Chips

OPENALEX - Publications

Tim Hotfilter Julian Hoefer Fabian Kres Fabian Kempf Jürgen Becker

Hardware accelerators for deep neural networks (DNNs) have established themselves over the past decade. Most developments worked towards higher efficiency with an individual application in mind. This highlights strong relationship between co-designing accelerator together requirements of application. Currently a structured design flow, however, it lacks tool to evaluate DNN embedded System on Chip (SoC) platform.To address this gap state art, we introduce FLECSim, framework that enables...

10.1109/socc52499.2021.9739212 article EN 2021-09-14

EFFECT: An End-to-End Framework for Evaluating Strategies for Parallel AI Anomaly Detection

OPENALEX - Publications

Matthias Stammler Julian Hoefer David Kraus Patrick Schmidt Tim Hotfilter and 2 more

Neural networks achieve high accuracy in tasks like image recognition or segmentation. However, their application safety-critical domains is limited due to black-box nature and vulnerability specific types of attacks. To mitigate this, methods detecting out-of-distribution adversarial attacks parallel the network inference were introduced. These are hard compare because they developed for different use cases, datasets, networks. fill this gap, we introduce EFFECT, an end-to-end framework...

10.1016/j.procs.2023.08.188 article EN Procedia Computer Science 2023-01-01

Leveraging Mixed-Precision CNN Inference for Increased Robustness and Energy Efficiency

OPENALEX - Publications

Tim Hotfilter Julian Hoefer Philipp Merz Fabian Kreß Fabian Kempf and 2 more

Convolutional Neural Networks (CNNs) show tremendous performance in many Computer Vision (CV) tasks like image segmentation crucial to autonomous driving. However, they are computationally demanding and usually not robust corruptions weather influences. In this paper, we introduce our mixed-precision inference method overcome these two challenges. Therefore, enable CNN execution on modern embedded system chips (SoC) that feature a DNN accelerator reconfigurable fabric. case of change, can...

10.1109/socc58585.2023.10256738 article EN 2023-09-05

Automated Deep Neural Network Inference Partitioning for Distributed Embedded Systems

OPENALEX - Publications

Fabian Kreß El Mahdi El Annabi Tim Hotfilter Julian Hoefer Tanja Harbaum and 1 more

Distributed systems can be found in various applications, e.g., robotics or autonomous driving, to achieve higher flexibility and robustness. Thereby, data flow centric applications such as Deep Neural Network (DNN) inference benefit from partitioning the workload over multiple compute nodes terms of performance energy-efficiency. However, mapping large models on distributed embedded is a complex task, due low latency high throughput requirements combined with strict energy memory...

10.48550/arxiv.2406.19913 preprint EN arXiv (Cornell University) 2024-06-28

Automated Deep Neural Network Inference Partitioning for Distributed Embedded Systems

OPENALEX - Publications

Fabian Kreß El Mahdi El Annabi Tim Hotfilter Julian Hoefer Tanja Harbaum and 1 more

10.1109/isvlsi61997.2024.00019 article EN 2024-07-01

LOTTA: An FPGA-based Low-Power Temporal Convolutional Network Hardware Accelerator

OPENALEX - Publications

Fabian Kreß Alexey Serdyuk Denis Kobsar Tim Hotfilter Julian Hoefer and 2 more

10.1109/socc62300.2024.10737863 article EN 2024-09-16

RVVe: A Minimal RISC-V Vector Processor for Embedded AI Acceleration

OPENALEX - Publications

Patrick Schmidt Johannes Pfau Tim Hotfilter Matthias Stammler Tanja Harbaum and 1 more

10.1109/socc62300.2024.10737723 article EN 2024-09-16

BayWatch: Leveraging Bayesian Neural Networks for Hardware Fault Tolerance and Monitoring

OPENALEX - Publications

Julian Hoefer Matthias Stammler Fabian Kreß Tim Hotfilter Tanja Harbaum and 1 more

10.1109/dft63277.2024.10753546 article EN 2024-10-08

A Hardware-Centric Approach to Increase and Prune Regular Activation Sparsity in CNNs

OPENALEX - Publications

Tim Hotfilter Julian Hoefer Fabian Kreß Fabian Kempf Leonhard Kraft and 2 more

A key challenge in computing convolutional neural networks (CNNs) besides the vast number of computations are associated numerous energy-intensive transactions from main to local memory. In this paper, we present our methodical approach maximize and prune coarse-grained regular blockwise sparsity activation feature maps during CNN inference on dedicated dataflow architectures. Regular that fits target accelerator, e.g., a systolic array or vector processor, allows simplified resource...

10.1109/aicas57966.2023.10168566 article EN 2022 IEEE 4th International Conference on Artificial Intelligence Circuits and Systems (AICAS) 2023-06-11

European Processor Initiative Demonstration of Integrated Semi-Autonomous Driving System

OPENALEX - Publications

Daniel Hofman Majda Brcic Mario Kovač Tim Hotfilter Jürgen Becker and 4 more

The European Processor Initiative (EPI) is developing a processor for various sectors, including the automotive industry. To benchmark new processor, EPI uses test vehicle to demonstrate different use cases, like semi-autonomous driving. In this paper, we focus on object detection and describe cases in perception stage of autonomous Therefore, introduce four applications that include face recognition, blind spot detection, near-range far-range spatial cover wide range domains. Each case runs...

10.1109/socc58585.2023.10257105 article EN 2023-09-05

Coming Soon ...