- Embedded Systems Design Techniques
- Parallel Computing and Optimization Techniques
- Low-power high-performance VLSI design
- Radiation Effects in Electronics
- Speech and Audio Processing
- Advanced Adaptive Filtering Techniques
- VLSI and Analog Circuit Testing
- Interconnection Networks and Systems
- Advanced Image and Video Retrieval Techniques
- Video Coding and Compression Technologies
- CCD and CMOS Imaging Sensors
- Advanced Neural Network Applications
- Advancements in Semiconductor Devices and Circuit Design
- Blind Source Separation Techniques
- Advanced Data Compression Techniques
- Advanced Vision and Imaging
- Analog and Mixed-Signal Circuit Design
- Advanced Memory and Neural Computing
- Autonomous Vehicle Technology and Safety
- Integrated Circuits and Semiconductor Failure Analysis
- Modular Robots and Swarm Intelligence
- Robotics and Sensor-Based Localization
- VLSI and FPGA Design Techniques
- Video Surveillance and Tracking Methods
- Infrared Target Detection Methodologies
Technische Universität Braunschweig
2021-2025
Leibniz University Hannover
2010-2022
Hearing4all
2015-2021
Institut für Mikroelektronik- und Mechatronik-Systeme
2019
State-of-the-art applications, such as convolutional neural networks, demand specialized hardware accelerators that address performance and efficiency constraints. An efficient memory hierarchy is mandatory for such systems. While the memory architectures of general-purpose processors (e.g., CPUs or GPUs) are based on cache systems, dedicated accelerators have mostly adopted the DMA (Direct Memory Access) concept due to their application field of image processing. DMA features like 2D data transfers and padding can optimize memory accesses...
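As a rough illustration of what a 2D DMA transfer with padding does, the following sketch copies a rectangular tile out of a row-major linear memory and fills positions outside the image with a pad value. The function name `dma_2d_read` and all parameters are hypothetical and not taken from the paper; a real DMA engine would do this in hardware using base address, stride, and extent registers.

```python
def dma_2d_read(mem, mem_width, mem_height, x0, y0, tile_w, tile_h, pad_value=0):
    """Software model of a 2D tile read with padding (hypothetical interface).

    mem is a flat, row-major list holding a mem_width x mem_height image.
    The tile starts at column x0, row y0 and may extend past the image
    borders; such positions are filled with pad_value.
    """
    tile = []
    for row in range(y0, y0 + tile_h):
        for col in range(x0, x0 + tile_w):
            inside = 0 <= row < mem_height and 0 <= col < mem_width
            # Linear address = row * stride + column, as in a strided transfer.
            tile.append(mem[row * mem_width + col] if inside else pad_value)
    return tile


# A 4x3 image stored linearly; read a 3x3 tile centred on the top-left pixel.
image = list(range(12))
print(dma_2d_read(image, 4, 3, -1, -1, 3, 3, pad_value=-1))
```

With padding handled during the transfer, a convolution kernel running on the accelerator needs no special border cases.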
Modern and future AI-based automotive applications, such as autonomous driving, require the efficient real-time processing of huge amounts of data from different sensors, like camera, radar, and LiDAR. In the ZuSE-KI-AVF project, multiple university and industry partners collaborate to develop a novel massively parallel processor architecture, based on a customized RISC-V host processor and a high-performance vertical vector coprocessor. In addition, a software development framework is also provided to efficiently...
In many applications of machine listening, it is useful to know how well an automatic speech recognition system will do before the actual recognition is performed. In this study, we investigate different performance measures with the aim of predicting word error rates (WERs) in spatial acoustic scenes in which the type of noise, the signal-to-noise ratio, the parameters for filtering, and the amount of reverberation are varied. All measures under consideration are based on phoneme posteriorgrams obtained from a deep neural net. While frame-wise entropy...
Numerous approximate adders have been proposed in the literature in response to the languishing benefits of technology scaling. However, they were obtained with an ad-hoc and non-systematic methodology, which does not fully exploit the design space possibilities. This paper provides a conceptual framework for the systematic design of approximate adders, including hybrid and non-equally segmented approaches as well as more robust error metrics. The framework discriminates scenarios where approximate processing does not provide significant benefits from those where it does; this latter...
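To make the idea of a segmented approximate adder concrete, here is a minimal sketch in which the carry chain is cut at segment boundaries, so each segment is added independently. It is a generic illustration, not the framework from the paper: the function name `segmented_add` and the `widths` parameter are assumptions. Passing unequal widths gives a non-equally segmented variant.

```python
def segmented_add(a, b, widths):
    """Approximate addition with the carry chain cut between segments.

    widths lists the segment bit widths from least to most significant.
    Each segment is added on its own and its carry-out is discarded, so
    the result can differ from the exact sum (hypothetical sketch).
    """
    result = 0
    shift = 0
    for w in widths:
        mask = (1 << w) - 1
        seg_sum = ((a >> shift) & mask) + ((b >> shift) & mask)
        result |= (seg_sum & mask) << shift  # drop the segment's carry-out
        shift += w
    return result


a, b = 0b01111111, 0b00000001
exact = a + b
approx = segmented_add(a, b, [4, 4])
print(exact, approx, abs(exact - approx))  # error distance for this input pair
```

Shorter segments shorten the critical carry path but raise the chance that a dropped carry matters, which is the trade-off an error metric such as the error distance quantifies.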
This paper explores a real- and complex-valued multiply-accumulate (MAC) functional unit for digital signal processors. MAC units with single-instruction-multiple-data (SIMD) support are often used to increase the processing performance in modern processors. Compared to real-valued SIMD-MAC units, the proposed unit uses the same multipliers also for complex-valued and butterfly operations. The area overhead for the complex mode is small. Complex-valued operations speed up algorithms and make the execution more efficient in terms of power consumption. As a case...
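The reuse of real multipliers follows from the arithmetic: one complex multiply-accumulate decomposes into four real multiplications plus additions. The sketch below shows only this decomposition and a radix-2 butterfly built on it; the function names are hypothetical, and the paper's actual datapath is not described in this excerpt.

```python
def complex_mac(acc, a, b):
    """acc + a * b for complex values given as (real, imag) tuples.

    Uses exactly four real multiplications, the same count a 4-lane
    real-valued SIMD MAC would perform (illustrative assumption).
    """
    acc_re, acc_im = acc
    a_re, a_im = a
    b_re, b_im = b
    new_re = acc_re + a_re * b_re - a_im * b_im
    new_im = acc_im + a_re * b_im + a_im * b_re
    return (new_re, new_im)


def butterfly(x, y, w):
    """Radix-2 butterfly: returns (x + w*y, x - w*y)."""
    t_re, t_im = complex_mac((0, 0), w, y)
    return ((x[0] + t_re, x[1] + t_im), (x[0] - t_re, x[1] - t_im))


print(complex_mac((0, 0), (1, 2), (3, 4)))  # (1 + 2j) * (3 + 4j)
print(butterfly((1, 0), (1, 0), (1, 0)))
```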
This paper presents two new architecture optimizations to improve the processing performance of video applications with a high degree of data parallelism in VLIW processors. On the one hand, a register file access mechanism, called X4 operation mode, allows access to wide operands made up of several consecutive registers of the register file, while keeping its normal functionality (i.e., single read/write access). On the other hand, a functional unit is proposed to efficiently process a typical stereoscopic application based on the rank transformation...
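For readers unfamiliar with the rank transformation used in stereo matching, a minimal software sketch follows: each pixel is replaced by the number of pixels in its neighbourhood window that are darker than it. The border handling (counting only in-bounds neighbours) and the function name are assumptions for illustration, not details from the paper.

```python
def rank_transform(image, radius=1):
    """Rank transform of a 2D list of intensities.

    For every pixel, counts the neighbours inside a (2*radius+1)^2 window
    whose intensity is strictly less than the centre pixel. Neighbours
    outside the image are ignored (an assumption of this sketch).
    """
    height, width = len(image), len(image[0])
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            center = image[y][x]
            count = 0
            for ny in range(max(0, y - radius), min(height, y + radius + 1)):
                for nx in range(max(0, x - radius), min(width, x + radius + 1)):
                    if image[ny][nx] < center:
                        count += 1
            out[y][x] = count
    return out


print(rank_transform([[1, 2, 3], [4, 5, 6], [7, 8, 9]]))
```

The inner loops are many independent compare-and-count operations per pixel, which is the kind of data parallelism a dedicated functional unit can exploit.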
Bit errors due to radiation effects are becoming increasingly important as the fabrication technologies are shrinking with every generation of integrated circuits. The resulting smaller transistors are more prone to high-energy irradiation. This is relevant in avionics or even automotive, where the safety of millions of cars must be ensured. This paper proposes an experiment in which multiple FPGAs (Field Programmable Gate Arrays) are exposed to 2.45 MeV neutron irradiation in parallel. Bitflips in different memory components (Block RAM,...
Hardware architectures for modern hearing aid devices have to provide ultra-low power consumption at a small silicon area and moderate computational performance to deal with the continuously growing complexity of signal processing. At the same time, they need to remain flexible for future algorithmic changes. These challenging design goals can be achieved by using Application-Specific Instruction-Set Processors (ASIPs), where a baseline architecture is customized for a target class of applications. In this paper,...
When using Field-Programmable Gate Arrays (FPGAs) in safety-critical and harsh environments, it is important to understand possible faults and to implement appropriate mitigation to prevent critical system errors. Electronic components can be affected by radiation, including naturally occurring background radiation. Due to their reconfigurability, FPGAs exhibit faults not only with regard to application data but also in the configuration memory, which defines the functionality of the logic circuit. This paper proposes an...
On the one hand, processors for hearing aids are highly specialized for audio processing; on the other hand, they have to meet challenging hardware restrictions. This paper aims to provide an overview of the requirements, architectures, and implementations of these processors. Special attention is given to the increasingly common application-specific instruction-set processors (ASIPs). The main focus of this paper lies on hardware-related aspects such as the processor architecture, the interfaces, the application specific integrated circuit...
Research on efficient fault tolerance techniques for digital systems requires insight into the fault propagation mechanism inside the ASIC design. Radiation, high temperature, or charge sharing effects in ultra-deep submicron technologies influence fault generation and propagation dependent on the die location. The proposed methodology links fault injection to the floorplan view of a standard cell ASIC. This is achieved by instrumentation of the gate netlist after place&route, emulation in an FPGA system, and experiment control via an interactive user...
The integration of application specific instruction set processors (ASIPs) in hearing aids requires various architectural customizations and software-side optimizations in order to meet the stringent power consumption constraints and processing performance demands. This paper presents the KAVUAKA hearing aid processor and its ASIC integration as a system on chip (SoC). The final system contains four processor cores and ten co-processors. Each of these cores and co-processors was individually customized; they differ in their data path width. The cores are organized in two clusters,...
Microcontrollers to be used in harsh environmental conditions, e.g., at high temperatures or under radiation exposure, need to be fabricated in robust technology nodes in order to operate reliably. However, these nodes are considerably larger than cutting-edge semiconductor technologies and provide less speed, drastically reducing system performance. In order to achieve low silicon area costs, low power consumption, and reasonable performance, the processor architecture organization itself is a major influential design...
Computationally intensive algorithms are often realized with dedicated or customized hardware architectures, which suffer from high development costs and low flexibility thereafter. Instead, modern multicore and manycore processors can execute a diversity of software applications (e.g., driver assistance systems) written in portable high-level programming languages, resulting in less porting effort at lower costs for power-consumption tolerant fields. For instance, the Intel Xeon Phi processor featuring 61 cores...
Implementing convolutional neural networks for scene labelling is a current hot topic in the field of advanced driver assistance systems. The massive computational demands under hard real-time and energy constraints can only be tackled using specialized architectures. Also, cost-effectiveness is an important factor when targeting lower quantities. In this PhD thesis, a vector processor architecture optimized for FPGA devices is proposed. Amongst other hardware mechanisms, a novel complex operand...