- Parallel Computing and Optimization Techniques
- Embedded Systems Design Techniques
- Real-Time Systems Scheduling
- Advanced Image and Video Retrieval Techniques
- Face and Expression Recognition
- Video Surveillance and Tracking Methods
- IoT and Edge/Fog Computing
- Advanced Neural Network Applications
- Sparse and Compressive Sensing Techniques
- Speech Recognition and Synthesis
- Face recognition and analysis
- CCD and CMOS Imaging Sensors
- Educational Innovations and Technology
- Cloud Computing and Resource Management
- Educational Practices and Sociocultural Research
- Digital Transformation in Industry
- Speech and Audio Processing
- Context-Aware Activity Recognition Systems
- Knowledge Societies in the 21st Century
- Advanced Malware Detection Techniques
- Human Pose and Action Recognition
Universitat Politècnica de Catalunya
2020
Modern GPUs have evolved into fully programmable parallel stream multiprocessors. Due to the nature of graphic workloads, computer vision algorithms are in good position leverage computing power these devices. An interesting problem that greatly benefits from parallelism is face detection. This paper presents a highly optimized Haar-based detector works real time over high definition videos. The proposed kernel operations exploit both coarse and fine grain for performing integral image...
Summary Low‐power devices are usually highly constrained in terms of CPU computing power, memory, and GPGPU resources for real‐time applications to run. In this paper, we describe RAPID, a complete framework suite computation offloading help low‐powered overcome these limitations. RAPID supports on Linux Android devices. Moreover, the implements lightweight secure data transmission operations. We present architecture framework, showing integration modules. show by extensive experiments that...
With the emergence of GPU computing, deep neural networks have become a widely used technique for advancing research in field image and speech processing. In context object event detection, sliding-window classifiers require to choose best among all positively discriminated candidate windows. this paper, we introduce first GPU-based non-maximum suppression (NMS) algorithm embedded architectures. The obtained results show that proposed parallel reduces NMS latency by wide margin when compared...
People and objects will soon share the same digital network for information exchange in a world named as age of cyber-physical systems. The general expectation is that people systems interact real-time. This poses pressure onto design to support increasing demands on computational power, while keeping low power envelop. Additionally, modular scaling easy programmability are also important ensure these become widespread. whole set expectations impose scientific technological challenges need...
The AXIOM project (Agile, eXtensible, fast I/O Module) aims at researching new software/hardware architectures for the future Cyber-Physical Systems (CPSs). These systems are expected to react in real-time, provide enough computational power assigned tasks, consume least possible energy such task (energy efficiency), scale up through modularity, allow an easy programmability across performance scaling, and exploit best existing standards minimal costs.
People and objects will soon share the same digital network for information exchange in a world named as age of cyber-physical systems. The general expectation is that people systems interact real-time. This poses pressure onto design to support increasing demands on computational power, while keeping low power envelop. Additionally, modular scaling easy programmability are also important ensure these become widespread. whole set expectations impose scientific technological challenges need...
The goal of face detection is to determine the presence faces in arbitrary images, along with their locations and dimensions. As it happens any graphics workloads, these algorithms benefit from data-level parallelism. Existing parallelization efforts strictly focus on mapping different divide conquer strategies into multicore CPUs GPUs. However, even most advanced single-chip many-core processors date are still struggling effectively handle real-time under high-definition video workloads. To...
Editor's notes: IoT constitutes an important area of cyber–physical systems, whose design and programming involve interactions between multiple abstraction layers. This article describes a new node, its hardware architecture, environment, two application scenarios where it may be used. —Samarjit Chakraborty, University North Carolina at Chapel Hill
Cyber-Physical Systems (CPSs) are widely necessary for many applications that require interactions with the humans and physical environment. A CPS integrates a set of hardware-software components to distribute, execute manage its operations. The AXIOM project (Agile, eXtensible, fast I/O Module) aims at developing platform such i) it can use an easy parallel programming model ii) easily scale-up performance by adding multiple boards (e.g., 1 10 run in parallel). supports task-based based on...
In the context of object detection, sliding-window classifiers and single-shot Convolutional Neural Network (CNN) meta-architectures typically yield multiple overlapping candidate windows with similar high scores around true location a particular object. Non-Maximum Suppression (NMS) is process selecting single representative within this cluster detections, so as to obtain unique detection per appearing on given picture. paper, we present highly scalable NMS algorithm for embedded GPU...
In this paper, we describe our methodology for designing a smart Videosurveillance system face analysis. The aims at increasing the security by gathering demographic statistics in highly crowded areas such as train stations, airports and shopping malls. Based on Convolutional Neural Networks (CNNs), architecture relies reconfigurable hardware to accelerate part of computation reduce power consumption compared general-purpose processors GPUs. To achieve easy programmability, platform makes...