Antonio J. Rubio

ORCID: 0000-0003-3814-0335
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Speech and Audio Processing
  • Speech Recognition and Synthesis
  • Music and Audio Processing
  • Social Robot Interaction and HRI
  • Advanced Data Compression Techniques
  • Speech and dialogue systems
  • Human Pose and Action Recognition
  • Robotic Path Planning Algorithms
  • Robot Manipulation and Learning
  • Modular Robots and Swarm Intelligence
  • Advanced Adaptive Filtering Techniques
  • Digital Filter Design and Implementation
  • Robotics and Automated Systems
  • Robotics and Sensor-Based Localization
  • Visual Attention and Saliency Detection
  • Technology Use by Older Adults
  • Natural Language Processing Techniques
  • Hand Gesture Recognition Systems
  • Robotic Locomotion and Control
  • Blind Source Separation Techniques
  • Video Surveillance and Tracking Methods
  • Context-Aware Activity Recognition Systems
  • Reinforcement Learning in Robotics
  • Precipitation Measurement and Analysis
  • Advanced Image and Video Retrieval Techniques

Universidad de Málaga
2014-2024

Universitat Politècnica de Catalunya
2007-2024

Hospital Universitario Virgen de la Arrixaca
2000-2012

Universidad de Murcia
2012

Universidad de Granada
1996-2006

Universities UK
1998-1999

University of York
1998

This paper describes a method of compensating for nonlinear distortions in speech representation caused by noise. The described here is based on the histogram equalization often used digital image processing. Histogram applied to each component feature vector order improve robustness recognition systems. how proposed can be robust and it compared with other compensation techniques. experiments, including results AURORA II framework, demonstrate effectiveness when either alone or combination

10.1109/tsa.2005.845805 article EN IEEE Transactions on Speech and Audio Processing 2005-04-19

Currently, there are technology barriers inhibiting speech processing systems that work in extremely noisy conditions from meeting the demands of modern applications. This letter presents a new voice activity detector (VAD) for improving detection robustness environments and performance recognition systems. The algorithm defines an optimum likelihood ratio test (LRT) involving multiple independent observations. so-defined decision rule reports significant improvements speech/nonspeech...

10.1109/lsp.2005.855551 article EN IEEE Signal Processing Letters 2005-09-20

Background. It is not known whether the pig liver capable of functioning efficiently when transplanted into a primate, neither there experience in transplanting from transgenic pigs expressing human complement regulator decay accelerating factor (h-DAF) baboon. The objective this study was to determine porcine would support metabolic functions non-human primates and establish effect hDAF expression prevention hyperacute rejection livers primates. Methods. Five orthotopic xenotransplants...

10.1097/00007890-200010150-00001 article EN Transplantation 2000-10-01

Since its introduction in 1974 by Ahmed et al., the discrete cosine transform (DCT) has become a significant tool many areas of digital signal processing, especially compression. There exist eight types transforms (DCTs). We obtain DCTs as complete orthonormal set eigenvectors generated general form matrices same way Fourier (DFT) can be obtained an arbitrary circulant matrix. These decomposed sum symmetric Toeplitz matrix plus Hankel or close to scaled some constant factors. also show that...

10.1109/78.482113 article EN IEEE Transactions on Signal Processing 1995-01-01

An effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach based on the determination of speech/nonspeech divergence by means specialized order statistics filters (OSFs) working subband log-energies. This differs from many others way decision rule formulated. Instead making current frame, it uses OSFs log-energies which significantly reduces error probability when discriminating nonspeech a signal. Clear...

10.1109/tsa.2005.853212 article EN IEEE Transactions on Speech and Audio Processing 2005-10-18

This letter shows an innovative voice activity detector (VAD) based on the Kullback-Leibler (KL) divergence measure. The algorithm is evaluated in context of recently approved ETSI standard for distributed speech recognition (DSR). VAD uses long-term information noisy signal order to define a more robust decision rule yielding high accuracy. mel-scaled filter bank log-energies (FBE) are modeled by means Gaussian distributions, and symmetric KL used estimation distance between noise...

10.1109/lsp.2003.821762 article EN IEEE Signal Processing Letters 2004-02-01

The noise usually produces a non-linear distortion of the feature space considered for Automatic Speech Recognition. This causes mismatch between training and recognition conditions which significantly degrades performance speech recognizers. In this contribution we analyze effect additive over cepstral based representations compare several approaches to compensate effect. We discuss importance non-linearities introduced by propose method (based on histogram equalization technique)...

10.1109/icassp.2002.5743739 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2002-05-01

Learning by imitation is a natural and intuitive way to teach social robots new behaviors. While these learning systems can use different sensory inputs, vision often their main or even only source of input data. However, while many vision-based robot (RLbI) architectures have been proposed in the last decade, they may be difficult compare due absence common, structured description. The first contribution this survey definition set standard components that used describe any RLbI...

10.1142/s0219843612500065 article EN International Journal of Humanoid Robotics 2012-03-01

The use of new assistive technologies in general, and Socially Assistive Robots (SAR) particular is becoming increasingly common for supporting people’s health well-being. However, it still faces many issues regarding long-term adherence, acceptability utility. Most these are due to design processes that insufficiently take into account the needs, preferences values intended users. Other related currently very limited amount evaluations, performed real world settings, SAR. This study...

10.20944/preprints202402.1568.v1 preprint EN 2024-02-28

This letter presents a new segmental nonlinear feature normalization algorithm to improve the robustness of speech recognition systems against variations acoustic environment. An experimental study best delay-performance tradeoff is conducted within AURORA-2 framework, and comparison with two commonly used algorithms presented. Computationally efficient based on order statistics are also One them linear interpolation between sampling quantiles, other one point estimation probability...

10.1109/lsp.2004.826648 article EN IEEE Signal Processing Letters 2004-04-26

Motion capture systems have recently experienced a strong evolution. New cheap depth sensors and open source frameworks, such as OpenNI, allow for perceiving human motion on-line without using invasive systems. However, these proposals do not evaluate the validity of obtained poses. This paper addresses this issue model-based pose generator to complement OpenNI tracker. The proposed system enforces kinematics constraints, eliminates odd poses filters sensor noise, while learning real...

10.3390/s130708835 article EN cc-by Sensors 2013-07-10

In recent years, commercial and research interest in service robots working everyday environments has grown. These devices are expected to move autonomously crowded environments, maximizing not only movement efficiency safety parameters, but also social acceptability. Extending traditional path planning modules with socially aware criteria, while maintaining fast algorithms capable of reacting human behavior without causing discomfort, can be a complex challenge. Solving this challenge...

10.3390/electronics12071570 article EN Electronics 2023-03-27

Over the past decades, number of robots deployed in museums, trade shows and exhibitions have grown steadily. This new application domain has become a key research topic robotics community. Therefore, are designed to interact with people these domains, using natural intuitive channels. Visual perception speech processing be considered for robots, as they should able detect their environment, recognize degree accessibility engage them social conversations. They also need safely navigate...

10.1109/icarsc.2015.19 article EN IEEE International Conference on Autonomous Robot Systems and Competitions 2015-04-01

One of the main issues within field social robotics is to endow robots with ability direct attention people whom they are interacting. Different approaches follow bio-inspired mechanisms, merging audio and visual cues localize a person using multiple sensors. However, most these fusion mechanisms have been used in fixed systems, such as those video-conference rooms, thus, may incur difficulties when constrained sensors which robot can be equipped. Besides, scope interactive autonomous...

10.3390/s140609522 article EN cc-by Sensors 2014-05-28

The use of new assistive technologies in general, and Socially Assistive Robots (SARs) particular, is becoming increasingly common for supporting people’s health well-being. However, it still faces many issues regarding long-term adherence, acceptability utility. Most these are due to design processes that insufficiently take into account the needs, preferences values intended users. Other related currently very limited amount evaluations, performed real-world settings, SARs. This study...

10.3390/robotics13040061 article EN cc-by Robotics 2024-04-09

The aging of the population in developed and developing countries, together with degree maturity reached by certain technologies, means that design care environments for elderly a high technological innovation is now being seriously considered. Assistive daily living (Ambient Assisted Living, AAL) include deployment sensors actuators home or residence where person to be cared lives so that, help necessary computational management decision-making mechanisms, can live more autonomous life....

10.3390/app14125287 article EN cc-by Applied Sciences 2024-06-19

This letter shows an effective statistical voice activity detection algorithm based on the integrated bispectrum, which is defined as a cross spectrum between signal and its square inherits ability of higher order statistics to detect signals in noise with many other additional advantages: 1) computation leads significant computational savings, 2) variance estimator same that power estimator. The decision rule formulated terms average likelihood ratio test (LRT) involving successive...

10.1109/lsp.2006.873147 article EN IEEE Signal Processing Letters 2006-07-21
Coming Soon ...