NFDI4DS | UHH-SEMS - Publication Details

Telemanipulation Assistance Based on Motion Intention Recognition

OPENALEX - Publications

Wentao Yu Redwan Alqasemi Rajiv Dubey N. Pernalete

In telemanipulation systems, assistance through variable position/velocity mapping or virtual fixture can improve manipulation capability and dexterity [3, 5, 6, 7, 8]. Conventionally, such is based on the sensory data of environment without knowing user's motion intention. this paper, intention combined with real-time information for applying appropriate assistance. If current task following a path, applied. aligning end-effector target, an attractive force field produced. Similarly, if...

10.1109/robot.2005.1570266 article EN 2006-01-18

Tabletop augmented reality 3D display system based on integral imaging

OPENALEX - Publications

Han-Le Zhang Huan Deng Wentao Yu Min-Yang He Dahai Li and 1 more

In this paper, we propose a tabletop augmented reality (AR) three-dimensional (3D) display system based on integral imaging by using holographic optical element (HOE) that performs the function of microlens array. The array HOE records wavefronts set tilted spherical waves theory reflective volume holograms. reconstruction, an elemental image projected projector and collimated relay optics satisfies Bragg matching conditions reconstructs waves. Based imaging, 3D is generated tilting in...

10.1364/josab.34.000b16 article EN Journal of the Optical Society of America B 2017-02-15

A Fault Diagnosis Method for Rotating Machinery Based on PCA and Morlet Kernel SVM

OPENALEX - Publications

Shaojiang Dong Dihua Sun Baoping Tang Zhengyuan Gao Wentao Yu and 1 more

A novel method to solve the rotating machinery fault diagnosis problem is proposed, which based on principal components analysis (PCA) extract characteristic features and Morlet kernel support vector machine (MSVM) achieve classification. Firstly, gathered vibration signals were decomposed by empirical mode decomposition (EMD) obtain corresponding intrinsic function (IMF). The EMD energy entropy that includes dominant information defined as features. However, extracted remained...

10.1155/2014/293878 article EN cc-by Mathematical Problems in Engineering 2014-01-01

Development of a robotic haptic interface to assist the performance of vocational tasks by people with disabilities

OPENALEX - Publications

N. Pernalete Wentao Yu Rajiv Dubey Wilfrido Moreno

This paper describes the development of intelligent mapping from a haptic user interface to remote manipulator assist individuals with disabilities performing vocational tasks. mapping, referred an assistance function, is determined on basis environmental model or sensory data guide motion telerobotic while given task. Human input enhanced rather than superseded by computer. Three manual dexterity assessment tests, commonly used in occupational therapy field, were chosen implement several...

10.1109/robot.2002.1014717 article EN 2003-06-25

Bearing degradation state recognition based on kernel PCA and wavelet kernel SVM

OPENALEX - Publications

Shaojiang Dong Dihua Sun Baoping Tang Zhengyuan Gao Yingrui Wang and 2 more

In order to effectively recognize the bearing’s running state, a new method based on kernel principal component analysis (KPCA) and Morlet wavelet support vector machine (MWSVM) was proposed. First, gathered vibration signals were decomposed by empirical mode decomposition (EMD) obtain corresponding intrinsic function (IMF). The EMD energy entropy that includes dominant fault information is defined as characteristic features. However, extracted features remained high-dimensional, excessive...

10.1177/0954406214563235 article EN Proceedings of the Institution of Mechanical Engineers Part C Journal of Mechanical Engineering Science 2014-12-11

Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition

OPENALEX - Publications

Wentao Yu Steffen Zeiler Dorothea Kolossa

For many small- and medium-vocabulary tasks, audio-visual speech recognition can significantly improve the rates compared to audio-only systems. However, there is still an ongoing debate regarding best combination strategy for multi-modal information, which should allow translation of these gains large-vocabulary recognition. While integration at level state-posterior probabilities, using dynamic stream weighting, almost universally helpful small-vocabulary systems, in recognition, accuracy...

10.23919/eusipco47968.2020.9287841 article EN 2021 29th European Signal Processing Conference (EUSIPCO) 2020-12-18

Fusing Information Streams in End-to-End Audio-Visual Speech Recognition

OPENALEX - Publications

Wentao Yu Steffen Zeiler Dorothea Kolossa

End-to-end acoustic speech recognition has quickly gained widespread popularity and shows promising results in many studies. Specifically the joint transformer/CTC model pro-vides very good performance tasks. However, under noisy distorted conditions, still degrades notably. While audio-visual can significantly improve rate of end-to-end models such poor it is not obvious how to best utilize any available information on visual signal quality reliability these models. We thus consider...

10.1109/icassp39728.2021.9414553 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021-05-13

Federated Learning in ASR: Not as Easy as You Think

OPENALEX - Publications

Wentao Yu Jan Freiwald Sören Tewes Fabien Huennemeyer Dorothea Kolossa

With the growing availability of smart devices and cloud services, personal speech assistance systems are increasingly used on a daily basis. Most redirect voice recordings to central server, which uses them for upgrading recognizer model. This leads major privacy concerns, since private data could be misused by server or third parties. Federated learning is decentralized optimization strategy that has been proposed address such concerns. Utilizing this approach, on-device training....

10.48550/arxiv.2109.15108 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Robotic therapy for persons with disabilities using Hidden Markov Model based skill learning

OPENALEX - Publications

Wentao Yu Rajiv Dubey N. Pernalete

This paper describes the Hidden Markov Model (HMM) based skill learning and its application in a motion therapy system using haptic interface. A relatively complex task, requiring along labyrinth is used. normal subject executes this task for number of times best trajectory selected as learned skill, which considered virtual therapist who can train persons with disabilities to complete task. Two on upper limb (cerebral palsy) were trained therapist. The performance before after training,...

10.1109/robot.2004.1308129 article EN 2004-01-01

Reliability-Based Large-Vocabulary Audio-Visual Speech Recognition

OPENALEX - Publications

Wentao Yu Steffen Zeiler Dorothea Kolossa

Audio-visual speech recognition (AVSR) can significantly improve performance over audio-only for small or medium vocabularies. However, current AVSR, whether hybrid end-to-end (E2E), still does not appear to make optimal use of this secondary information stream as the is clearly diminished in noisy conditions large-vocabulary systems. We, therefore, propose a new fusion architecture-the decision net (DFN). A broad range time-variant reliability measures are used an auxiliary input...

10.3390/s22155501 article EN cc-by Sensors 2022-07-23

Modal Analysis of Aeronautic Spiral Bevel Gear in the Temperature Field

OPENALEX - Publications

Wentao Yu Shijun Liu Wenbo Xu Dongfei Wang

The temperature of the bevel gear tooth surface will increase greatly because its special structure. Modal is characteristic reaction In order to design and manufacture higher quality aerial gears, it necessary study modal gears under action field. Firstly, a field calculation model established according heat transfer theory. Then, based on traditional linear theory, analysis influence was considering rotational speed stiffness. Based model, driven certain aviation carried out. It can be...

10.1155/2022/1707808 article EN cc-by Journal of Sensors 2022-09-13

Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition

OPENALEX - Publications

Wentao Yu Steffen Zeiler Dorothea Kolossa

For many small- and medium-vocabulary tasks, audio-visual speech recognition can significantly improve the rates compared to audio-only systems. However, there is still an ongoing debate regarding best combination strategy for multi-modal information, which should allow translation of these gains large-vocabulary recognition. While integration at level state-posterior probabilities, using dynamic stream weighting, almost universally helpful small-vocabulary systems, in recognition, accuracy...

10.48550/arxiv.2007.14223 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Eye-hand coordination assessment using a robotic haptic interface

OPENALEX - Publications

N. Pernalete Ramakrishna Gottipati S. Mikkilineni Sandra J. Edwards Eric McCann and 2 more

We discuss the possibility of improving eyehand coordination in children diagnosed with this problem, using a robotic mapping from haptic user interface to virtual environment. Our goal is develop an assessment and training procedure that will result handwriting taking advantage force feedback provided by device. Force can be used guide subject's hand predetermined trajectory when he/she unable move response visual feedback. also incorporate inertia viscosity effects decrease tremor as well...

10.1109/robot.2004.1307168 article EN 2004-01-01

Adaptive Resampling and Weighted Ensemble method for Dynamic Imbalance Data Stream Classification

OPENALEX - Publications

Tuyi Zhang Sanmin Liu Subin Huang Ping Zhang Xinquan Chen and 2 more

<title>Abstract</title> Class imbalance inevitably occurs in dynamic data stream scenarios and can pose tremendous challenges for mining. To address these challenges, an adaptive resampling weighted ensemble method (ARWE) is proposed this paper. First, the subdivision Poisson (DSPR) module ARWE developed to class problem thedata stream. DSPR combines local information from minority samples with rate design a sample-weighting scheme that enhance visibility of samples, particularly those at...

10.21203/rs.3.rs-5021068/v1 preprint EN cc-by Research Square (Research Square) 2024-10-08

Telemanipulation enhancement through user's motion intention recognition and fixture assistance

OPENALEX - Publications

Wentao Yu Rajiv Dubey N. Pemalete

In telemanipulation systems, assistance of virtual fixture can improve manipulation capability and dexterity. This provides aids not only for path following, but also reaching target avoiding obstacles. Conventionally, these assistances are based on the environment information, without knowing user's motion intention. this paper, intention is combined with real-time information applying appropriate assistance. If current task following a path, hard orthogonal to applied. Or if position...

10.1109/iros.2004.1389741 article EN 2005-04-01

Efficient multiband excitation linear predictive coding of speech at 1.6 kbps

OPENALEX - Publications

Wentao Yu Cheung-Fat Chan

10.21437/eurospeech.1995-66 article EN 1995-09-18

Large-vocabulary Audio-visual Speech Recognition in Noisy Environments

OPENALEX - Publications

Wentao Yu Steffen Zeiler Dorothea Kolossa

Audio-visual speech recognition (AVSR) can effectively and significantly improve the rates of small-vocabulary systems, compared to their audio-only counterparts. For large-vocabulary however, there are still many difficulties, such as unsatisfactory video accuracies, that make it hard over baselines. In this paper, we specifically consider scenarios, focusing on task LRS2 database, where performance is far superior video-only making an interesting challenging setup for multi-modal...

10.1109/mmsp53017.2021.9733452 preprint EN 2021-10-06

Gear Fault Diagnosis based on Variational Mode Decomposition and Envelope Spectrum

OPENALEX - Publications

Yingjie Wang Wentao Yu Jiany Qin

Multi-component non-stationary vibration signals produced at local gear fault can easily be covered by periodic harmonic signal and strong background noise, thus causing difficulty in selecting features diagnosing state.To address this issue, a diagnosis method based on variational mode decomposition (VMD) envelope spectrum was proposed study, which then employed to select gear-fault from multi-component adaptively, extract characteristic frequency, determine health conditions of the...

10.25103/jestr.114.09 article EN cc-by-nc Journal of Engineering Science and Technology Review 2018-08-01

Development of a Telerobotic System to Assist Persons With Disabilities

OPENALEX - Publications

N. Pernalete Wentao Yu Bettina Fritz Rajiv Dubey

This paper describes the development of intelligent mapping from a haptic user interface to remote manipulator assist individuals with disabilities performing manipulation tasks. mapping, referred an assistance function, is determined on basis environmental model or sensory data guide motion telerobotic while given task. Human input enhanced rather than superseded by computer. Three manual dexterity assessment tests commonly used in occupational therapy field were chosen implement several...

10.1115/imece2002-32668 article EN Dynamic Systems and Control 2002-01-01

Probabilistic Plan Recognition Under Temporal Logic Tasks

OPENALEX - Publications

Wentao Yu Hao Fang Daiying Tian

In this paper, we consider the plan recognition problem in real-time strategy game. A probabilistic algorithm is proposed to predict future goals and identify temporal logic tasks of non-cooperative agent based on observations. order model tasks, library composed Finite Transition System Nondeterministic Büchi Automation. Specially, provide a unified framework combine planning, propose probability calculation calculate posterior distribution tasks. Finally, verify effectiveness by compared...

10.23919/chicc.2019.8866173 article EN 2019-07-01

Probabilistic Plan Recognition for Multi-Agent Systems under Temporal Logic Tasks

OPENALEX - Publications

Wentao Yu Shanghao Li Daiying Tian Jinqiang Cui

This paper studies the plan recognition problem of multi-agent systems with temporal logic tasks. The high-level tasks are represented as linear (LTL). We present a probabilistic algorithm to predict future goals and identify agent based on observations their states actions. subsequently build library composed Nondeterministic Bu¨chi Automation model also propose Boolean matrix generation map trajectories task parse matrix. Then, probability calculation formula is proposed calculate...

10.3390/electronics11091352 article EN Electronics 2022-04-24

Multiband excitation coding of speech at 2.0 kbps

OPENALEX - Publications

Wentao Yu Cheung-Fat Chan

A 2.0 kbps high quality speech coder is presented in the paper. The coding technique based on multiband excitation (MBE) model which efficient modelling of excitation. spectral envelope by linear predictive (LPC) quantization LPC parameters employs a 2-stage split residual vector (SRVQ) scheme. Performance comparisons with other schemes using discrete cosine transform DCT) and two-dimensional differential LSP (2DdLSP) are given.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML"...

10.1109/sipnn.1994.344850 article EN 2002-12-17

Fusing information streams in end-to-end audio-visual speech recognition

OPENALEX - Publications

Wentao Yu Steffen Zeiler Dorothea Kolossa

End-to-end acoustic speech recognition has quickly gained widespread popularity and shows promising results in many studies. Specifically the joint transformer/CTC model provides very good performance tasks. However, under noisy distorted conditions, still degrades notably. While audio-visual can significantly improve rate of end-to-end models such poor it is not obvious how to best utilize any available information on visual signal quality reliability these models. We thus consider question...

10.48550/arxiv.2104.09482 preprint EN cc-by-sa arXiv (Cornell University) 2021-01-01