- Speech and Audio Processing
- Advanced Adaptive Filtering Techniques
- Underwater Acoustics Research
- Acoustic Wave Phenomena Research
- Hearing Loss and Rehabilitation
- Advanced Statistical Methods and Models
- Blind Source Separation Techniques
- Music and Audio Processing
- Structural Health Monitoring Techniques
- Advanced Computational Techniques and Applications
- Indoor and Outdoor Localization Technologies
- Direction-of-Arrival Estimation Techniques
- Advanced Image and Video Retrieval Techniques
- Visual Attention and Saliency Detection
- Algebraic structures and combinatorial models
- Advanced Topics in Algebra
- Video Surveillance and Tracking Methods
- Anomaly Detection Techniques and Applications
- Remote Sensing and LiDAR Applications
- Advanced Measurement and Detection Methods
- Rings, Modules, and Algebras
- Advanced Algorithms and Applications
- Image and Video Quality Assessment
- Fault Detection and Control Systems
- Advanced Vision and Imaging
Institute of Electronics
2023-2024
Australian National University
2017-2023
PLA Army Engineering University
2015-2017
Jinling Institute of Technology
2014
Chengdu University of Technology
2014
Central South University
2011-2012
National University of Defense Technology
2008-2011
Anhui University
2011
China Academy of Launch Vehicle Technology
2011
Subspace approach of MUSIC (multiple signal classication) has become one the most popular multi-source direction arrival (DOA) estimations due to its easy implementation in practice. However, localization accuracy is vulnerable noise. This paper develops a novel algorithm, more suitable noisy environments, using relative sound pressure measurements higher order microphone array. proposed also decomposed into spherical harmonics domain where frequency smoothing technique allowed de-correlate...
This paper presents an unsupervised multi-source localization algorithm using a recently introduced feature called the relative harmonic coefficients. We derive closed-form expression of and briefly summarize its unique properties. then exploit this to develop single-source frame/bin detector which simplifies challenging problem multiple source into single problem. show that underlying method is suitable for overlapped, disjoint as well simultaneous recordings. Experimental results in both...
This article develops a semi-supervised algorithm to address the challenging multi-source localization problem in noisy and reverberant environment, using spherical harmonics domain source feature of relative harmonic coefficients. We present comprehensive research this feature, including (i) an illustration confirming its sole dependence on position, (ii) estimator presence noise, (iii) selector exploiting inherent directivity over space. Source features at varied modes, representing unique...
Direction-of-arrival (DOA) estimation for multiple simultaneous speakers in reverberant environments is still one of the challenging tasks audio signal processing field. A recent approach addresses this problem using a spherical harmonics domain feature named <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">relative harmonic coefficients</i> (RHC). Based on bin-wise operation across STFT (short-time Fourier transform) domain, method detects...
The relative harmonic coefficients (RHC), recently introduced as a multi-microphone spatial feature, demonstrates promising performance when applied to direction-of-arrival (DOA) estimation. All existing RHC-based DOA estimators suffer from resolution limitation due the inherent grid-based search. In contrast, this paper utilizes first-order RHC propose closed-form estimator by deriving direction vector, which points towards desired source direction. Two objective metrics, namely...
This paper proposes a data-driven source localization approach under noisy and reverberant environment, using newly defined feature named relative harmonic coefficients (RHC) in the modal domain. Being independent of signal, RHC is capable localizing sound source(s) located at unknown position(s). Two distinctive multi-view Gaussian process (MVGP), (i) multi-frequency views (ii) multi-mode views, are developed for regression (GPR) to reveal mapping function from corresponding location. We...
The accuracy and perception of soundfields produced by loudspeaker arrays are strongly influenced the inherent characteristics commercial loudspeakers. This paper analyzes such loudspeakers deriving equivalent theoretical models, studying their impact on soundfield reproduction. A number acoustic models investigated, including plane waves decomposition, point source decomposition mixed decomposition. Each proposed model employs three effective sparse algorithms for optimized solutions,...
Traditionally, direction-of-arrival (DOA) estimations under near- and far-field scenarios are treated as independent tasks based on the corresponding acoustic model, hence necessitating a proper soundfield detector an upstream processing tool, whereas there may not be distinct boundary between different types, especially mixed-field where both sources coexist simultaneously. To handle this issue, article investigates multisource DOA estimator that equally localizes multiple near-, far-,...
Traditional source direction-of-arrival (DOA) estimation algorithms generally localize the elevation and azimuth simultaneously, requiring an exhaustive search over two-dimensional (2-D) space. By contrast, this paper presents two decoupled DOA using a recently introduced feature called relative harmonic coefficients. They are capable to recover source's separately, since components in coefficients decoupled. The proposed highlighted by large reduction of computational complexity, thus...
Ship detection is an important stage for the sea- area surveillance and many algorithms have been proposed dealing with such tasks. Nevertheless, most of them are designed large-sized ships not efficient small ones. In this paper, we present a novel method based on cumulative projection curve(CPC) to estimate number size. We firstly compute Mahalanobis distance between each pixel image intensities distribution water, then project these distances their near coastline vertically. The projected...
Loudspeaker array design for spatial soundfield reproduction is a popular research topic with plethora of past work and potential applications. In soundfiled theory, algorithms are often developed assuming arrays ideal point sources. However in practice, the directional characteristics commercial loudspeakers far from that Therefore, order to successfully realize existing theoretical solutions reproduction, it crucial understand model incident response used system as observed over target...
Monaural speech enhancement is a key yet challenging problem for many important real world applications. Recently, deep neural networks(DNNs)-based methods, which extract useful feature from complex feature, have demonstrated remarkable performance improvement. In this paper, we present novel DNN architecture monaural enhancement. Taking into account the masking properties of human auditory system, piecewise gain function applied in proposed architecture, used to reduce noise and make...
Nonnegative matrix factorization (NMF) is one of the most popular machine learning tools for speech enhancement. The supervised NMF-based enhancement accomplished by updating iteratively with prior knowledge clean and noise spectra bases. However, in many real-world scenarios, it not always possible conducting any training. traditional semi-supervised NMF (SNMF) version overcomes this shortcoming while performance degrades. In letter, without noise, we present an improved algorithm combining...
Accurate direction-of-arrival (DOA) estimation of multiple sources, simultaneously active in a reverberant environment, remains challenge, as the multi-path acoustic reflections and overlapped periods dramatically distort direct-path wave propagation. This article proposes prominent solution localizing sources environment using closed-form estimates, circumventing any exhaustive search over two-dimensional directional space. Apart from low complexity cost, algorithm has robustness to...
A spherical harmonics domain source feature called relative harmonic coefficients (RHC) has recently been applied to address the direction-of-arrival (DOA) estimation problem. This paper presents a compact evaluation and comparison between two existing RHC based DOA estimators: (i) method using full grid search over two-dimensional (2-D) directional space, (ii) decoupled estimator which uses one-dimensional (1-D) separately localize source's elevation azimuth. We also propose new gradient...
The conventional non-negative matrix factorization (NMF)-based speech enhancement is accomplished by updating iteratively with the prior knowledge of clean and noise spectra bases. With probabilistic estimation whether present or not in a certain frame, this letter proposes algorithm incorporating presence probability (SPP) obtained via to NMF process. To take advantage both NMF-based statistical model-based approaches, final enhanced achieved applying filter output SPP weighted NMF....
The conventional definition of a depth function is vector-based. In this paper, novel projection (PD) technique directly based on tensors, such as matrices, instead proposed. Tensor (TPD) still an ideal and its computation can be achieved through the iteration PD. Furthermore, we also discuss cases for sparse samples higher order tensors. Experimental results in data classification with two depths show that TPD performs much better than PD natural tensor form, even when have vector appears...
This paper targets at enhancing the noisy soundfield over entire recording area and all individual channels, while preserving spatial clues of original soundfield. For goal, we utilize a recently proposed spherical harmonics (SH) domain feature denoted relative harmonic coefficients (RHC) as it compactly contains source's information. Specifically, (i) propose an unbiased estimator RHC in environments; (ii) estimate source signal environments using SH beamformer; (iii) enhance by multiplying...