Kamil Wójcicki

ORCID: 0009-0004-5867-2069
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Speech and Audio Processing
  • Advanced Adaptive Filtering Techniques
  • Hearing Loss and Rehabilitation
  • Speech Recognition and Synthesis
  • Acoustic Wave Phenomena Research
  • Blind Source Separation Techniques
  • Ultrasonics and Acoustic Wave Propagation
  • Music and Audio Processing
  • Phonetics and Phonology Research
  • Advanced Data Compression Techniques
  • Structural Health Monitoring Techniques

Cisco Systems (United States)
2024

Starkey Hearing Technologies (United States)
2015-2016

The University of Texas at Dallas
2012-2014

Griffith University
2007-2011

Typical speech enhancement algorithms operate on the short-time magnitude spectrum, while keeping phase spectrum unchanged for synthesis. We propose a novel approach where noisy is recombined with changed to produce modified complex spectrum. During synthesis, low energy components of cancel out more than high components, thus reducing background noise. Using objective quality measures, informal subjective listening tests and spectrogram analysis, we show that proposed method results in...

10.1109/lsp.2008.923579 article EN IEEE Signal Processing Letters 2008-01-01

In speech processing the short-time magnitude spectrum is believed to contain most of information about intelligibility and it normally computed using Fourier transform over 20-40 ms window duration. this paper, we investigate effect analysis duration on in a systematic way. For purpose, both subjective objective experiments are conducted. The experiment form consonant recognition task by human listeners, whereas an automatic (ASR) task. our various durations investigated. construct stimuli...

10.1109/icspcs.2010.5709770 article EN 2010-12-01

Typical speech enhancement algorithms operate on the shorttime magnitude spectrum, while keeping short-time phase spectrum unchanged for synthesis. Recently, a novel approach to has been proposed where noisy is recombined with changed produce modified complex spectrum. During synthesis low energy components of cancel out more than high components, thus reducing background noise. In present work, procedure that employs noise estimates compensate additive distortion formulated. The objectively...

10.21437/interspeech.2008-163 article EN Interspeech 2022 2008-09-22

In this letter, we investigate the effect of analysis window duration on speech intelligibility in a systematic way. processing, short-time magnitude spectrum is believed to contain majority intelligible information. Consequently, our experiments, construct stimuli based purely spectrum. We conduct subjective listening tests form consonant recognition task assess as function duration. investigations, also employ three objective measures transmission index (STI). The experimental results show...

10.1109/lsp.2008.2005755 article EN IEEE Signal Processing Letters 2008-01-01

Background noise reduces the depth of low-frequency envelope modulations known to be important for speech intelligibility. The relative strength target and masker can quantified using a modulation signal-to-noise ratio, (S/N)mod, measure. Such measure used in noise-suppression algorithms extract target-relevant from corrupted (target + masker) envelopes potential improvement In present study, are decomposed spectral domain into number channels spanning range 0–30 Hz. Target-dominant...

10.1121/1.3688488 article EN The Journal of the Acoustical Society of America 2012-04-01

The speech signal is used for transmission of linguistic information. High energy portions the spectrum have higher signal-to-noise ratios than low portions. As a result, these regions are more robust to noise. Since known be very noise, it expected that high carry majority This letter tries derive frequency warping function directly from by sampling axis nonuniformly with sampled densely regions. To achieve this, an ensemble average short-time power computed large corpus....

10.1109/lsp.2009.2014096 article EN IEEE Signal Processing Letters 2009-03-09

In this paper, we propose the modulation-domain Kalman filter (MDKF) for speech enhancement. contrast to previous modulation domain-enhancement methods based on bandpass filtering, MDKF is an adaptive and linear MMSE estimator that uses models of temporal changes magnitude spectrum both noise. Also, because a joint phase estimator, under non-stationarity assumptions, it highly suited modulationdomain processing, as tends contain more information than acoustic phase. Experimental results from...

10.21437/interspeech.2010-330 article EN Interspeech 2022 2010-09-26

Recent studies on binary masking techniques make the assumption that each time-frequency (T-F) unit contributes an equal amount to overall intelligibility of speech. The present study demonstrated importance T-F speech varies in accordance with content. Specifically, units are categorized into two classes, speech-present and speech-absent units. Results indicate is highly related loudness its target component, while according masker component. Two types mask errors also considered, which...

10.1121/1.4869088 article EN The Journal of the Acoustical Society of America 2014-05-01

The short-time Fourier transform (STFT) of a speech signal has two components: the magnitude spectrum and phase spectrum. It is traditionally believed that plays dominant role for perception at small window durations (20-40 ms). However, recent perceptual studies have shown can contribute as much to intelligibility was observed use rectangular (non-tapered) analysis computation more advantageous than Hamming (tapered) window. This paper investigates effect dynamic range an on phase-only...

10.1109/icassp.2007.367016 article EN 2007-04-01

A comparative evaluation of speech enhancement algorithms for robust automatic recognition is presented. The performed on a core test set the TIMIT corpus. Mean objective quality scores as well ASR correctness under two noise conditions are given.

10.1109/icspcs.2010.5709761 article EN 2010-12-01

In this paper, we propose to combine the Kalman filter with a recent speech enhancement technique, called phase spectrum compensation procedure, or PSC. More specifically, apply PSC technique initialise filter, whereby is used clean noisy prior LPC estimation for recursion. We refer combined as Kalman-PSC filter. Using an objective quality measure, formal subjective listening tests and spectrogram analysis, show that proposed method results in improved quality.

10.1109/icassp.2009.4960606 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2009-04-01

In this paper we investigate the enhancement of speech by applying MMSE short-time spectral magnitude estimation in modulation domain. For purpose, traditional analysismodification-synthesis framework is extended to include domain processing. We compensate noisy spectrum for additive noise distortion algorithm Subjective experiments were conducted compare quality stimuli processed estimator those using acoustic and subtraction method. The proposed method shown have better suppression than...

10.21437/interspeech.2011-425 article EN Interspeech 2022 2011-08-27

With the advent of generative audio features, there is an increasing need for rapid evaluation their impact on speech intelligibility. Beyond existing laboratory measures, which are expensive and do not scale well, has been comparatively little work crowdsourced assessment Standards recommendations yet to be defined, publicly available multilingual test materials lacking. In response this challenge, we propose approach a intelligibility assessment. We detail design, collection public release...

10.1109/icassp48485.2024.10447869 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Mask-based objective speech-intelligibility measures have been successfully proposed for evaluating the performance of binary masking algorithms. These were computed directly by comparing estimated mask against ground truth ideal (IdBM). Most these measures, however, assign equal weight to all time-frequency (T-F) units. In this study, we propose improve existing mask-based weighting each T-F unit according its target or masker loudness. The measure shows significantly better than two other measures.

10.1109/icassp.2013.6639025 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2013-05-01

In this paper we investigate the modulation domain as an alternative to acoustic for speech enhancement. More specifically, wish determine how competitive is spectral subtraction compared domain. For purpose, extend traditional analysis-modification-synthesis framework include processing. We then compensate noisy spectrum additive noise distortion by applying algorithm in Using subjective listening tests and objective quality evaluation show that proposed method results improved quality....

10.21437/interspeech.2009-413 article EN Interspeech 2022 2009-09-06

This paper introduces novel two-channel a priori Signal-to-Noise Ratio (SNR) estimators for use in frequency-domain speech enhancement algorithms. The SNR estimation is based on statistics of the noisy phase difference between two microphones each frequency bin. Namely, corresponding probability distribution derived assuming complex Gaussian model, and written terms only. shifts problem into classical statistical parameter problem, which we propose to solve via an online version Method...

10.1109/iwaenc.2016.7602935 article EN 2016-09-01

In this paper, we investigate the validity of common assumption made in Wiener filtering that clean speech and noise signals are uncorrelated under short-time analysis typically used for enhancement. order to achieve have performed enhancement experiments, where corrupted by additive white Gaussian is enhanced a filter designed time as well frequency domains. Results oracle-style experiments confirm inclusion additivity results negligible degradation quality. Informal listening tests show...

10.21437/interspeech.2007-302 article EN Interspeech 2022 2007-08-27

This work presents a single channel speech enhancement approach aimed at improving clarity for hearing impaired listeners under challenging listening conditions. The proposed method applies nonlinear distortions to components isolated from the observed noisy signal using aggressive enhancement. enhanced are then mixed back into signal. results show that significantly improves in noise.

10.1109/waspaa.2015.7336926 article EN 2015-10-01
Coming Soon ...