- Speech and Audio Processing
- COVID-19 diagnosis using AI
- Speech Recognition and Synthesis
- Phonocardiography and Auscultation Techniques
- Music and Audio Processing
- Phonetics and Phonology Research
- Radio Wave Propagation Studies
- Advanced Adaptive Filtering Techniques
- Infant Health and Development
- Precipitation Measurement and Analysis
- Image and Signal Denoising Methods
- Blind Source Separation Techniques
- Linguistics and Cultural Studies
- Natural Language Processing Techniques
- Stuttering Research and Treatment
- Respiratory and Cough-Related Research
- Employee Welfare and Language Studies
- Sparse and Compressive Sensing Techniques
- Hearing Loss and Rehabilitation
- Anomaly Detection Techniques and Applications
- Advanced Text Analysis Techniques
- Topic Modeling
- Text and Document Classification Technologies
- Forecasting Techniques and Applications
- Terrorism, Counterterrorism, and Political Violence
Indian Institute of Technology Guwahati (2022-2025)
Vivekananda Global University (2023)
Indian Institute of Science Bangalore (2012-2023)
International Audio Laboratories Erlangen (2023)
Amity University (2021)
Punjab Technical University (2021)
SRM University, Andhra Pradesh (2021)
Carnegie Mellon University (2019-2020)
Washington University in St. Louis (2020)
Indian Institute of Technology Kanpur (1989)
The COVID-19 pandemic presents global challenges transcending the boundaries of country, race, religion, and economy. The current gold standard method for detection is reverse transcription polymerase chain reaction (RT-PCR) testing. However, this test is expensive, time-consuming, and violates social distancing. Also, as the pandemic is expected to stay for a while, there is a need for an alternate diagnosis tool which overcomes these limitations and is deployable at large scale. The prominent symptoms include cough and breathing...
The DiCOVA Challenge aims at accelerating research in diagnosing COVID-19 using acoustics (DiCOVA), a topic at the intersection of speech and audio processing, respiratory health diagnosis, and machine learning. This is an open call for researchers to analyze a dataset of sound recordings, collected from COVID-19 infected and non-COVID-19 individuals, for a two-class classification. These recordings were collected via crowdsourcing from multiple countries, through a website application. The challenge features two tracks, one focusing on cough sounds,...
Background: The COVID-19 pandemic has highlighted the need to invent alternative respiratory health diagnosis methodologies which provide improvement with respect to time, cost, physical distancing, and detection performance. In this context, identifying acoustic bio-markers of respiratory diseases has received renewed interest. Objective: In this paper, we aim to design diagnostics based on analyzing the acoustics and symptoms data. Towards this, the data is composed of cough, breathing, and speech signals, and a symptoms record, collected using a...
Machine learning is considered the study of computer algorithms that enable a machine to learn and adapt to new data without any human intervention. Reinforcement learning is a paradigm by which a self-governing agent utilizes its experience of interacting with an environment to improve its behavior. The paper summarizes different reinforcement learning algorithms, the merits and demerits of the existing reviewed methods along with their applications and challenges, and gives future research directions.
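As a concrete illustration of the reinforcement-learning paradigm summarized above, the following minimal sketch trains a tabular Q-learning agent on a toy five-state chain. The environment, reward shaping, and hyper-parameters are illustrative assumptions, not taken from the reviewed paper.

```python
import numpy as np

n_states, n_actions = 5, 2           # chain states 0..4; actions: 0 = left, 1 = right
Q = np.zeros((n_states, n_actions))  # tabular action-value estimates
alpha, gamma, eps = 0.1, 0.95, 0.1   # learning rate, discount factor, exploration rate
rng = np.random.default_rng(0)

def step(state, action):
    """Move along the chain; reaching state 4 ends the episode with reward 1."""
    nxt = max(0, state - 1) if action == 0 else min(n_states - 1, state + 1)
    done = nxt == n_states - 1
    reward = 1.0 if done else -0.01   # small step cost encourages reaching the goal
    return nxt, reward, done

for _ in range(500):                  # episodes of interaction with the environment
    s, done = 0, False
    while not done:
        # epsilon-greedy: mostly exploit current estimates, occasionally explore
        a = rng.integers(n_actions) if rng.random() < eps else int(np.argmax(Q[s]))
        s2, r, done = step(s, a)
        # Q-learning update: nudge Q(s, a) toward the bootstrapped target
        Q[s, a] += alpha * (r + gamma * np.max(Q[s2]) * (not done) - Q[s, a])
        s = s2

print("greedy policy (0 = left, 1 = right):", np.argmax(Q[:-1], axis=1))
```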
There is a growing need for diverse, high-quality stuttered speech data, particularly in the context of Indian languages. This paper introduces Project Boli, a multi-lingual stuttered speech dataset designed to advance scientific understanding and technology development for individuals who stutter in India. The dataset constitutes (a) anonymized metadata (gender, age, country, mother tongue) and responses to a questionnaire about how stuttering affects their daily lives, (b) speech that captures both read (using the Rainbow Passage) and spontaneous...
The Second Diagnosis of COVID-19 using Acoustics (DiCOVA) Challenge aimed at accelerating research in acoustics-based detection of COVID-19, a topic at the intersection of acoustics, signal processing, machine learning, and healthcare. This paper presents the details of the challenge, which was an open call for researchers to analyze a dataset of audio recordings consisting of breathing, cough, and speech signals. The data was collected from individuals with and without COVID-19 infection, and the task in the challenge was a two-class classification. The development...
The detection of overlapping speech segments is of key importance in applications involving the analysis of multi-party conversations. The problem is challenging because overlapping segments are typically captured as short utterances in far-field microphone recordings. In this paper, we propose overlap detection using a neural network architecture consisting of long short-term memory (LSTM) models. The model learns the presence of overlap by identifying the spectrotemporal structure of overlapping speech segments. In order to evaluate the model performance, we perform experiments on simulated...
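A minimal sketch of an LSTM-based frame-level overlap detector of the kind described above, written in PyTorch. The log-mel input features, layer sizes, and the bidirectional two-layer configuration are illustrative assumptions, not the authors' exact architecture.

```python
import torch
import torch.nn as nn

class OverlapDetector(nn.Module):
    def __init__(self, n_mels: int = 64, hidden: int = 128):
        super().__init__()
        self.lstm = nn.LSTM(n_mels, hidden, num_layers=2,
                            batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, 1)   # per-frame overlap logit

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (batch, frames, n_mels) log-mel feature sequences
        seq, _ = self.lstm(feats)
        return self.head(seq).squeeze(-1)      # (batch, frames) logits

model = OverlapDetector()
criterion = nn.BCEWithLogitsLoss()
x = torch.randn(8, 200, 64)                    # dummy batch of feature sequences
y = torch.randint(0, 2, (8, 200)).float()      # dummy frame-level overlap labels
loss = criterion(model(x), y)
loss.backward()
print(float(loss))
```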
The automatic analysis of conversational audio remains difficult, in part, due to the presence of multiple talkers speaking in turns, often with significant intonation variations and overlapping speech. The majority of prior work on psychoacoustics and speech system design has focused on single-talker or multi-talker settings (for example, the cocktail party effect). There has been much less focus on how listeners detect a change of talker, or on probing the acoustic features characterizing a talker's voice. This study examines human detection...
The classical approach to A/D conversion has been uniform sampling, where we get perfect reconstruction for bandlimited signals by satisfying the Nyquist Sampling Theorem. We propose a non-uniform sampling scheme based on level crossing (LC) time information. We show stable reconstruction of bandpass signals, with the correct scale factor and hence a unique reconstruction, from the LC time information only. For reconstruction from level crossings, we make use of sparse optimization, constraining the signal to be sparse in its frequency content. While an overdetermined system of equations is resorted to in the literature, an undetermined system along...
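The sketch below illustrates the acquisition-and-reconstruction idea under simplifying assumptions: several reference levels are used so that every crossing yields a (time, level) pair, the signal is a real trigonometric polynomial, and the Fourier coefficients are recovered by a least-squares fit on the non-uniform crossing times. It is an illustration of level-crossing sampling, not the paper's exact algorithm (which works from LC timing information only, with sparse optimization).

```python
import numpy as np

rng = np.random.default_rng(1)
K = 4                                   # highest harmonic of the test signal
a, b = rng.normal(size=K), rng.normal(size=K)

def x(t):
    return sum(a[k] * np.cos(2*np.pi*(k+1)*t) + b[k] * np.sin(2*np.pi*(k+1)*t)
               for k in range(K))

# --- level-crossing acquisition on a dense time grid -----------------------
t = np.linspace(0, 1, 20000)
xt = x(t)
levels = np.linspace(-2, 2, 9)
times, values = [], []
for L in levels:
    s = np.sign(xt - L)
    idx = np.where(np.diff(s) != 0)[0]           # grid cells where the level is crossed
    # linear interpolation of the crossing instant within each cell
    tc = t[idx] + (L - xt[idx]) * (t[idx+1] - t[idx]) / (xt[idx+1] - xt[idx])
    times.append(tc)
    values.append(np.full_like(tc, L))
times, values = np.concatenate(times), np.concatenate(values)

# --- least-squares recovery of the Fourier coefficients --------------------
A = np.hstack([np.cos(2*np.pi*(k+1)*times[:, None]) for k in range(K)] +
              [np.sin(2*np.pi*(k+1)*times[:, None]) for k in range(K)])
coef, *_ = np.linalg.lstsq(A, values, rcond=None)
print(np.max(np.abs(coef - np.concatenate([a, b]))))   # should be close to zero
```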
Credit scoring plays a vital role for financial institutions in estimating the risk associated with a credit applicant and the applied product. It is estimated based on applicants' credentials and directly affects the viability of issuing institutions. However, there may be a large number of irrelevant features in the dataset. Due to such features, models lead to poorer classification performance and higher complexity. So, by removing redundant and irrelevant features, we can overcome this problem. In this work, we emphasized feature selection to enhance...
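A minimal sketch of the idea: remove irrelevant and redundant features before training a credit-scoring classifier and compare the result against a model that uses all features. The synthetic data, the mutual-information selector, and the logistic-regression model are illustrative assumptions, not the paper's exact pipeline.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, mutual_info_classif
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

# 1000 "applicants", 50 raw features of which only 8 are informative
X, y = make_classification(n_samples=1000, n_features=50, n_informative=8,
                           n_redundant=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

baseline = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
selected = make_pipeline(SelectKBest(mutual_info_classif, k=10),
                         LogisticRegression(max_iter=1000)).fit(X_tr, y_tr)

for name, model in [("all 50 features", baseline), ("10 selected features", selected)]:
    auc = roc_auc_score(y_te, model.predict_proba(X_te)[:, 1])
    print(f"{name}: AUC = {auc:.3f}")
```

The printed AUCs let one compare the reduced model against the full-feature baseline directly.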
The relative accuracy of the left and right arms in active positioning was studied in a group of 24 male right-handed undergraduates. The task required positioning at each of four angular positions (30°, 45°, 60°, 75°). One arm was more accurate than the other. There was a progressive increase in error for both arms as the arm was flexed, reducing the angle at the joint. The results are discussed in the light of suggestions concerning hemispheric superiority in the processing of kinesthetic and proprioceptive information.
Human beings know each other and communicate among themselves through thoughts and ideas. The best way to present our ideas is through speech. Some people do not have the power of speech; they can only communicate with others through sign language. Nowadays, technology has reduced this gap with systems that can be used to translate sign language. Sign language recognition (SLR) and gesture-based control are two major applications of hand gesture technologies. On one side, the controller converts the gesture into text, and the text gets converted to speech with the help of analog-to-digital conversion...
Good-quality time-scale modification (TSM) of speech and audio is a long-standing challenge. The crux of the challenge is to maintain the perceptual subtleties, i.e., the temporal variations in pitch and timbre, even after time-scaling the signal. Widely used approaches, such as the phase vocoder and waveform overlap-add (OLA), are based on a quasi-stationarity assumption, and the time-scaled signals have perceivable artifacts. In contrast to these, we propose the application of time-varying sinusoidal modeling for TSM, without any such assumption....
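A minimal, single-component illustration of time-scale modification with a time-varying sinusoidal model. The paper addresses full speech and audio; this sketch only stretches one amplitude-modulated chirp and all signal parameters are assumptions. The amplitude and instantaneous-frequency tracks are estimated from the analytic signal, resampled to the new duration, and the phase is re-integrated, so the pitch and envelope trajectories are preserved while the duration changes.

```python
import numpy as np
from scipy.signal import hilbert

fs, dur, alpha = 16000, 1.0, 1.5              # sample rate, duration (s), stretch factor
t = np.arange(int(fs * dur)) / fs
x = (0.5 + 0.4*np.sin(2*np.pi*3*t)) * np.sin(2*np.pi*(200*t + 100*t**2))  # AM chirp

analytic = hilbert(x)
amp = np.abs(analytic)                         # time-varying amplitude envelope
inst_freq = np.gradient(np.unwrap(np.angle(analytic))) * fs / (2*np.pi)   # IF in Hz

# resample the amplitude and IF tracks onto the stretched time axis
n_out = int(len(x) * alpha)
src = np.linspace(0, len(x) - 1, n_out)
amp_s = np.interp(src, np.arange(len(x)), amp)
if_s = np.interp(src, np.arange(len(x)), inst_freq)

# re-integrate phase at the original sample rate: the frequency values are
# preserved, only their time evolution is slowed down by the factor alpha
phase = 2*np.pi * np.cumsum(if_s) / fs
y = amp_s * np.sin(phase)
print(len(x) / fs, "s  ->", len(y) / fs, "s")
```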
A listening test is proposed in which human participants detect talker changes in two natural, multi-talker speech stimulus sets: a familiar language (English) and an unfamiliar language (Chinese). The miss rate, false-alarm rate, and response times (RT) showed a significant dependence on language familiarity. Linear regression modeling of the RTs was carried out using diverse acoustic features derived from the stimulus pool for the change detection task. Further, benchmarking the same task against a state-of-the-art machine diarization system shows that...
Estimating the motion parameters of moving sound sources using only the received source signal is of interest in low-power and contact-less monitoring applications, such as industrial robotics and bio-acoustics. The received signal embeds the motion attributes via the Doppler effect. In this paper, we analyze the Doppler effect on a mixture of time-varying sinusoids. Focusing on the instantaneous frequency (IF) of the signal, we show that the IF profile together with its first two derivatives can be used to obtain the motion parameters. This requires a smooth estimate of the IF profile....
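The sketch below illustrates the core idea: the Doppler effect imprints the source motion on the instantaneous frequency of the received signal, so motion parameters can be read off a smooth IF estimate. The scenario (a single tone, a straight-line constant-velocity pass, first-order Doppler) and all numerical values are illustrative assumptions, not the paper's general mixture model.

```python
import numpy as np
from scipy.signal import hilbert

c, fs = 343.0, 48000                      # speed of sound (m/s), sample rate
f0, v, d = 1000.0, 20.0, 5.0              # source tone (Hz), speed (m/s), closest distance (m)
t = np.arange(-2.0, 2.0, 1/fs)
r = np.sqrt(d**2 + (v * t)**2)            # source-to-microphone range over time
x = np.sin(2*np.pi*f0*(t - r/c)) / r      # first-order Doppler plus 1/r decay

# instantaneous frequency from the analytic signal
inst_freq = np.gradient(np.unwrap(np.angle(hilbert(x)))) * fs / (2*np.pi)

# far from the closest approach, the IF settles near f0*(1 +/- v/c)
f_app = np.median(inst_freq[(t > -1.9) & (t < -1.5)])   # approaching segment
f_rec = np.median(inst_freq[(t > 1.5) & (t < 1.9)])     # receding segment
v_hat = c * (f_app - f_rec) / (f_app + f_rec)
f0_hat = 0.5 * (f_app + f_rec)
print(f"estimated speed ~ {v_hat:.1f} m/s, source frequency ~ {f0_hat:.1f} Hz")
```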
We propose data acquisition from continuous-time signals belonging to the class of real-valued trigonometric polynomials using an event-triggered sampling paradigm. The proposed schemes are: level crossing (LC), close extrema LC, and extrema sampling. An analysis of the robustness of these schemes to jitter and bandpass additive Gaussian noise is presented. In general, event-triggered sampling will result in non-uniformly spaced sample instants. We address the issue of signal reconstruction from the acquired data-set by imposing a sparsity structure on the signal model to circumvent...
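A minimal sketch, under simplifying assumptions, of reconstructing a sparse real trigonometric polynomial from event-triggered (here: extrema) samples. The non-uniform sample set is underdetermined for the full harmonic dictionary, so sparsity is imposed via orthogonal matching pursuit. The harmonic grid, the sparsity level, and the solver are illustrative choices, not the paper's exact schemes.

```python
import numpy as np
from sklearn.linear_model import OrthogonalMatchingPursuit

rng = np.random.default_rng(2)
N = 50                                    # harmonic dictionary: k = 1..N (cos and sin)
active = np.array([7, 23, 41])            # the few harmonics actually present
c_true = np.zeros(2 * N)
c_true[active - 1] = rng.normal(size=3)          # cosine coefficients
c_true[N + active - 1] = rng.normal(size=3)      # sine coefficients

def dictionary(times):
    cols = [np.cos(2*np.pi*k*times) for k in range(1, N+1)] + \
           [np.sin(2*np.pi*k*times) for k in range(1, N+1)]
    return np.stack(cols, axis=1)

t = np.linspace(0, 1, 50000, endpoint=False)
xt = dictionary(t) @ c_true

# event-triggered acquisition: keep only the local extrema of the waveform
ext = np.where(np.diff(np.sign(np.diff(xt))) != 0)[0] + 1
t_s, x_s = t[ext], xt[ext]
print(f"{len(t_s)} extrema samples vs {2*N} unknown coefficients")

omp = OrthogonalMatchingPursuit(n_nonzero_coefs=6, fit_intercept=False)
omp.fit(dictionary(t_s), x_s)
print("max coefficient error:", np.max(np.abs(omp.coef_ - c_true)))
```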
The COVID-19 outbreak resulted in multiple waves of infections that have been associated with different SARS-CoV-2 variants. Studies have reported a differential impact of the variants on the respiratory health of infected patients. We explore whether acoustic signals, collected from COVID-19 subjects, show computationally distinguishable patterns, suggesting a possibility to predict the underlying virus variant. We analyze the Coswara dataset, which is composed of three subject pools, namely, i) healthy, ii) COVID-19 subjects recorded during the delta variant...
Traditional approaches for understanding phonological learning have predominantly relied on curated text data. Although insightful, such approaches limit the knowledge captured to textual representations of spoken language. To overcome this limitation, we investigate the potential of the Featural InfoWaveGAN model to learn iterative long-distance vowel harmony using raw speech data. We focus on Assamese, a language known for its phonologically regressive and word-bound vowel harmony. We demonstrate that the model is adept at grasping...