NFDI4DS | UHH-SEMS - Publication Details

Md. Jahangir Alam

ORCID: 0000-0002-3743-9661

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100737270

Research Areas

Speech and Audio Processing
Speech Recognition and Synthesis
Music and Audio Processing
Advanced Adaptive Filtering Techniques
Advanced Steganography and Watermarking Techniques
Digital Media Forensic Detection
Hand Gesture Recognition Systems
Hearing Loss and Rehabilitation
Chaos-based Image/Signal Encryption
Video Coding and Compression Technologies
Advanced Data Compression Techniques
Advanced Software Engineering Methodologies
Human Pose and Action Recognition
Multimedia Communication and Technology
Software Engineering Research
Advanced Vision and Imaging
Spectroscopy and Laser Applications
Image and Object Detection Techniques
Atmospheric chemistry and aerosols
Wireless Communication Networks Research
Natural Language Processing Techniques
Manufacturing Process and Optimization
Infant Health and Development
Medical Image Segmentation Techniques
Model-Driven Software Engineering Techniques

Computer Research Institute of Montréal
2011-2024

United States Military Academy
2024

Auburn University
2021

King Khalid University
2019-2020

University of Louisville
2015-2018

Powerlink Queensland (Australia)
2016

UNSW Sydney
2012-2015

University of Canberra
2013-2015

Deakin University
2014

UNSW Canberra
2012-2013

Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification

OPENALEX - Publications

Pavel Matějka Ondřej Glembek Fabio Castaldo Md. Jahangir Alam Oldřich Plchot and 3 more

In this paper, we describe recent progress in i-vector based speaker verification. The use of universal background models (UBM) with full-covariance matrices is suggested and thoroughly experimentally tested. i-vectors are scored using a simple cosine distance advanced techniques such as Probabilistic Linear Discriminant Analysis (PLDA) heavy-tailed variant PLDA (PLDA-HT). Finally, investigate into dimensionality reduction before entering the PLDA-HT modeling. results very competitive: on...

10.1109/icassp.2011.5947436 article EN 2011-05-01

Imperceptible and Robust Blind Video Watermarking Using Chrominance Embedding: A Set of Approaches in the DT CWT Domain

OPENALEX - Publications

Md. Asikuzzaman Md. Jahangir Alam Andrew Lambert Mark R. Pickering

Illegal distribution of a digital movie is significant threat to the film industries. With advent high-speed broadband Internet access, pirated copy video can be easily distributed global audience. Digital watermarking possible means limiting this type distribution. In existing methods, watermark usually embedded into luminance channel frame, which affects imperceptibility. addition, none techniques are robust combination commonly used attacks, such as compression, upscaling, rotation,...

10.1109/tifs.2014.2338274 article EN IEEE Transactions on Information Forensics and Security 2014-07-11

Robust DT CWT-Based DIBR 3D Video Watermarking Using Chrominance Embedding

OPENALEX - Publications

Md. Asikuzzaman Md. Jahangir Alam Andrew Lambert Mark R. Pickering

The popularity of 3D video is increasing daily due to the availability low-cost televisions and high-speed Internet access. However, currently contents can be distributed illegally without any protection. For views generated using a depth-image-based rendering technique, not only left right as content, but also center, left, or individually 2D content. As digital watermarking possible way protecting these from unauthorized distribution, in this paper, we propose method for rendered video. In...

10.1109/tmm.2016.2589208 article EN IEEE Transactions on Multimedia 2016-07-07

Recursive Joint Cross-Modal Attention for Multimodal Fusion in Dimensional Emotion Recognition

OPENALEX - Publications

R. Gnana Praveen Md. Jahangir Alam

10.1109/cvprw63382.2024.00483 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2024-06-17

Development of CRIM system for the automatic speaker verification spoofing and countermeasures challenge 2015

OPENALEX - Publications

Md. Jahangir Alam Patrick Kenny Gautam Bhattacharya Themos Stafylakis

The automatic speaker verification spoofing and countermeasures challenge 2015 provides a common framework for the evaluation of or anti-spoofing techniques in presence various seen unseen attacks. This contribution proposes system consisting amplitude, phase, linear prediction residual, combined amplitude - phase-based detection In this task we use following features: Mel-frequency cepstral coefficients (MFCC), product spectrum-based coefficients, modified group delay weighted residual...

10.21437/interspeech.2015-469 article EN Interspeech 2022 2015-09-06

Adaptive skin color model for hand segmentation

OPENALEX - Publications

Ahmad Yahya Dawod Junaidi Abdullah Md. Jahangir Alam

Hand segmentation is often the first step in applications such as gesture recognition, hand tracking and recognition. We propose a new technique for of color images using adaptive skin model. Our method captures pixel values person's converts them into YCbCr space. The will then map CbCr space to plane construct clustered region person. Edge detection applied cluster order create an boundaries classification. Experimental results demonstrate successful over variety variations color,...

10.1109/iccaie.2010.5735129 article EN International Conference on Computer Applications and Industrial Electronics 2010-12-01

JFA-based front ends for speaker recognition

OPENALEX - Publications

Patrick Kenny Themos Stafylakis Pierre Ouellet Md. Jahangir Alam

We discuss the limitations of i-vector representation speech segments in speaker recognition and explain how Joint Factor Analysis (JFA) can serve as an alternative feature extractor a variety ways. Building on work Zhao Dong, we implemented variational Bayes treatment JFA which accommodates adaptation universal background models (UBMs) natural way. This allows us to experiment with several types features for recognition: factors diagonal addition i-vectors, extracted without UBM each case....

10.1109/icassp.2014.6853889 article EN 2014-05-01

Supervised/Unsupervised Voice Activity Detectors for Text-dependent Speaker Recognition on the RSR2015 Corpus

OPENALEX - Publications

Patrick Kenny Themos Stafylakis Pierre Ouellet Md. Jahangir Alam Pierre Dumouchel

10.21437/odyssey.2014-14 article EN 2014-06-16

A blind and robust video watermarking scheme in the DT CWT and SVD domain

OPENALEX - Publications

Md. Asikuzzaman Md. Jahangir Alam Mark R. Pickering

The piracy of a digital movie is significant problem for studios and producers but can be prevented by video watermarking. In existing watermarking algorithms, robustness to several attacks on the watermark has been improved. However, none these techniques are robust combination common geometric distortions scaling, rotation, cropping downscaling in resolution with other such as compression. this paper, blind algorithm proposed where embedded singular values dual-tree complex wavelet...

10.1109/pcs.2015.7170090 article EN 2015-05-01

Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems

OPENALEX - Publications

Md. Jahangir Alam Patrick Kenny Douglas O’Shaughnessy

10.1007/s12559-012-9197-5 article EN Cognitive Computation 2012-12-06

A blind watermarking scheme for depth-image-based rendered 3D video using the dual-tree complex wavelet transform

OPENALEX - Publications

Md. Asikuzzaman Md. Jahangir Alam Andrew Lambert Mark R. Pickering

The amount of unauthorized distribution 3D video is increasing day by due to the availability high speed Internet and low cost TV. Note that, not only both left right views generated using depth-image-based rendering can be distributed as content but also centre, or view individually 2D content. Video watermarking a possible way protect this type illegal distribution. In paper, we propose digital method for rendered each left, view. method, watermark embedded into centre dual-tree complex...

10.1109/icip.2014.7026112 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2014-10-01

Combining amplitude and phase-based features for speaker verification with short duration utterances

OPENALEX - Publications

Md. Jahangir Alam Patrick Kenny Themos Stafylakis

Due to the increasing use of fusion in speaker recognition systems, one trend current research activity focuses on new features that capture complementary information MFCC (Mel-frequency cepstral coefficients) for improving performance. The goal this work is combine (or fuse) amplitude and phase-based improve verification Based phase spectra we investigate some possible variations extraction coefficients produce diversity with respect fused subsystems. Among amplitude-based consider widely...

10.21437/interspeech.2015-94 article EN Interspeech 2022 2015-09-06

Speaker and Channel Factors in Text-Dependent Speaker Recognition

OPENALEX - Publications

Themos Stafylakis Patrick Kenny Md. Jahangir Alam Marcel Kockmann

We reformulate joint factor analysis so that it can serve as a feature extractor for text-dependent speaker recognition. The new formulation is based on left-to-right modeling with tied mixture HMMs and designed to deal problems such the inadequacy of subspace methods in speaker-phrase variability, UBM mismatches arise result variable phonetic content, need exploit text-independent resources pass features extracted by trainable backend which plays role analogous PLDA i-vector/PLDA cascade...

10.1109/taslp.2015.2497248 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2015-11-03

A new method for hand segmentation using free-form skin color model

OPENALEX - Publications

Ahmad Yahya Dawod Junaidi Abdullah Md. Jahangir Alam

Accurate hand segmentation is a challenging task in computer vision applications. We propose new method to segment based on free-form skin color model. The pixel value of person's captured and represented YCbCr CbCr space mapped plane order produce clustered region color. Then, instead using ellipse model the color, edge detection performed construct result, tested various complex backgrounds gives promising results.

10.1109/icacte.2010.5579466 article EN 2010-08-01

A Blind Digital Video Watermarking Scheme with Enhanced Robustness to Geometric Distortion

OPENALEX - Publications

Md. Asikuzzaman Md. Jahangir Alam Andrew Lambert Mark R. Pickering

Unauthorized redistribution of a movie is common threat to digital media that can be prevented by video watermarking. The watermark commonly embedded into the luminance (Y) component frame. chrominance (U) supports more distortion than Y without being perceived human eyes. Thus, in our proposed approach, U each frame sequence using dual-tree complex wavelet transform (DT CWT). This approach aims provide perceptually invisible high quality watermarked video. detection performed original...

10.1109/dicta.2012.6411696 article EN 2012-12-01

Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation

OPENALEX - Publications

Md. Jahangir Alam Vishwa Gupta Patrick Kenny Pierre Dumouchel

The REVERB challenge provides a common framework for the evaluation of feature extraction techniques in presence both reverberation and additive background noise. State-of-the-art speech recognition systems perform well controlled environments, but their performance degrades realistic acoustical conditions, especially real as simulated reverberant environments. In this contribution, we utilize multiple extractors including conventional mel-filterbank, multi-taper spectrum estimation-based...

10.1186/s13634-015-0238-6 article EN cc-by EURASIP Journal on Advances in Signal Processing 2015-06-18

Motion segmentation initialization strategies for bi-directional inter-frame prediction

OPENALEX - Publications

Ashek Ahmmed Rui Xu Aous Thabit Naman Md. Jahangir Alam Mark R. Pickering and 1 more

Experimental results and the latest standards have proved that segmentation based video coding systems can outperform traditional block-based systems. However, this approach requires simultaneous estimation of both shape motion moving objects in a scene. In most cases neither nor are known initially. Another critical aspect tightly-coupled relationship is inaccurate may cause poor erroneous negatively impact estimation. While some existing approaches require user intervention use clues such...

10.1109/mmsp.2013.6659264 article EN 2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP) 2013-09-01

Sample-Specific MS/MS Methods in High-Throughput Mass Spectrometry

OPENALEX - Publications

David M. Cox Xuejiao Yin Md. Jahangir Alam Bogdan Georgescu Adam Latawiec and 4 more

The drug discovery process increasingly relies on high-throughput sample analysis to accelerate the identification of viable candidates. Recently, chromatographic-free mass spectrometry (HT-MS) technologies have emerged, significantly increasing readout speed and enabling large sets. These HT-MS platforms continuously acquire data from various samples into a single file, presenting challenges in applying distinctive acquisition methods specific samples. This study introduces novel approach...

10.1021/jasms.4c00278 article EN other-oa Journal of the American Society for Mass Spectrometry 2024-08-02

Robust feature extraction for speech recognition by enhancing auditory spectrum

OPENALEX - Publications

Md. Jahangir Alam Patrick Kenny Douglas O’Shaughnessy

10.21437/interspeech.2012-392 article EN Interspeech 2022 2012-09-09

Motion hints based inter-frame prediction for hybrid video coding

OPENALEX - Publications

Ashek Ahmmed Md. Jahangir Alam Mark R. Pickering Rui Xu Aous Thabit Naman and 1 more

Experimental results and the latest standards have proved video coding systems with ability to adapt size shape of motion estimation area objects in scene can outperform traditional block-based systems. In this paper, a segmentation-based strategy that employs bi-directional hints for interframe prediction is proposed. The appealing thing about they are continuous invertible, even though observed field frame will be discontinuous non-invertible. proposed scheme outperforms rate-distortion...

10.1109/pcs.2013.6737712 article EN 2013-12-01

Amplitude modulation features for emotion recognition from speech

OPENALEX - Publications

Md. Jahangir Alam Yazid Attabi Pierre Dumouchel Patrick Kenny Douglas O’Shaughnessy

The goal of speech emotion recognition (SER) is to identify the emotional or physical state a human being from his her voice. One most important things in SER task extract and select relevant features with which emotions could be recognized. In this paper, we present smoothed nonlinear energy operator (SNEO)-based amplitude modulation cepstral coefficients (AMCC) feature for recognizing signals. SNEO estimates required produce AM-FM signal, then estimated separated into its frequency...

10.21437/interspeech.2013-563 article EN Interspeech 2022 2013-08-25

A Blind and Robust Video Watermarking Scheme Using Chrominance Embedding

OPENALEX - Publications

Md. Asikuzzaman Md. Jahangir Alam Andrew Lambert Mark R. Pickering

Piracy of a digital movie is significant threat for studios and producers. Digital video watermarking an important technique that can be used to protect the content. In existing algorithms, robustness several attacks watermark has been improved. However, none techniques are robust combination common geometric distortions scaling, rotation, cropping with other attacks. this paper, we propose blind algorithm where embedded into both chrominance channels using dual-tree complex wavelet...

10.1109/dicta.2014.7008083 article EN 2014-11-01

A blind high definition videowatermarking scheme robust to geometric and temporal synchronization attacks

OPENALEX - Publications

Md. Asikuzzaman Md. Jahangir Alam Andrew Lambert Mark R. Pickering

Due to the availability of high speed online streaming sites, a pirated copy digital video can be easily distributed global audience. This paper proposes watermarking technique based on dual-tree complex wavelet transform that protect this content. In scheme, watermark is embedded into chrominance channel frames provide quality watermarked video. The detectable without reference content as well original which makes method robust temporal synchronization attacks such frame dropping and rate...

10.1109/vcip.2013.6706395 article EN 2013-11-01

Robust Feature Extractors for Continuous Speech Recognition

OPENALEX - Publications

Md. Jahangir Alam Patrick Kenny Pierre Dumouchel Douglas O’Shaughnessy

This paper presents robust feature extractors for a continuous speech recognition task in matched and mismatched environments. The conditions may occur due to additive noise, different channel, acoustic reverberation. In the conventional Mel-frequency cepstral coefficient (MFCC) extraction framework, subband spectrum enhancement technique is incorporated improve its robustness. We denote this front-end as MFCCs (RMFCC). Based on gammatone compressive gammachirp filter-banks, filterbank...

10.5281/zenodo.44180 article EN European Signal Processing Conference 2014-11-13

Structural and electronic properties of an [(Al2O3)4]+ cluster

OPENALEX - Publications

Justyna Jaroszyńska‐Wolińska Brady D. Garabato Md. Jahangir Alam Asmaul Reza Pawel M. Kozlowski

10.1007/s00894-015-2711-4 article EN Journal of Molecular Modeling 2015-06-09

Coming Soon ...