NFDI4DS | UHH-SEMS - Publication Details

Yukun Ma

ORCID: 0000-0002-4419-4287

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5101568886

Research Areas

Speech Recognition and Synthesis
Topic Modeling
Speech and Audio Processing
Music and Audio Processing
Biometric Identification and Security
Natural Language Processing Techniques
Face recognition and analysis
Advanced Text Analysis Techniques
Sentiment Analysis and Opinion Mining
Digital Media Forensic Detection
Advanced Neural Network Applications
Stock Market Forecasting Methods
Domain Adaptation and Few-Shot Learning
Text and Document Classification Technologies
Machine Learning and ELM
Financial Markets and Investment Strategies
Recommender Systems and Techniques
Face and Expression Recognition
Complex Systems and Time Series Analysis
Imbalanced Data Classification Techniques
Artificial Intelligence in Healthcare
Generative Adversarial Networks and Image Synthesis
Handwritten Text Recognition Techniques
Image Enhancement Techniques
Liver Disease Diagnosis and Treatment

Henan Institute of Science and Technology
2017-2024

Alibaba Group (United States)
2022-2024

Alibaba Group (Cayman Islands)
2024

Nanyang Technological University
2012-2020

Beijing University of Technology
2015-2020

Continental (Canada)
2020

National University of Singapore
2018

University of Genoa
2018

Tsinghua University
2016

Tianjin University
2012

Targeted Aspect-Based Sentiment Analysis via Embedding Commonsense Knowledge into an Attentive LSTM

OPENALEX - Publications

Yukun Ma Haiyun Peng Erik Cambria

Analyzing people’s opinions and sentiments towards certain aspects is an important task of natural language understanding. In this paper, we propose a novel solution to targeted aspect-based sentiment analysis, which tackles the challenges both analysis by exploiting commonsense knowledge. We augment long short-term memory (LSTM) network with hierarchical attention mechanism consisting target-level sentence-level attention. Commonsense knowledge sentiment-related concepts incorporated into...

10.1609/aaai.v32i1.12048 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-26

Sentic LSTM: a Hybrid Network for Targeted Aspect-Based Sentiment Analysis

OPENALEX - Publications

Yukun Ma Haiyun Peng Tahir Abbas Khan Erik Cambria Amir Hussain

10.1007/s12559-018-9549-x article EN Cognitive Computation 2018-03-14

Technical analysis and sentiment embeddings for market trend prediction

OPENALEX - Publications

Andrea Picasso Simone Merello Yukun Ma Luca Oneto Erik Cambria

10.1016/j.eswa.2019.06.014 article EN Expert Systems with Applications 2019-06-07

A survey on empathetic dialogue systems

OPENALEX - Publications

Yukun Ma Khanh Linh Nguyen Frank Xing Erik Cambria

10.1016/j.inffus.2020.06.011 article EN Information Fusion 2020-06-25

Learning multi-grained aspect target sequence for Chinese sentiment analysis

OPENALEX - Publications

Haiyun Peng Yukun Ma Yang Li Erik Cambria

10.1016/j.knosys.2018.02.034 article EN Knowledge-Based Systems 2018-03-15

Deep learning enables automated scoring of liver fibrosis stages

OPENALEX - Publications

Yang Yu Jiahao Wang Chan Way Ng Yukun Ma Shupei Mo and 9 more

Abstract Current liver fibrosis scoring by computer-assisted image analytics is not fully automated as it requires manual preprocessing (segmentation and feature extraction) typically based on domain knowledge in pathology. Deep learning-based algorithms can potentially classify these images without the need for through learning from a large dataset of images. We investigated performance classification models built using deep algorithm pre-trained multiple sources to score compared them...

10.1038/s41598-018-34300-2 article EN cc-by Scientific Reports 2018-10-24

Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning

OPENALEX - Publications

Haiyun Peng Yukun Ma Soujanya Poria Yang Li Erik Cambria

10.1016/j.inffus.2021.01.005 article EN Information Fusion 2021-01-14

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

OPENALEX - Publications

Shengkui Zhao Yukun Ma Chongjia Ni Chong Zhang Hao Wang and 5 more

Our previously proposed MossFormer has achieved promising performance in monaural speech separation. However, it predominantly adopts a self-attention-based module, which tends to emphasize longer-range, coarser-scale dependencies, with deficiency effectively modelling finer-scale recurrent patterns. In this paper, we introduce novel hybrid model that provides the capabilities both long-range, coarse-scale dependencies and fine-scale patterns by integrating module into framework. Instead of...

10.1109/icassp48485.2024.10445985 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

A Secure Face-Verification Scheme Based on Homomorphic Encryption and Deep Neural Networks

OPENALEX - Publications

Yukun Ma Lifang Wu Xiaofeng Gu Jiaoyu He Zhou Yang

With the increase in applications of face verification, increasing attention has been paid to their accuracy and security. To ensure both safety these systems, this paper proposes an encrypted face-verification system. In paper, features are extracted using deep neural networks then with Paillier algorithm saved a data set. The framework whole system involves three parties: client, server, verification server. server saves user ID, performs client is responsible for collecting requester's...

10.1109/access.2017.2737544 article EN cc-by-nc-nd IEEE Access 2017-01-01

A novel face presentation attack detection scheme based on multi-regional convolutional neural networks

OPENALEX - Publications

Yukun Ma Lifang Wu Zeyu Li Fanghao Liu

10.1016/j.patrec.2020.01.002 article EN Pattern Recognition Letters 2020-01-08

Loss Masking Is Not Needed In Decoder-Only Transformer For Discrete-Token-Based ASR

OPENALEX - Publications

Qian Chen Wen Wang Qinglin Zhang Siqi Zheng Shiliang Zhang and 5 more

Recently, unified speech-text models, such as SpeechGPT, VioLA, and AudioPaLM, have achieved remarkable performance on various speech tasks. These models discretize signals into tokens (speech discretization) use a shared vocabulary for both text tokens. Then they train single decoder-only Transformer mixture of However, these rely the Loss Masking strategy ASR task, which ignores dependency among In this paper, we propose to model in an autoregressive way, similar text. We find that...

10.1109/icassp48485.2024.10447296 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning

OPENALEX - Publications

Shengkui Zhao Zexu Pan Kun Zhou Yukun Ma Chong Zhang and 1 more

Recently, the application of diffusion probabilistic models has advanced speech enhancement through generative approaches. However, existing diffusion-based methods have focused on generation process in high-dimensional waveform or spectral domains, leading to increased complexity and slower inference speeds. Additionally, these primarily modelled clean distributions, with limited exploration noise thereby constraining discriminative capability for enhancement. To address issues, we propose...

10.48550/arxiv.2501.10052 preprint EN arXiv (Cornell University) 2025-01-17

Conditional Latent Diffusion-Based Speech Enhancement via Dual Context Learning

OPENALEX - Publications

Shengkui Zhao Zexu Pan Kun Zhou Yukun Ma Chong Zhang and 1 more

10.1109/icassp49660.2025.10890477 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

A Neural Model for Method Name Generation from Functional Description

OPENALEX - Publications

Sa Gao Chunyang Chen Zhenchang Xing Yukun Ma Wen Song and 1 more

The names of software artifacts, e.g., method names, are important for understanding and maintenance, as good can help developers easily understand others' code. However, the existing naming guidelines difficult developers, especially novices, to come up with meaningful, concise compact variables, methods, classes files. With popularity open source, an enormous amount project source code be accessed, exhaustiveness instability manually methods could now relieved by automatically learning a...

10.1109/saner.2019.8667994 article EN 2019-02-01

Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning

OPENALEX - Publications

Lei Cai Qiankun Sun Tao Xu Yukun Ma Zhenxue Chen

Due to the existence of unfavorable factors such as turbid water quality and target occlusion, it is difficult obtain valid data features. repeated calculation similar data, real-time performance algorithm poor. In view above problems, this paper proposes a multi-AUV collaborative recognition method based on transfer-reinforcement learning. The features information which collected by are fused wavelet transformation affine invariance. similarity calculated Mahalanobis distance learning model...

10.1109/access.2020.2976121 article EN cc-by IEEE Access 2020-01-01

Neural Named Entity Boundary Detection

OPENALEX - Publications

Jing Li Aixin Sun Yukun Ma

In this paper, we focus on named entity boundary detection, which is to detect the start and end boundaries of an mention in text, without predicting its type. The detected entities are input linking or fine-grained typing systems for semantic enrichment. We propose BdryBot, a recurrent neural network encoder-decoder framework with pointer from given sentence. encoder considers both character-level representations word-level embeddings represent words. way, BdryBot does not require any...

10.1109/tkde.2020.2981329 article EN IEEE Transactions on Knowledge and Data Engineering 2020-03-17

De’hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition

OPENALEX - Publications

Dianwen Ng Ruixi Zhang Jia Qi Yip Zhao Yang Jinjie Ni and 5 more

Existing self-supervised pre-trained speech models have offered an effective way to leverage massive unannotated corpora build good automatic recognition (ASR). However, many current are trained on a clean corpus from single source, which tends do poorly when noise is present during testing. Nonetheless, it crucial overcome the adverse influence of for real-world applications. In this work, we propose novel training framework, called deHuBERT, reduction encoding inspired by H. Barlow's...

10.1109/icassp49357.2023.10096603 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Ensemble of Technical Analysis and Machine Learning for Market Trend Prediction

OPENALEX - Publications

Andrea Picasso Ratto Simone Merello Luca Oneto Yukun Ma Lorenzo Malandri and 1 more

Over the last twenty years, researchers and practitioners have attempted in many ways to effectively predict market trends. Till date, however, no satisfactory solution has been found. Many approaches applied trends, from technical analysis fundamental passing through sentiment analysis. A promising research direction is exploit indicators together with sentiments extracted social media for predicting directional movements. In this paper, we propose a new approach that leverages particular,...

10.1109/ssci.2018.8628795 article EN 2021 IEEE Symposium Series on Computational Intelligence (SSCI) 2018-11-01

Deep Heterogeneous Autoencoders for Collaborative Filtering

OPENALEX - Publications

Tianyu Li Yukun Ma Jiu Xu Björn Stenger Chen Liu and 1 more

This paper leverages heterogeneous auxiliary information to address the data sparsity problem of recommender systems. We propose a model that learns shared feature space from data, such as item descriptions, product tags and online purchase history, obtain better predictions. Our consists autoencoders, not only for numerical categorical but also sequential which enables capturing user tastes, characteristics recent dynamics preference. learn autoencoder architecture each source independently...

10.1109/icdm.2018.00153 article EN 2021 IEEE International Conference on Data Mining (ICDM) 2018-11-01

SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance

OPENALEX - Publications

Jia Qi Yip Shengkui Zhao Yukun Ma Chongjia Ni Chong Zhang and 6 more

Dual-path is a popular architecture for speech separation models (e.g. Sepformer) which splits long sequences into overlapping chunks its intra- and inter-blocks that separately model intra-chunk local features inter-chunk global relationships. However, it has been found inter-blocks, comprise half dual-path model's parameters, contribute minimally to performance. Thus, we propose the Single-Path Global Modulation (SPGM) block replace inter-blocks. SPGM named after structure consisting of...

10.1109/icassp48485.2024.10447030 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Contrastive Speech Mixup for Low-Resource Keyword Spotting

OPENALEX - Publications

Dianwen Ng Ruixi Zhang Jia Qi Yip Chong Zhang Yukun Ma and 4 more

Most of the existing neural-based models for keyword spotting (KWS) in smart devices require thousands training samples to learn a decent audio representation. However, with rising demand become more person-alized, KWS need adapt quickly smaller user samples. To tackle this challenge, we propose contrastive speech mixup (CosMix) learning algorithm low-resource KWS. CosMix introduces an auxiliary loss augmentation technique maximize relative similarity between original pre-mixed and augmented...

10.1109/icassp49357.2023.10096976 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Popularity prediction on vacation rental websites

OPENALEX - Publications

Yang Li Suhang Wang Yukun Ma Quan Pan Erik Cambria

10.1016/j.neucom.2020.05.092 article EN Neurocomputing 2020-06-13

Identity-constrained noise modeling with metric learning for face anti-spoofing

OPENALEX - Publications

Yaowen Xu Lifang Wu Meng Jian Wei‐Shi Zheng Yukun Ma and 1 more

10.1016/j.neucom.2020.12.095 article EN Neurocomputing 2021-01-09

Coming Soon ...