NFDI4DS | UHH-SEMS - Publication Details

Ran Zhang

ORCID: 0009-0000-0708-5287

About

Contact & Profiles

A5114500347

Research Areas

Topic Modeling
Natural Language Processing Techniques
EEG and Brain-Computer Interfaces
Speech Recognition and Synthesis
Software Engineering Research
Stock Market Forecasting Methods
Human Pose and Action Recognition
Multimodal Machine Learning Applications
Speech and Audio Processing
Software Reliability and Analysis Research
Neural Networks and Applications
Time Series Analysis and Forecasting
Neural dynamics and brain function
Advanced Malware Detection Techniques
Advanced Memory and Neural Computing
Domain Adaptation and Few-Shot Learning
Privacy-Preserving Technologies in Data
Gait Recognition and Analysis
Advanced Text Analysis Techniques
Speech and dialogue systems
Semantic Web and Ontologies
Sentiment Analysis and Opinion Mining
Text and Document Classification Technologies
Imbalanced Data Classification Techniques
Hand Gesture Recognition Systems

Taiyuan University of Science and Technology
2025

South China University of Technology
2024

Hertie School
2022

Kuaishou (China)
2022

Deakin University
2021

North China Electric Power University
2019

China Electric Power Research Institute
2019

Shandong Institute of Automation
2014

Interpretable deep classification of time series based on class discriminative prototype learning

OPENALEX - Publications

Yupeng Wang, Jianghui Cai, Haifeng Yang, Chenhui Shi, M. Zhang, and 3 more

Prototypes help to explain the predictions of deep classification models for time series. However, most methods learn prototypes by randomly initializing an uncertain number of low-discriminative prototypes, which may lead to unstable and unreliable results. To address these issues, we propose a new class Discriminative Prototype Learning Network (DPL-Net), which learns appropriate class-discriminative prototypes, thus improving performance. Specifically, the proposed Prototype Initialization Mechanism (PIM) introduces a proximity metric...

10.1177/1088467x251319188 article EN other-oa Intelligent Data Analysis 2025-02-27

EEG-based Auditory Attention Detection with Spiking Graph Convolutional Network

Siqi Cai, Ran Zhang, Malu Zhang, Jibin Wu, Haizhou Li

Decoding auditory attention from brain activities, such as electroencephalography (EEG), sheds light on solving the machine cocktail party problem. However, effective representation of EEG signals remains a challenge. One reason is that current feature extraction techniques have not fully exploited the spatial information along EEG signals. EEG signals reflect the collective dynamics of brain activities across different regions. The intricate interactions among these channels, rather than individual channels alone,...

10.1109/tcds.2024.3376433 article EN IEEE Transactions on Cognitive and Developmental Systems 2024-03-12

Robust Decoding of the Auditory Attention from EEG Recordings Through Graph Convolutional Networks

Siqi Cai, Ran Zhang, Haizhou Li

Auditory attention decoding (AAD) with electroencephalography (EEG) holds great promise in brain-computer interfaces (BCI). Despite much progress, it remains a research topic how to effectively evaluate the performance of EEG-based AAD algorithms under an appropriate setting that reflects the use scenarios. It is desired that AAD systems are evaluated in cross-subject and cross-trial settings. However, results are often reported in same-subject, same-trial settings, where the test data are not truly separated from the training...

10.1109/icassp48485.2024.10447633 article EN ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Sign Language Recognition Based on CBAM-ResNet

Huang Gui Chao, Fenhua Wang, Ran Zhang

As we are aware, there are millions of deaf-mutes around the world. It is necessary to conduct research into sign language recognition, as it is of massive significance in helping normal people and deaf-mutes communicate smoothly with others. A behavior recognition method is proposed in this paper to address this issue. Inspired by Multi-Fiber Networks, the CBAM-ResNet neural network extends the structure of ResNet with 3D convolution and an added convolutional block attention module. In the fifth layer of the network, a 3D-Res2Net unit is used to preserve...

10.1145/3358331.3358379 article EN 2019-10-17

ChipSong: A Controllable Lyric Generation System for Chinese Popular Song

Nayu Liu, Wenjing Han, Guangcan Liu, Da Peng, Ran Zhang, and 2 more

In this work, we take a further step towards satisfying practical demands in Chinese lyric generation from musical short-video creators, in respect of the challenges of songs' format constraints, creating specific lyrics from open-ended inspiration inputs, and language rhyme grace. One representative detail of these demands is control at the word level: for songs, creators even expect fixed-length words at certain positions to match a special melody, while previous methods lack such ability. Although recent...

10.18653/v1/2022.in2writing-1.13 article EN cc-by 2022-01-01

A novel hybrid mandarin speech synthesis system using different base units for model training and concatenation

Ran Zhang, Jianhua Tao, Ya Li, Zhengqi Wen

The hybrid speech synthesis system, which uses an acoustic model trained according to the Maximum Likelihood criterion to select proper candidates from the corpus, has become a hot topic in recent days. For this system, the performance is affected by the size of the base training unit and the candidate unit. Most existing systems use the same kind of unit, such as the syllable or phone, for both training and concatenation. In Mandarin, initials and finals form the fundamental elements of pronunciation and are always chosen in statistical parametric TTS systems. In this paper, a new...

10.1109/icassp.2014.6853605 article EN 2014-05-01

Federated Learning with Extreme Label Skew: A Data Extension Approach

Saheed Ademola Tijani, Xingjun Ma, Ran Zhang, Frank Jiang, Robin Doss

The real-world data sets often leveraged by Federated Learning (FL) applications are mostly non-independent and non-identically distributed (non-IID). This usually results from the diverse nature of participating clients and their individual data-gathering contexts. An effective FL algorithm must incorporate the capability to produce a joint model that generalizes and captures these patterns. In this work, we show how using some wild external samples as placeholders for missing classes on client devices...

10.1109/ijcnn52387.2021.9533879 article EN 2021 International Joint Conference on Neural Networks (IJCNN) 2021-07-18

Graph-Guided Textual Explanation Generation Framework

Shuzhou Yuan, Jingyi Sun, Ran Zhang, Michael Färber, Steffen Eger, and 2 more

Natural language explanations (NLEs) are commonly used to provide plausible free-text explanations of a model's reasoning about its predictions. However, recent work has questioned the faithfulness of NLEs, as they may not accurately reflect the model's internal reasoning process regarding the predicted answer. In contrast, highlight explanations (input fragments identified as critical for the model's predictions) exhibit measurable faithfulness, which has been incrementally improved through existing research. Building on this foundation, we propose G-Tex,...

10.48550/arxiv.2412.12318 preprint EN arXiv (Cornell University) 2024-12-16

Defect Prediction Model for Object Oriented Software Based on Particle Swarm Optimized SVM

Yanan Wang, Ran Zhang, Xiangzhou Chen, Shanjie Jia, Huixia Ding, and 2 more

In terms of the security problem of power information systems, this paper analyses the importance of software defect prediction methods in object-oriented development, and proposes a defect prediction model based on a particle swarm optimized Support Vector Machine (SVM) corresponding to the features of the software. The model mainly consists of three parts: the first is a pre-processing module, which normalizes the original data and selects features; the second uses particle swarm optimization with adaptive inertia weight to optimize the parameters of the SVM, with accuracy as the fitness. Finally, the last...

10.1088/1742-6596/1187/4/042082 article EN Journal of Physics Conference Series 2019-04-01

PPSpeech: Phrase based Parallel End-to-End TTS System

Yahuan Cong, Ran Zhang, Jian Luan

Current end-to-end autoregressive TTS systems (e.g. Tacotron 2) have outperformed traditional parallel approaches on the quality of synthesized speech. However, they introduce new problems at the same time. Due to their autoregressive nature, the time cost of inference has to be proportional to the length of the text, which poses a great challenge for online serving. On the other hand, the style of the synthetic speech becomes unstable and may change obviously among sentences. In this paper, we propose a Phrase based Parallel End-to-End TTS System (PPSpeech)...

10.48550/arxiv.2008.02490 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Applying NLP Techniques to Classify Businesses by their International Standard Industrial Classification (ISIC) Code

Hannah Béchara, Ran Zhang, Shuzhou Yuan, Slava Mikhaylov

The application of machine learning has played an important role in several aspects of text classification across domains, and has brought with it great changes to the current state of the art. In this paper, we propose novel NLP techniques to classify entities by their International Standard Industrial Classification (ISIC) code, based on descriptions provided by the business owners themselves and the names of said businesses. Faced with issues of irregularity and a small amount of noisy training data, we employ different models and data...

10.1109/bigdata55660.2022.10020787 article EN 2022 IEEE International Conference on Big Data (Big Data) 2022-12-17
