Shuai Zhang

ORCID: 0000-0001-8502-2927
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • Natural Language Processing Techniques
  • Music and Audio Processing
  • Speech Recognition and Synthesis
  • Speech and Audio Processing
  • Advanced Graph Neural Networks
  • Time Series Analysis and Forecasting
  • Complex Network Analysis Techniques
  • Machine Learning in Healthcare
  • Privacy-Preserving Technologies in Data
  • Domain Adaptation and Few-Shot Learning
  • Advanced Computational Techniques and Applications
  • Stochastic Gradient Optimization Techniques
  • Neural Networks and Applications
  • Energy Load and Power Forecasting
  • Biomedical Text Mining and Ontologies
  • Network Security and Intrusion Detection
  • Stock Market Forecasting Methods
  • Internet Traffic Analysis and Secure E-voting
  • Artificial Intelligence in Healthcare
  • Anomaly Detection Techniques and Applications
  • Digital Media Forensic Detection
  • Interconnection Networks and Systems
  • Mobile Crowdsensing and Crowdsourcing
  • Age of Information Optimization

Liaoning Normal University
2024

Tsinghua University
2024

PLA Information Engineering University
2024

North China University of Technology
2024

Zhengzhou University
2023

Beihang University
2017-2023

Guangzhou University
2023

ETH Zurich
2021-2023

L'Oreal (United States)
2023

Chinese Academy of Sciences
2020-2022

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long time-series forecasting (LSTF) demands a high capacity model, which is ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown potential Transformer increase capacity. However, there are several severe issues with that prevent it from being directly applicable LSTF, including quadratic time...

10.1609/aaai.v35i12.17325 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Audio deepfake detection is an emerging topic, which was included in the ASVspoof 2021. However, recent shared tasks have not covered many real-life and challenging scenarios. The first Deep synthesis Detection challenge (ADD) motivated to fill gap. ADD 2022 includes three tracks: low-quality fake audio (LF), partially (PF) game (FG). LF track focuses on dealing with bona fide fully utterances various real-world noises etc. PF aims distinguish from real. FG a rivalry game, two tasks:...

10.1109/icassp43922.2022.9746939 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

Named entity recognition (NER) is a widely studied task in natural language processing. Recently, growing number of studies have focused on the nested NER. The span-based methods, considering as span classification task, can deal with entities naturally. But they suffer from huge search space and lack interactions between entities. To address these issues, we propose novel sequence-to-set neural network for Instead specifying candidate spans advance, provide fixed set learnable vectors to...

10.24963/ijcai.2021/542 article EN 2021-08-01

Graph neural networks (GNNs) have shown excellent performance in a wide range of applications such as recommendation, risk control, and drug discovery. With the increase volume graph data, distributed GNN systems become essential to support efficient training. However, existing training suffer from various issues including high network communication cost, low CPU utilization, poor end-to-end performance. In this paper, we propose ByteGNN, which addresses limitations with three key designs:...

10.14778/3514061.3514069 article EN Proceedings of the VLDB Endowment 2022-02-01

Named entity recognition (NER) is a fundamental task to recognize specific types of entities from given sentence. Depending on how the appear in sentence, it can be divided into three subtasks, namely, Flat NER, Nested and Discontinuous NER. Among existing approaches, only generative model uniformly adapted these subtasks. However, when applied its optimization objective not consistent with task, which makes vulnerable incorrect biases. In this paper, we analyze biases generation process...

10.18653/v1/2022.acl-long.59 article EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2022-01-01

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long time-series forecasting (LSTF) demands a high capacity model, which is ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown potential Transformer increase capacity. However, there are several severe issues with that prevent it from being directly applicable LSTF, including quadratic time...

10.48550/arxiv.2012.07436 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Anomalous subgraph detection has been successfully applied to event in social media. However, the problembecomes challenging when media network incorporates abundant attributes, which leads a multivariate network. The characteristic makes most existing methods incapable tackle this problem effectively and efficiently, as it involves joint feature selection that not well addressed current literature, especially, dynamic networks attributes evolve over time.

10.1145/3038912.3052588 article EN 2017-04-03

At online retail platforms, it is crucial to actively detect the risks of transactions improve customer experience and minimize financial loss. In this work, we propose xFraud, an explainable fraud transaction prediction framework which mainly composed a detector explainer. The xFraud can effectively efficiently predict legitimacy incoming transactions. Specifically, utilizes heterogeneous graph neural network learn expressive representations from informative heterogeneously typed entities...

10.14778/3494124.3494128 article EN Proceedings of the VLDB Endowment 2021-11-01

The traffic dynamics of multi-layer networks has become a hot research topic since many are comprised two or more layers subnetworks. Due to its low capacity, the traditional shortest path routing (SPR) protocol is susceptible congestion on two-layer complex networks. In this paper, we propose an efficient strategy named improved global awareness (IGAR) which based betweenness centrality nodes in layers. With proposed strategy, paths can bypass hub both enhance transport efficiency....

10.1142/s0129183116500443 article EN International Journal of Modern Physics C 2015-10-09

Mining from graph-structured data is an integral component of graph management. A recent trending technique, convolutional network (GCN), has gained momentum in the mining field, and plays essential part numerous graph-related tasks. Although emerging GCN optimization techniques bring improvements to specific scenarios, they perform diversely different applications introduce many trial-and-error costs for practitioners. Moreover, existing models often suffer oversmoothing problem. Besides,...

10.1145/3447548.3467312 article EN 2021-08-13

The transport efficiency of a network is strongly related to the underlying structure. In this paper, we propose an efficient strategy named high-betweenness-first (HBF) for purpose improving traffic handling capacity scale-free networks by limiting fraction undirected links be unidirectional ones based on links’ betweenness. Compared with high-degree-first (HDF) strategy, can more significantly enhanced under proposed link-directed shortest path (SP) routing protocol. Simulation results in...

10.1142/s0129183116500285 article EN International Journal of Modern Physics C 2015-07-21

Lijie Wang, Yaozong Shen, Shuyuan Peng, Shuai Zhang, Xinyan Xiao, Hao Liu, Hongxuan Tang, Ying Chen, Hua Wu, Haifeng Wang. Proceedings of the 26th Conference on Computational Natural Language Learning (CoNLL). 2022.

10.18653/v1/2022.conll-1.6 article EN cc-by 2022-01-01

Recently, numerous efforts have continued to push up performance boundaries of document-level relation extraction (DocRE) and claimed significant progress in DocRE. In this paper, we do not aim at proposing a novel model for Instead, take closer look the field see if these gains are actually true. By taking comprehensive literature review thorough examination popular DocRE datasets, find that achieved upon strong or even untenable assumption common: all named entities perfectly localized,...

10.18653/v1/2023.findings-acl.353 article EN cc-by Findings of the Association for Computational Linguistics: ACL 2022 2023-01-01

Federated learning (FL) is an emerging distributed machine method that empowers in-situ model training on decentralized edge devices. However, multiple simultaneous FL tasks could overload resource-constrained In this work, we propose the first system to effectively coordinate and train tasks. We formalize problem of Then, present our new approach, MAS (Merge Split), optimize performance starts by merging into all-in-one task with a multi-task architecture. After for few rounds, splits two...

10.1109/iccv51070.2023.02140 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Traffic capacity is critical for various networks and strongly depends on the distribution of link's bandwidth resources. In this paper, we propose a betweenness-based allocation strategy in which each link l ij allocated proportionally to product (1 + B i ) α j , where tunable parameter, are betweenness node j, respectively. The optimal value achieved by extensive simulations slightly increases with network size. Our new achieves highest traffic when compared average previously proposed...

10.1142/s0129183112500659 article EN International Journal of Modern Physics C 2012-08-07

Reasoning is a fundamental problem for computers and deeply studied in Artificial Intelligence. In this paper, we specifically focus on answering multi-hop logical queries Knowledge Graphs (KGs). This complicated task because, real-world scenarios, the graphs tend to be large incomplete. Most previous works have been unable create models that accept full First-Order Logical (FOL) queries, which include negative only able process limited set of query structures. Additionally, most methods...

10.48550/arxiv.2209.14464 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Transducer-based models, such as RNN-Transducer and transformer-transducer, have achieved great success in speech recognition. A typical transducer model decodes the output sequence conditioned on current acoustic state previously predicted tokens step by step. Statistically, The number of blank prediction results accounts for nearly 90\% all tokens. It takes a lot computation time to predict tokens, but only non-blank will appear final sequence. Therefore, we propose method named fast-skip...

10.21437/interspeech.2021-1367 article EN Interspeech 2022 2021-08-27

Molecular Relational Learning (MRL), aiming to understand interactions between molecular pairs, plays a pivotal role in advancing biochemical research. Recently, the adoption of large language models (LLMs), known for their vast knowledge repositories and advanced logical inference capabilities, has emerged as promising way efficient effective MRL. Despite potential, these methods predominantly rely on textual data, thus not fully harnessing wealth structural information inherent graphs....

10.48550/arxiv.2402.03781 preprint EN arXiv (Cornell University) 2024-02-06

As The Onion Router (Tor) becomes increasingly prevalent, attackers have initiated Sybil attacks by controlling a plethora of malicious relay nodes, severely compromising user privacy. Thus, the identification nodes is crucial for ensuring security Tor network. Previous studies suggested that tend to similar configuration and close uptime, leading design Nearest-neighbor ranking algorithm analyzing similarity between nodes. This was followed further manual analysis sift from ones. However,...

10.1109/iccect60629.2024.10545958 article EN 2024-04-26

Abstract The search of the new physics (NP) beyond Standard Model is one most important topics in current high energy physics. With increasing luminosities at colliders, for NP signals requires analysis more and data, efficiency data processing becomes particularly important. As a machine learning algorithm, support vector (SVM) expected to be useful NP. Meanwhile, quantum computing has potential offer huge advantages when dealing with large amounts which suggests that SVM (QSVM) tool future...

10.1140/epjc/s10052-024-13208-4 article EN cc-by The European Physical Journal C 2024-08-20

With the rapid development of information technologies, which facilitates perfection healthcare systems, a variety clinical data is becoming available. The patient Electronic Health Records (EHR) one important sources in on conducts personalized medicine. However, it challenging if raw EHRs are directly used to conduct related medical prediction due its heterogeneity, sparsity and existence noise. To address this issue, paper proposes an integrative driven approach called Medical Temporal...

10.1109/trustcom.2016.0191 article EN 2015 IEEE Trustcom/BigDataSE/ISPA 2016-08-01
Coming Soon ...