NFDI4DS | UHH-SEMS - Publication Details

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

OPENALEX - Publications

Haoyi Zhou Shanghang Zhang Jieqi Peng Shuai Zhang Jianxin Li and 2 more

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long time-series forecasting (LSTF) demands a high capacity model, which is ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown potential Transformer increase capacity. However, there are several severe issues with that prevent it from being directly applicable LSTF, including quadratic time...

10.1609/aaai.v35i12.17325 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

ADD 2022: the first Audio Deep Synthesis Detection Challenge

OPENALEX - Publications

Jiangyan Yi Ruibo Fu Jianhua Tao Shuai Nie Haoxin Ma and 12 more

Audio deepfake detection is an emerging topic, which was included in the ASVspoof 2021. However, recent shared tasks have not covered many real-life and challenging scenarios. The first Deep synthesis Detection challenge (ADD) motivated to fill gap. ADD 2022 includes three tracks: low-quality fake audio (LF), partially (PF) game (FG). LF track focuses on dealing with bona fide fully utterances various real-world noises etc. PF aims distinguish from real. FG a rivalry game, two tasks:...

10.1109/icassp43922.2022.9746939 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

A Sequence-to-Set Network for Nested Named Entity Recognition

OPENALEX - Publications

Zeqi Tan Yongliang Shen Shuai Zhang Weiming Lü Yueting Zhuang

Named entity recognition (NER) is a widely studied task in natural language processing. Recently, growing number of studies have focused on the nested NER. The span-based methods, considering as span classification task, can deal with entities naturally. But they suffer from huge search space and lack interactions between entities. To address these issues, we propose novel sequence-to-set neural network for Instead specifying candidate spans advance, provide fixed set learnable vectors to...

10.24963/ijcai.2021/542 article EN 2021-08-01

ByteGNN

OPENALEX - Publications

Chenguang Zheng Hongzhi Chen Yuxuan Cheng Zhezheng Song Yifan Wu and 4 more

Graph neural networks (GNNs) have shown excellent performance in a wide range of applications such as recommendation, risk control, and drug discovery. With the increase volume graph data, distributed GNN systems become essential to support efficient training. However, existing training suffer from various issues including high network communication cost, low CPU utilization, poor end-to-end performance. In this paper, we propose ByteGNN, which addresses limitations with three key designs:...

10.14778/3514061.3514069 article EN Proceedings of the VLDB Endowment 2022-02-01

Expanding the prediction capacity in long sequence time-series forecasting

OPENALEX - Publications

Haoyi Zhou Jianxin Li Shanghang Zhang Shuai Zhang Mengyi Yan and 1 more

10.1016/j.artint.2023.103886 article EN Artificial Intelligence 2023-02-28

De-Bias for Generative Extraction in Unified NER Task

OPENALEX - Publications

Shuai Zhang Yongliang Shen Zeqi Tan Yiquan Wu Weiming Lü

Named entity recognition (NER) is a fundamental task to recognize specific types of entities from given sentence. Depending on how the appear in sentence, it can be divided into three subtasks, namely, Flat NER, Nested and Discontinuous NER. Among existing approaches, only generative model uniformly adapted these subtasks. However, when applied its optimization objective not consistent with task, which makes vulnerable incorrect biases. In this paper, we analyze biases generation process...

10.18653/v1/2022.acl-long.59 article EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2022-01-01

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

OPENALEX - Publications

Haoyi Zhou Shanghang Zhang Jieqi Peng Shuai Zhang Jianxin Li and 2 more

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long time-series forecasting (LSTF) demands a high capacity model, which is ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown potential Transformer increase capacity. However, there are several severe issues with that prevent it from being directly applicable LSTF, including quadratic time...

10.48550/arxiv.2012.07436 preprint EN other-oa arXiv (Cornell University) 2020-01-01

An Efficient Approach to Event Detection and Forecasting in Dynamic Multivariate Social Media Networks

OPENALEX - Publications

Minglai Shao Jianxin Li Feng Chen Hongyi Huang Shuai Zhang and 1 more

Anomalous subgraph detection has been successfully applied to event in social media. However, the problembecomes challenging when media network incorporates abundant attributes, which leads a multivariate network. The characteristic makes most existing methods incapable tackle this problem effectively and efficiently, as it involves joint feature selection that not well addressed current literature, especially, dynamic networks attributes evolve over time.

10.1145/3038912.3052588 article EN 2017-04-03

xFraud

OPENALEX - Publications

Susie Xi Rao Shuai Zhang Zhichao Han Zitao Zhang Wei Min and 4 more

At online retail platforms, it is crucial to actively detect the risks of transactions improve customer experience and minimize financial loss. In this work, we propose xFraud, an explainable fraud transaction prediction framework which mainly composed a detector explainer. The xFraud can effectively efficiently predict legitimacy incoming transactions. Specifically, utilizes heterogeneous graph neural network learn expressive representations from informative heterogeneously typed entities...

10.14778/3494124.3494128 article EN Proceedings of the VLDB Endowment 2021-11-01

Improved efficient routing strategy on two-layer complex networks

OPENALEX - Publications

Jinlong Ma Weizhan Han Qing Guo Shuai Zhang Junfang Wang and 1 more

The traffic dynamics of multi-layer networks has become a hot research topic since many are comprised two or more layers subnetworks. Due to its low capacity, the traditional shortest path routing (SPR) protocol is susceptible congestion on two-layer complex networks. In this paper, we propose an efficient strategy named improved global awareness (IGAR) which based betweenness centrality nodes in layers. With proposed strategy, paths can bypass hub both enhance transport efficiency....

10.1142/s0129183116500443 article EN International Journal of Modern Physics C 2015-10-09

ConceptPsy: A comprehensive benchmark suite for hierarchical psychological concept understanding in LLMs

OPENALEX - Publications

Junlei Zhang Hongliang He Lizhi Ma Nirui Song Shuyuan He and 7 more

10.1016/j.neucom.2025.130070 article EN Neurocomputing 2025-03-01

DeGNN

OPENALEX - Publications

Xupeng Miao Nezihe Merve Gürel Wentao Zhang Zhichao Han Bo Li and 15 more

Mining from graph-structured data is an integral component of graph management. A recent trending technique, convolutional network (GCN), has gained momentum in the mining field, and plays essential part numerous graph-related tasks. Although emerging GCN optimization techniques bring improvements to specific scenarios, they perform diversely different applications introduce many trial-and-error costs for practitioners. Moreover, existing models often suffer oversmoothing problem. Besides,...

10.1145/3447548.3467312 article EN 2021-08-13

Enhancing traffic capacity of scale-free networks by link-directed strategy

OPENALEX - Publications

Jinlong Ma Weizhan Han Qing Guo Shuai Zhang

The transport efficiency of a network is strongly related to the underlying structure. In this paper, we propose an efficient strategy named high-betweenness-first (HBF) for purpose improving traffic handling capacity scale-free networks by limiting fraction undirected links be unidirectional ones based on links’ betweenness. Compared with high-degree-first (HDF) strategy, can more significantly enhanced under proposed link-directed shortest path (SP) routing protocol. Simulation results in...

10.1142/s0129183116500285 article EN International Journal of Modern Physics C 2015-07-21

A Fine-grained Interpretability Evaluation Benchmark for Neural NLP

OPENALEX - Publications

Lijie Wang Yaozong Shen Shuyuan Peng Shuai Zhang Xinyan Xiao and 5 more

Lijie Wang, Yaozong Shen, Shuyuan Peng, Shuai Zhang, Xinyan Xiao, Hao Liu, Hongxuan Tang, Ying Chen, Hua Wu, Haifeng Wang. Proceedings of the 26th Conference on Computational Natural Language Learning (CoNLL). 2022.

10.18653/v1/2022.conll-1.6 article EN cc-by 2022-01-01

Rethinking Document-Level Relation Extraction: A Reality Check

OPENALEX - Publications

Jing Li Yequan Wang Shuai Zhang Min Zhang

Recently, numerous efforts have continued to push up performance boundaries of document-level relation extraction (DocRE) and claimed significant progress in DocRE. In this paper, we do not aim at proposing a novel model for Instead, take closer look the field see if these gains are actually true. By taking comprehensive literature review thorough examination popular DocRE datasets, find that achieved upon strong or even untenable assumption common: all named entities perfectly localized,...

10.18653/v1/2023.findings-acl.353 article EN cc-by Findings of the Association for Computational Linguistics: ACL 2022 2023-01-01

MAS: Towards Resource-Efficient Federated Multiple-Task Learning

OPENALEX - Publications

Weiming Zhuang Yonggang Wen Lingjuan Lyu Shuai Zhang

Federated learning (FL) is an emerging distributed machine method that empowers in-situ model training on decentralized edge devices. However, multiple simultaneous FL tasks could overload resource-constrained In this work, we propose the first system to effectively coordinate and train tasks. We formalize problem of Then, present our new approach, MAS (Merge Split), optimize performance starts by merging into all-in-one task with a multi-task architecture. After for few rounds, splits two...

10.1109/iccv51070.2023.02140 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

AN EFFICIENT BANDWIDTH ALLOCATION STRATEGY FOR SCALE-FREE NETWORKS

OPENALEX - Publications

Zhongyuan Jiang Mangui Liang Shuai Zhang Shujuan Wang Dongchao Guo

Traffic capacity is critical for various networks and strongly depends on the distribution of link's bandwidth resources. In this paper, we propose a betweenness-based allocation strategy in which each link l ij allocated proportionally to product (1 + B i ) α j , where tunable parameter, are betweenness node j, respectively. The optimal value achieved by extensive simulations slightly increases with network size. Our new achieves highest traffic when compared average previously proposed...

10.1142/s0129183112500659 article EN International Journal of Modern Physics C 2012-08-07

Neural Methods for Logical Reasoning Over Knowledge Graphs

OPENALEX - Publications

Alfonso Amayuelas Shuai Zhang Susie Xi Rao Ce Zhang

Reasoning is a fundamental problem for computers and deeply studied in Artificial Intelligence. In this paper, we specifically focus on answering multi-hop logical queries Knowledge Graphs (KGs). This complicated task because, real-world scenarios, the graphs tend to be large incomplete. Most previous works have been unable create models that accept full First-Order Logical (FOL) queries, which include negative only able process limited set of query structures. Additionally, most methods...

10.48550/arxiv.2209.14464 preprint EN cc-by arXiv (Cornell University) 2022-01-01

reducing multilingual context confusion for end-to-end code-switching automatic speech recognition

OPENALEX - Publications

Shuai Zhang Jiangyan Yi Zhengkun Tian Jianhua Tao Yu Ting Yeung and 1 more

10.21437/interspeech.2022-10286 article EN Interspeech 2022 2022-09-16

FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization

OPENALEX - Publications

Zhengkun Tian Jiangyan Yi Ye Bai Jianhua Tao Shuai Zhang and 1 more

Transducer-based models, such as RNN-Transducer and transformer-transducer, have achieved great success in speech recognition. A typical transducer model decodes the output sequence conditioned on current acoustic state previously predicted tokens step by step. Statistically, The number of blank prediction results accounts for nearly 90\% all tokens. It takes a lot computation time to predict tokens, but only non-blank will appear final sequence. Therefore, we propose method named fast-skip...

10.21437/interspeech.2021-1367 article EN Interspeech 2022 2021-08-27

MolTC: Towards Molecular Relational Modeling In Language Models

OPENALEX - Publications

Junfeng Fang Shuai Zhang Wu Chang Zhiyuan Liu Sihang Li and 4 more

Molecular Relational Learning (MRL), aiming to understand interactions between molecular pairs, plays a pivotal role in advancing biochemical research. Recently, the adoption of large language models (LLMs), known for their vast knowledge repositories and advanced logical inference capabilities, has emerged as promising way efficient effective MRL. Despite potential, these methods predominantly rely on textual data, thus not fully harnessing wealth structural information inherent graphs....

10.48550/arxiv.2402.03781 preprint EN arXiv (Cornell University) 2024-02-06

Refining Node Similarity Analysis: An Optimized Nearest-Neighbor Ranking Algorithm

OPENALEX - Publications

Jiaqi Sui Wei Guo Xin Shi Shuai Zhang Hongchao Hu

As The Onion Router (Tor) becomes increasingly prevalent, attackers have initiated Sybil attacks by controlling a plethora of malicious relay nodes, severely compromising user privacy. Thus, the identification nodes is crucial for ensuring security Tor network. Previous studies suggested that tend to similar configuration and close uptime, leading design Nearest-neighbor ranking algorithm analyzing similarity between nodes. This was followed further manual analysis sift from ones. However,...

10.1109/iccect60629.2024.10545958 article EN 2024-04-26

Optimize the event selection strategy to study the anomalous quartic gauge couplings at muon colliders using the support vector machine and quantum support vector machine

OPENALEX - Publications

Shuai Zhang Yu-Chen Guo Ji-Chong Yang

Abstract The search of the new physics (NP) beyond Standard Model is one most important topics in current high energy physics. With increasing luminosities at colliders, for NP signals requires analysis more and data, efficiency data processing becomes particularly important. As a machine learning algorithm, support vector (SVM) expected to be useful NP. Meanwhile, quantum computing has potential offer huge advantages when dealing with large amounts which suggests that SVM (QSVM) tool future...

10.1140/epjc/s10052-024-13208-4 article EN cc-by The European Physical Journal C 2024-08-20

MTPGraph: A Data-Driven Approach to Predict Medical Risk Based on Temporal Profile Graph

OPENALEX - Publications

Shuai Zhang Lei Liu Hui Li Zongshui Xiao Lizhen Cui

With the rapid development of information technologies, which facilitates perfection healthcare systems, a variety clinical data is becoming available. The patient Electronic Health Records (EHR) one important sources in on conducts personalized medicine. However, it challenging if raw EHRs are directly used to conduct related medical prediction due its heterogeneity, sparsity and existence noise. To address this issue, paper proposes an integrative driven approach called Medical Temporal...

10.1109/trustcom.2016.0191 article EN 2015 IEEE Trustcom/BigDataSE/ISPA 2016-08-01