NFDI4DS | UHH-SEMS - Publication Details

Jie Zhang

ORCID: 0000-0001-6331-4005

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100436613

Research Areas

Topic Modeling
Natural Language Processing Techniques
Multimodal Machine Learning Applications
Advanced Text Analysis Techniques
Text and Document Classification Technologies
Domain Adaptation and Few-Shot Learning
Sentiment Analysis and Opinion Mining
Advanced Graph Neural Networks
Advanced Image and Video Retrieval Techniques
Machine Learning and Data Classification
Recommender Systems and Techniques
Advanced Neural Network Applications
Image and Video Quality Assessment
Computer Graphics and Visualization Techniques
Speech and dialogue systems
Data Stream Mining Techniques
Music and Audio Processing
Anomaly Detection Techniques and Applications
Human Pose and Action Recognition
Generative Adversarial Networks and Image Synthesis
Machine Learning and ELM
Game Theory and Voting Systems
Educational Technology and Pedagogy
Green IT and Sustainability
Reinforcement Learning in Robotics

Nanjing University of Science and Technology
2023-2024

Hunan Software Vocational Institute
2024

Institute of Information Engineering
2024

Nanyang Technological University
2010-2023

University of Chinese Academy of Sciences
2023

Liaoning University
2023

Minzu University of China
2022

Space Engineering University
2020

Zhejiang Financial College
2020

Shanghai Center for Brain Science and Brain-Inspired Technology
2020

Improving IoT Data Quality in Mobile Crowd Sensing: A Cross Validation Approach

OPENALEX - Publications

Tie Luo Jianwei Huang Salil S. Kanhere Jie Zhang Sajal K. Das

Data quality, or sometimes referred to as data credibility, is a critical issue in mobile crowd sensing (MCS) and more generally Internet of Things (IoT). While candidate solutions, such incentive mechanisms mining have been well explored the literature, power crowds has largely overlooked under-exploited. In this paper, we propose cross validation approach which seeks validating ratify contributing terms sensor contributed by latter, uses result reshape into credible posterior belief ground...

10.1109/jiot.2019.2904704 article EN publisher-specific-oa IEEE Internet of Things Journal 2019-03-13

Multi-Decoder Attention Model with Embedding Glimpse for Solving Vehicle Routing Problems

OPENALEX - Publications

Liang Xin Wen Song Zhiguang Cao Jie Zhang

We present a novel deep reinforcement learning method to learn construction heuristics for vehicle routing problems. In specific, we propose Multi-Decoder Attention Model (MDAM) train multiple diverse policies, which effectively increases the chance of finding good solutions compared with existing methods that only one policy. A customized beam search strategy is designed fully exploit diversity MDAM. addition, an Embedding Glimpse layer in MDAM based on recursive nature construction, can...

10.1609/aaai.v35i13.17430 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Effective adaptation of a Hidden Markov Model-based named entity recognizer for biomedical domain

OPENALEX - Publications

Dan Shen Jie Zhang Guodong Zhou Jian Su Chew-Lim Tan

In this paper, we explore how to adapt a general Hidden Markov Model-based named entity recognizer effectively biomedical domain. We integrate various features, including simple deterministic morphological POS features and semantic trigger capture evidences especially for evaluate their contributions. also present algorithm solve the abbreviation problem rule-based method deal with cascaded phenomena in Our experiments on GENIA V3.0 V1.1 achieve 66.1 62.5 F-measure respectively, which...

10.3115/1118958.1118965 article EN 2003-01-01

M6: A Chinese Multimodal Pretrainer

OPENALEX - Publications

Junyang Lin Rui Men Yang An Chang Zhou Ming Ding and 20 more

In this work, we construct the largest dataset for multimodal pretraining in Chinese, which consists of over 1.9TB images and 292GB texts that cover a wide range domains. We propose cross-modal method called M6, referring to Multi-Modality Multitask Mega-transformer, unified on data single modality multiple modalities. scale model size up 10 billion 100 parameters, build pretrained Chinese. apply series downstream applications, demonstrate its outstanding performance comparison with strong...

10.48550/arxiv.2103.00823 preprint EN cc-by arXiv (Cornell University) 2021-01-01

EasyAug: An Automatic Textual Data Augmentation Platform for Classification Tasks

OPENALEX - Publications

Siyuan Qiu Binxia Xu Jie Zhang Yafang Wang Xiaoyu Shen and 3 more

Imbalanced data is a perennial problem that impedes the learning abilities of current machine learning-based classification models. One approach to address it leverage augmentation expand training set. For image data, there are number suitable techniques have proven effective in previous work. textual however, due discrete units inherent natural language, randomly perturb signal may be ineffective. Additionally, substantial discrepancy between different datasets (e.g., domains), an...

10.1145/3366424.3383552 article EN Companion Proceedings of the The Web Conference 2018 2020-04-20

Active Large Language Model-Based Knowledge Distillation for Session-Based Recommendation

OPENALEX - Publications

Yingpeng Du Zhu Sun Ziyan Wang Haoyan Chua Jie Zhang and 1 more

Large language models (LLMs) provide a promising way for accurate session-based recommendation (SBR), but they demand substantial computational time and memory. Knowledge distillation (KD)-based methods can alleviate these issues by transferring the knowledge to small student, which trains student based on predictions of cumbersome teacher. However, encounter difficulties LLM-based KD in SBR. 1) It is expensive make LLMs predict all instances KD. 2) may ineffective some KD, e.g., incorrect...

10.1609/aaai.v39i11.33263 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Which Channel to Ask My Question?: Personalized Customer Service Request Stream Routing Using Deep Reinforcement Learning

OPENALEX - Publications

Zining Liu Chong Long Xiaolu Lu Zehong Hu Jie Zhang and 1 more

Customer services are critical to all companies, as they may directly connect the brand reputation. Due a great number of customers, e-commerce companies often employ multiple communication channels answer customers' questions, for example, Chatbot and Hotline. On one hand, each channel has limited capacity respond requests; on other customers have different preferences over these channels. The current production systems mainly built based business rules that merely consider tradeoffs...

10.1109/access.2019.2932047 article EN cc-by IEEE Access 2019-01-01

AATEAM: Achieving the Ad Hoc Teamwork by Employing the Attention Mechanism

OPENALEX - Publications

Shuo Chen Ewa Andrejczuk Zhiguang Cao Jie Zhang

In the ad hoc teamwork setting, a team of agents needs to perform task without prior coordination. The most advanced approach learns policies based on previous experiences and reuses one interact with new teammates. However, selected policy in many cases is sub-optimal. Switching between adapt teammates' behaviour takes time, which threatens successful performance task. this paper, we propose AATEAM – method that uses attention-based neural networks cope real-time. We train attention network...

10.1609/aaai.v34i05.6196 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Diversify Question Generation with Continuous Content Selectors and Question Type Modeling

OPENALEX - Publications

Zhen Wang Siwei Rao Jie Zhang Zhen Qin Guangjian Tian and 1 more

Generating questions based on answers and relevant contexts is a challenging task. Recent work mainly pays attention to the quality of single generated question. However, question generation actually one-to-many problem, as it possible raise with different focuses various means expression. In this paper, we explore diversity come up methods from these two aspects. Specifically, relate contextual content selectors, which are modeled by continuous latent variable technique conditional...

10.18653/v1/2020.findings-emnlp.194 article EN cc-by 2020-01-01

M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining

OPENALEX - Publications

Junyang Lin Yang An Jinze Bai Chang Zhou Le Jiang and 7 more

Recent expeditious developments in deep learning algorithms, distributed training, and even hardware design for large models have enabled training extreme-scale models, say GPT-3 Switch Transformer possessing hundreds of billions or trillions parameters. However, under limited resources, model that requires enormous amounts computes memory footprint suffers from frustratingly low efficiency convergence. In this paper, we propose a simple strategy called "Pseudo-to-Real"...

10.48550/arxiv.2110.03888 preprint EN cc-by-sa arXiv (Cornell University) 2021-01-01

Large Language Model with Graph Convolution for Recommendation

OPENALEX - Publications

Yingpeng Du Ziyan Wang Zhu Sun Haoyan Chua Hongzhi Liu and 4 more

In recent years, efforts have been made to use text information for better user profiling and item characterization in recommendations. However, can sometimes be of low quality, hindering its effectiveness real-world applications. With knowledge reasoning capabilities capsuled Large Language Models (LLMs), utilizing LLMs emerges as a promising way description improvement. existing ways prompting with raw texts ignore structured user-item interactions, which may lead hallucination problems...

10.48550/arxiv.2402.08859 preprint EN arXiv (Cornell University) 2024-02-13

Reinforcement Learning Over Knowledge Graphs for Explainable Dialogue Intent Mining

OPENALEX - Publications

Kai Yang Xinyu Kong Yafang Wang Jie Zhang Gerard de Melo

In light of the millions households that have adopted intelligent assistant powered devices, multi-turn dialogue has become an important field inquiry. Most current methods identify underlying intent in using opaque classification techniques fail to provide any interpretable basis for classification. To address this, we propose a scheme interpret based on specific characteristics text. We rely policy-guided reinforcement learning paths graph confirm concrete inference serve as explanations....

10.1109/access.2020.2991257 article EN cc-by IEEE Access 2020-01-01

A Skip-Connected Evolving Recurrent Neural Network for Data Stream Classification under Label Latency Scenario

OPENALEX - Publications

Monidipa Das Mahardhika Pratama Jie Zhang Yew-Soon Ong

Stream classification models for non-stationary environments often assume the immediate availability of data labels. However, in a practical scenario, it is quite natural that labels are available only after some temporal lag. This paper explores how stream classifier model can be made adaptive to such label latency scenario. We propose SkipE-RNN, self-evolutionary recurrent neural network with dynamically evolving skipped-recurrent-connection best utilization previously observed information...

10.1609/aaai.v34i04.5781 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Data Augmentation for Multiclass Utterance Classification – A Systematic Study

OPENALEX - Publications

Binxia Xu Siyuan Qiu Jie Zhang Yafang Wang Xiaoyu Shen and 1 more

Utterance classification is a key component in many conversational systems. However, classifying real-world user utterances challenging, as people may express their ideas and thoughts manifold ways, the amount of training data for some categories be fairly limited, resulting imbalanced distributions. To alleviate these issues, we conduct comprehensive survey regarding augmentation approaches text classification, including simple random resampling, word-level transformations, neural...

10.18653/v1/2020.coling-main.479 article EN cc-by Proceedings of the 17th international conference on Computational linguistics - 2020-01-01

Prototype Feature Extraction for Multi-task Learning

OPENALEX - Publications

Xin Shen Yuhang Jiao Cheng Long Yu Guang Wang Xiaowei Wang and 3 more

Multi-task learning (MTL) has been widely utilized in various industrial scenarios, such as recommender systems and search engines. MTL can improve efficiency prediction accuracy by exploiting commonalities differences across tasks. However, is sensitive to relationships among tasks may have performance degradation real-world applications, because existing neural-based models often share the same network structures original input features. To address this issue, we propose a novel multi-task...

10.1145/3485447.3512119 article EN Proceedings of the ACM Web Conference 2022 2022-04-25

Real-Fake: Effective Training Data Synthesis Through Distribution Matching

OPENALEX - Publications

Jianhao Yuan Jie Zhang Shuyang Sun Philip Torr Bo Zhao

Synthetic training data has gained prominence in numerous learning tasks and scenarios, offering advantages such as dataset augmentation, generalization evaluation, privacy preservation. Despite these benefits, the efficiency of synthetic generated by current methodologies remains inferior when advanced deep models exclusively, limiting its practical utility. To address this challenge, we analyze principles underlying synthesis for supervised elucidate a principled theoretical framework from...

10.48550/arxiv.2310.10402 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Improving short-term active power prediction through optimization of the categorical boosting model with meta-heuristic algorithms

OPENALEX - Publications

Wei Yan Jie Zhang

10.1007/s00202-024-02921-8 article EN Electrical Engineering 2024-12-30

Detecting Accounting Frauds in Publicly Traded U.S. Firms: New Perspective and New Method

OPENALEX - Publications

Yang Bao Bin Ke Bin Li Y. Julia Yu Jie Zhang

We develop a state-of-the-art fraud prediction model using machine learning approach. demonstrate the value of combining domain knowledge and method in building. select our input based on existing accounting theories, but we differ from prior research by raw numbers rather than financial ratios. employ one most powerful methods, ensemble learning, commonly used logistic regression. To assess performance models, introduce new evaluation metric ranking problems that is more appropriate for...

10.2139/ssrn.2670703 article EN SSRN Electronic Journal 2015-01-01

Generative Visual Dialogue System via Weighted Likelihood Estimation

OPENALEX - Publications

Heming Zhang Shalini Ghosh Larry Heck Stephen J. Walsh Junting Zhang and 2 more

The key challenge of generative Visual Dialogue (VD) systems is to respond human queries with informative answers in natural and contiguous conversation flow. Traditional Maximum Likelihood Estimation-based methods only learn from positive responses but ignore the negative responses, consequently tend yield safe or generic responses. To address this issue, we propose a novel training scheme conjunction weighted likelihood estimation method. Furthermore, an adaptive multi-modal reasoning...

10.24963/ijcai.2019/144 article EN 2019-07-28

OST: a heuristic-based orthogonal partitioning algorithm for dynamic hierarchical data visualization

OPENALEX - Publications

Yanchao Wang Yidan Xing Feng Lin Hock Soon Seah Jie Zhang

10.1007/s12650-022-00830-1 article EN Journal of Visualization 2022-02-28

DEEDP: Document-Level Event Extraction Model Incorporating Dependency Paths

OPENALEX - Publications

Hui Li Xin Zhao Lin Yu Yixin Zhao Jie Zhang

Document-level event extraction (DEE) aims at extracting records from given documents. Existing DEE methods handle troublesome challenges by using multiple encoders and casting the task into a multi-step paradigm. However, most of previous approaches ignore missing feature mean pooling or max operations in different encoding stages have not explicitly modeled interdependency features between input tokens, thus long-distance problem cannot be solved effectively. In this study, we propose...

10.3390/app13052846 article EN cc-by Applied Sciences 2023-02-22

Coming Soon ...