Xinhui Tu

ORCID: 0009-0000-6570-009X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • Web Data Mining and Analysis
  • Information Retrieval and Search Behavior
  • Natural Language Processing Techniques
  • Text and Document Classification Technologies
  • Advanced Text Analysis Techniques
  • Semantic Web and Ontologies
  • Sentiment Analysis and Opinion Mining
  • Data Quality and Management
  • Speech and dialogue systems
  • Computational Drug Discovery Methods
  • Biomedical Text Mining and Ontologies
  • Welding Techniques and Residual Stresses
  • Machine Learning in Healthcare
  • Advanced Image and Video Retrieval Techniques
  • Rough Sets and Fuzzy Logic
  • Machine Learning and Algorithms
  • Expert finding and Q&A systems
  • Complex Network Analysis Techniques
  • Wikis in Education and Collaboration
  • Service-Oriented Architecture and Web Services
  • Advanced Graph Neural Networks
  • Multimodal Machine Learning Applications
  • Advanced Welding Techniques Analysis
  • Artificial Intelligence in Healthcare

Central China Normal University
2010-2024

Nanchang Hangkong University
2022-2023

York University
2020

Wuhan University of Science and Technology
2008-2015

University of Arizona
2010

Center for Information Technology
2010

Spoken language understanding (SLU) is an essential part of a task-oriented dialogue system, which mainly includes intent detection and slot filling. Some existing approaches obtain enhanced semantic representation by establishing the correlation between two tasks. However, those methods show little improvement when applied to BERT, since BERT has learned rich features. In this paper, we propose BERT-based model with probability-aware gate mechanism, called PAGM ( <underline...

10.1109/taslp.2023.3237156 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2023-01-01

Multimodal aspect-based sentiment classification (MABSC) aims to identify the polarity toward specific aspects in multimodal data. It has gained significant attention with increasing use of social media platforms. Existing approaches primarily focus on analyzing content posts predict sentiment. However, they often struggle limited contextual information inherent posts, hindering accurate detection. To overcome this issue, we propose a novel dual cause analysis (MDCA) method track underlying...

10.1109/tnnls.2024.3415028 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-01-01

This paper proposes a novel topic model, Author-Conference Topic-Connection (ACTC) Model for academic network search. The ACTC extends the author-conference-topic (ACT) model by adding subject of conference and latent mapping information between subjects topics. It simultaneously models topical aspects papers, authors conferences with two layers: layer corresponding to topic, word topic. Each author would be associated multinomial distribution over (eg., KM, DB, IR CIKM 2012),...

10.1145/2396761.2398597 article EN 2012-10-29

Recently, many Wikipedia-based methods have been proposed to improve the performance of different natural language processing (NLP) tasks, such as semantic relatedness computation, text classification and information retrieval. Among these methods, salient analysis (SSA) has proven be an effective way generate conceptual representation for words or documents. However, its feasibility effectiveness in retrieval is mostly unknown. In this paper, we study how efficiently use SSA performance,...

10.1080/17517575.2015.1080301 article EN Enterprise Information Systems 2015-08-28

Pseudo‐relevance feedback is a well‐studied query expansion technique in which it assumed that the top‐ranked documents an initial set of retrieval results are relevant and terms then extracted from those documents. When selecting terms, most traditional models do not simultaneously consider term frequency co‐occurrence relationships between candidate terms. Intuitively, however, has higher with more likely to be related topic. In this article, we propose kernel co‐occurrence‐based framework...

10.1002/asi.24241 article EN Journal of the Association for Information Science and Technology 2019-05-13

At present, spoken language understanding (SLU) in multi-turn dialogue is a research hotspot, which mainly includes intent detection and slot filling. SLU models trained by large-scale corpus can learn good superficial semantic grammatical information. But they lack the ability for modeling knowledge needed to understand language. In order further deep information of dialogue, external needs be modeled incorporated into model. addition, utilizing correlation between history current utterance...

10.1109/bigdata47090.2019.9006162 article EN 2021 IEEE International Conference on Big Data (Big Data) 2019-12-01

This paper presents a tag-topic model with Dirichlet Forest prior (TTM-DF) for semantic knowledge acquisition from blog. The TTM-DF extends the (TTM) by replacing over topic-word multinomial. correlation between words are calculated to generate set of Must-Links and Cannot-Links, then structures trees obtained though encoding constraints Cannot-Links. Words under same subtrees expected be more correlated than different subtrees. We conduct experiments on synthetic blog dataset. Both...

10.1145/2396761.2398485 article EN 2012-10-29

Most of the existing information retrieval models assume that terms a text document are independent each other. These integrate three major variables to determine degree importance term for document: within frequency, length and specificity in collection. Intuitively, is not only dependent on aspects mentioned above, but also semantic coherence between document. In this paper, we propose heuristic approach, which query with adopted improve performance. Experimental results standard TREC...

10.1145/2911451.2914691 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2016-07-07

Dialogue state tracking (DST) is a core component of task-oriented dialogue systems. Recent works focus mainly on end-to-end DST models that omit the spoken language understanding (SLU) module to directly obtain based user's dialogue. However, slot information detected by filling in SLU closely tied slot-value pair needs be updated DST. Efficient use key semantic knowledge obtained contributes improving performance Based this idea, we introduce as subtask and build an joint model explicitly...

10.1109/tnnls.2022.3183081 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-06-22

Recent years have witnessed a substantial increase in the use of deep learning to solve various natural language processing (NLP) problems. Early models were constrained by their sequential or unidirectional nature, such that they struggled capture contextual relationships across text inputs. The introduction bidirectional encoder representations from transformers (BERT) leads robust for transformer model can understand broader context and deliver state-of-the-art performance NLP tasks. This...

10.48550/arxiv.2403.00784 preprint EN arXiv (Cornell University) 2024-02-18

The scaled Web 3.0 digital economy, represented by decentralized finance (DeFi), has sparked increasing interest in the past few years, which usually relies on blockchain for token transfer and diverse transaction logic. However, illegal behaviors, such as financial fraud, hacker attacks, money laundering, are rampant ecosystem seriously threaten its integrity security. In this paper, we propose a novel double graph-based Ethereum account de-anonymization inference method, dubbed DBG4ETH,...

10.48550/arxiv.2411.18875 preprint EN arXiv (Cornell University) 2024-11-27

10.1109/bibm62325.2024.10822564 article EN 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2024-12-03

10.1109/bibm62325.2024.10821906 article EN 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2024-12-03

Knowledge Graph Completion (KGC) has garnered massive research interest recently, and most existing methods are designed following a transductive setting where all entities observed during training. Despite the great progress on KGC, these struggle to conduct reasoning emerging KGs involving unseen entities. Thus, inductive which aims deduce missing links among entities, become new trend. Many studies transform KGC as graph classification problem by extracting enclosing subgraphs surrounding...

10.48550/arxiv.2404.15807 preprint EN arXiv (Cornell University) 2024-04-24
Coming Soon ...