- Topic Modeling
- Natural Language Processing Techniques
- Multimodal Machine Learning Applications
- Advanced Text Analysis Techniques
- Domain Adaptation and Few-Shot Learning
- Adversarial Robustness in Machine Learning
- Sentiment Analysis and Opinion Mining
- Expert Finding and Q&A Systems
- Advanced Graph Neural Networks
- Speech and Dialogue Systems
- Privacy-Preserving Technologies in Data
- Data Quality and Management
- Semantic Web and Ontologies
- Recommender Systems and Techniques
- Video Analysis and Summarization
- Emotion and Mood Recognition
- Text and Document Classification Technologies
- Cryptography and Data Security
- Stochastic Gradient Optimization Techniques
- Advanced Malware Detection Techniques
- Educational Technology and Assessment
- Complex Systems and Decision Making
- Interpreting and Communication in Healthcare
- Explainable Artificial Intelligence (XAI)
- Biomedical Text Mining and Ontologies
Nanyang Technological University
2012-2024
VinUniversity
2023
National University of Singapore
2023
University College London
2023
Moscow Institute of Thermal Technology
2019-2020
Massachusetts Institute of Technology
2019
Institute for Infocomm Research
2016-2018
Agency for Science, Technology and Research
2017-2018
Nagaoka University of Technology
2012
This paper proposes a new neural architecture for collaborative ranking with implicit feedback. Our model, LRML (Latent Relational Metric Learning), is a novel metric learning approach for recommendation. More specifically, instead of simple push-pull mechanisms between user and item pairs, we propose to learn latent relations that describe each user-item interaction. This helps alleviate the potential geometric inflexibility of existing metric learning approaches, which enables not only better performance but also...
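The latent-relation idea can be sketched in a few lines of plain Python. Keying the attention on the elementwise user-item product and scoring with a translation-style distance follow the abstract's description; the helper functions, toy dimensions, and memory layout below are illustrative assumptions, not the released implementation.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def latent_relation(p, q, keys, memory):
    # Attend over a shared memory of latent relation vectors, keyed on the
    # elementwise product of the user embedding p and item embedding q.
    joint = [x * y for x, y in zip(p, q)]
    attn = softmax([dot(joint, k) for k in keys])
    dim = len(memory[0])
    return [sum(a * m[i] for a, m in zip(attn, memory)) for i in range(dim)]

def score(p, q, keys, memory):
    # Metric-learning score: a good match satisfies p + r ~ q, so a smaller
    # distance means a higher (less negative) score.
    r = latent_relation(p, q, keys, memory)
    return -sum((pi + ri - qi) ** 2 for pi, ri, qi in zip(p, r, q))
```

Unlike a fixed push-pull objective, the relation vector r adapts per interaction, so two interactions of the same user need not be pulled toward a single point.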
Aspect-based sentiment analysis (ABSA) tries to predict the polarity of a given document with respect to a given aspect entity. While neural network architectures have been successful in predicting the overall polarity of sentences, aspect-specific sentiment analysis still remains an open problem. In this paper, we propose a novel method for integrating aspect information into the neural model. More specifically, we incorporate aspect information into the neural model by modeling word-aspect relationships. Our model, Aspect Fusion LSTM (AF-LSTM), learns to attend based on associative...
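A minimal sketch of aspect-conditioned attention, assuming the simplest possible word-aspect affinity (a dot product between each word's hidden state and the aspect vector); the actual AF-LSTM fuses words and aspects with richer associative operators, so treat this only as the general shape of the mechanism.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def aspect_attention(hidden_states, aspect):
    # Weight each word's hidden state by its affinity to the aspect vector,
    # then return the attention-pooled sentence representation.
    scores = softmax([sum(h_i * a_i for h_i, a_i in zip(h, aspect))
                      for h in hidden_states])
    dim = len(hidden_states[0])
    return [sum(w * h[i] for w, h in zip(scores, hidden_states))
            for i in range(dim)]
```

The pooled vector changes with the aspect, so the same sentence can yield different polarities for different aspect entities.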
We describe a new deep learning architecture for learning to rank question answer pairs. Our approach extends the long short-term memory (LSTM) network with holographic composition to model the relationship between question and answer representations. As opposed to the neural tensor layer that has been adopted recently, holographic composition provides the benefits of scalable and rich representational learning without incurring huge parameter costs. Overall, we present Holographic Dual LSTM (HD-LSTM), a unified architecture for both sentence modeling and semantic matching. Essentially, our...
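Holographic composition is circular correlation, which compresses two d-dimensional vectors into one d-dimensional vector instead of the d-squared interaction space of a tensor layer; this is what keeps the parameter cost flat. A direct (unoptimized) definition:

```python
def circular_correlation(a, b):
    # [a * b]_i = sum_k a_k * b_{(k+i) mod n}
    # Output stays d-dimensional, unlike an outer-product interaction.
    n = len(a)
    return [sum(a[k] * b[(k + i) % n] for k in range(n)) for i in range(n)]
```

In practice this is computed in O(d log d) via FFTs; the quadratic loop above is just the readable form of the same operation.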
This paper proposes Dyadic Memory Networks (DyMemNN), a novel extension of end-to-end memory networks (memNN) for aspect-based sentiment analysis (ABSA). Originally designed for question answering tasks, memNN operates via a memory selection operation in which relevant pieces of information are adaptively selected based on the input query. In the ABSA problem, this is analogous to the relationship between aspects and documents, in which each word in the document is compared with the aspect vector. In standard memory networks, simple dot products or feed...
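The baseline memory selection the paper builds on can be sketched as a single memory hop with dot-product matching; the toy vectors and helper names are illustrative, and DyMemNN's contribution is precisely to replace this dot-product matching with richer dyadic compositions.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def memory_hop(query, memory_in, memory_out):
    # Match the query (aspect vector) against input memories (word
    # representations), then read out a weighted sum of output memories.
    weights = softmax([dot(query, m) for m in memory_in])
    dim = len(memory_out[0])
    read = [sum(w * m[i] for w, m in zip(weights, memory_out))
            for i in range(dim)]
    # The read vector is added back to the query, as in end-to-end memNN.
    return [q + r for q, r in zip(query, read)]
```

Stacking several such hops lets the model refine which words matter for the given aspect.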
Deep learning has demonstrated tremendous potential for Automatic Text Scoring (ATS) tasks. In this paper, we describe a new neural architecture that enhances vanilla neural network models with auxiliary coherence features. Our method proposes a SkipFlow mechanism that models relationships between snapshots of the hidden representations of a long short-term memory (LSTM) network as it reads. Subsequently, the semantic relationships between multiple snapshots are used as auxiliary features for prediction. This has two main benefits. Firstly, essays are typically long sequences and therefore...
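The snapshot idea can be sketched by comparing hidden states taken a fixed number of steps apart; cosine similarity and the skip width here are illustrative choices, not necessarily the paper's exact relational operator.

```python
import math

def cosine(a, b):
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return sum(x * y for x, y in zip(a, b)) / (na * nb)

def skip_coherence_features(hidden_states, skip=2):
    # Compare snapshots of the recurrent hidden state taken `skip` steps
    # apart; a coherent essay should yield smoothly related snapshots.
    return [cosine(hidden_states[i], hidden_states[i + skip])
            for i in range(len(hidden_states) - skip)]
```

The resulting feature vector is concatenated to the final representation, giving the scorer an explicit signal about how the essay's semantics flow over time.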
The dominant neural architectures in question answer retrieval are based on recurrent or convolutional encoders configured with complex word matching layers. Given that recent architectural innovations are mostly new word interaction layers or attention-based matching mechanisms, it seems to be a well-established fact that these components are mandatory for good performance. Unfortunately, the memory and computation cost incurred by these complex mechanisms is undesirable for practical applications. As such, this paper tackles the question of whether it is...
Automatic question generation can benefit many applications ranging from dialogue systems to reading comprehension. While questions are often asked with respect to long documents, there are many challenges in modeling such documents. Many existing techniques generate questions by effectively looking at one sentence at a time, leading to questions that are easy and not reflective of the human process of question generation. Our goal is to incorporate interactions across multiple sentences to generate realistic questions for long documents. In order to link the broad document context to the target...
Attention is typically used to select informative sub-phrases that are used for prediction. This paper investigates the novel use of attention as a form of feature augmentation, i.e., casted attention. We propose Multi-Cast Attention Networks (MCAN), a new attention mechanism and general model architecture for a potpourri of ranking tasks in the conversational modeling and question answering domains. Our approach performs a series of soft attention operations, each time casting a scalar feature upon the inner word embeddings. The key idea is to provide a real-valued hint...
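One "cast" can be sketched as follows: attend over the sequence, then compress each word's interaction with the attended summary into a single scalar that is appended to the word embedding. The dot-product compression here is one simple choice; MCAN actually uses several compression functions and casts multiple attention variants at once.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cast_attention(seq, context):
    # One attention cast: pool an attended representation of `seq` against
    # `context`, then emit one scalar hint per word for feature augmentation.
    weights = softmax([dot(w, context) for w in seq])
    dim = len(seq[0])
    attended = [sum(a * w[i] for a, w in zip(weights, seq)) for i in range(dim)]
    return [dot(w, attended) for w in seq]

def augment(seq, context):
    # Append the scalar hint to each word embedding instead of replacing it.
    return [w + [s] for w, s in zip(seq, cast_attention(seq, context))]
```

Because each cast adds only a scalar per word, many attention variants can be stacked without blowing up the representation size.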
This work empirically investigates punctuation insertions as adversarial attacks on NLP systems. Data from experiments on three tasks, five datasets, and six models with four attacks show that punctuation insertions, when limited to a few symbols (apostrophes and hyphens), are a superior attack vector compared to character insertions due to 1) a lower after-attack accuracy (Aaft-atk) than alphabetical insertions; 2) higher semantic similarity between the resulting and original texts; and 3) resulting text that is easier and faster to read, as assessed by the Test of Word Reading...
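A minimal sketch of the attack surface being studied: insert a small budget of apostrophes or hyphens inside words. This is an illustrative perturbation generator, not the paper's search procedure (which would pick insertion points adversarially against a target model).

```python
import random

def punctuation_attack(text, budget=2, symbols="'-", seed=0):
    # Insert up to `budget` punctuation symbols at random positions inside
    # randomly chosen words; the character content of each word is preserved.
    rng = random.Random(seed)
    words = text.split()
    for _ in range(budget):
        i = rng.randrange(len(words))
        w = words[i]
        if len(w) < 2:
            continue  # too short to split internally
        pos = rng.randrange(1, len(w))
        words[i] = w[:pos] + rng.choice(symbols) + w[pos:]
    return " ".join(words)
```

Because only punctuation is added, stripping the inserted symbols recovers the original text exactly, which is why such perturbations preserve semantics and readability better than alphabetical insertions.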
Knowledge Base Question Answering (KBQA) aims to answer natural language questions with a large-scale structured knowledge base (KB). Despite advancements with large language models (LLMs), KBQA still faces challenges in weak KB awareness, the imbalance between effectiveness and efficiency, and high reliance on annotated data. To address these challenges, we propose KBQA-o1, a novel agentic KBQA method based on Monte Carlo Tree Search (MCTS). It introduces a ReAct-based agent process for stepwise logical form generation...
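At the heart of any MCTS-based search is the selection step that balances exploiting high-reward expansion steps against exploring rarely visited ones. A generic UCB1 selection sketch (the node layout and exploration constant are illustrative; KBQA-o1's reward model and expansion policy are specific to logical form generation):

```python
import math

def ucb_select(children, c=1.4):
    # Pick the child maximizing UCB1: average reward plus an exploration
    # bonus that shrinks as a node accumulates visits.
    total = sum(ch["visits"] for ch in children)
    def ucb(ch):
        if ch["visits"] == 0:
            return float("inf")  # always try unvisited expansions first
        exploit = ch["value"] / ch["visits"]
        explore = c * math.sqrt(math.log(total) / ch["visits"])
        return exploit + explore
    return max(children, key=ucb)
```

In a stepwise logical-form setting, each child would correspond to one candidate next clause, with the reward coming from executing or scoring the partially built form.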
Many popular knowledge graphs such as Freebase, YAGO or DBPedia maintain a list of non-discrete attributes for each entity. Intuitively, attributes such as height, price or population count are able to richly characterize entities in knowledge graphs. This additional source of information may help alleviate the inherent sparsity and incompleteness problem that is prevalent in knowledge graphs. Unfortunately, many state-of-the-art relational learning models ignore this information due to the challenging nature of dealing with non-discrete data types in inherently binary-natured knowledge graphs. In...
Temporal gates play a significant role in modern recurrent-based neural encoders, enabling fine-grained control over recursive compositional operations over time. In recurrent models such as the long short-term memory (LSTM), temporal gates control the amount of information retained or discarded over time, not only playing an important role in influencing the learned representations but also serving as protection against vanishing gradients. This paper explores the idea of learning temporal gates for sequence pairs (question and answer), jointly influencing the learned representations in a pairwise...
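The pairwise-gating idea can be sketched as a sigmoid gate conditioned on both the current input of one sequence and a summary of its partner sequence; the weight vectors and summary here are illustrative stand-ins for learned parameters and a learned encoder.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def paired_gate(x_t, partner_summary, w_x, w_p, b):
    # A temporal gate jointly conditioned on the current input of one
    # sequence (x_t) and a pooled summary of its paired sequence
    # (question <-> answer), so each side modulates the other's memory.
    z = b + sum(w * x for w, x in zip(w_x, x_t)) \
          + sum(w * p for w, p in zip(w_p, partner_summary))
    return sigmoid(z)
```

In a standard LSTM the partner term is absent; adding it lets the answer encoder forget or retain information depending on what the question actually asks about.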
We propose MRU (Multi-Range Reasoning Units), a new fast compositional encoder for machine comprehension (MC). Our proposed encoders are characterized by multi-ranged gating, executing a series of parameterized contract-and-expand layers for learning gating vectors that benefit from both long and short-term dependencies. The aims of our approach are as follows: (1) learning representations that are concurrently aware of long and short-term context, (2) modeling relationships between intra-document blocks, and (3) fast and efficient sequence encoding. We show...
We propose DecaProp (Densely Connected Attention Propagation), a new densely connected neural architecture for reading comprehension (RC). There are two distinct characteristics of our model. Firstly, our model densely connects all pairwise layers of the network, modeling relationships between the passage and query across all hierarchical levels. Secondly, the dense connectors in our network are learned via attention instead of standard residual skip-connectors. To this end, we propose novel Bidirectional Attention Connectors (BAC) to efficiently...
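One direction of such an attention connector can be sketched as aligning every passage position with the query and returning attended query summaries to be concatenated into later layers; the dot-product scoring and toy dimensions are illustrative simplifications of BAC, which additionally compresses the connector output.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attention_connector(passage, query):
    # For each passage position, attend over the query and return the
    # attended query summary; this replaces a plain residual skip-connection
    # with a learned, alignment-aware one.
    p2q = []
    for p in passage:
        weights = softmax([dot(p, q) for q in query])
        dim = len(query[0])
        p2q.append([sum(w * q[i] for w, q in zip(weights, query))
                    for i in range(dim)])
    return p2q
```

Running the same operation with the roles swapped gives the query-to-passage direction, and wiring these connectors between every pair of layers yields the dense topology the abstract describes.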
Federated learning (FL) enables multiple data owners to build machine learning models collaboratively without exposing their private local data. In order for FL to achieve widespread adoption, it is important to balance the need for performance, privacy-preservation and interpretability, especially in mission critical applications such as finance and healthcare. Thus, interpretable federated learning (IFL) has become an emerging topic of research attracting significant interest from academia and industry alike. Its...
Dialogue systems and large language models (LLMs) have gained considerable attention. However, the direct utilization of LLMs as task-oriented dialogue (TOD) models has been found to underperform compared to smaller task-specific models. Nonetheless, it is crucial to acknowledge their significant potential and to explore improved approaches for leveraging their impressive abilities. Motivated by the goal of leveraging LLMs, we propose an alternative approach called User-Guided Response Optimization (UGRO) to combine them with a TOD model...
Emotion recognition in conversations (ERC) has gained increasing attention, where contextual information modeling and multimodal fusion have been the focus of challenges in recent years. In this paper, we propose a Multi-Scale Receptive Field Graph model (MSRFG) to tackle the challenges of ERC. Specifically, MSRFG constructs multi-scale perception graphs and learns them via parallel receptive field paths. To compensate for the deficiency of temporal learning in the graph network, MSRFG injects temporal dependencies into the network, modeling relationships between...
Mathematical questioning is crucial for assessing students' problem-solving skills. Since manually creating such questions requires substantial effort, automatic methods have been explored. Existing state-of-the-art models rely on fine-tuning strategies and struggle to generate questions that heavily involve multiple steps of logical and arithmetic reasoning. Meanwhile, large language models (LLMs) such as ChatGPT have excelled in many NLP tasks involving such reasoning. Nonetheless, their applications in generating educational questions are...
Large Language Models (LLMs), which bridge the gap between human language understanding and complex problem-solving, achieve state-of-the-art performance on several NLP tasks, particularly in few-shot and zero-shot settings. Despite the demonstrable efficacy of LLMs, due to constraints on computational resources, users have to engage with open-source models or outsource the entire training process to third-party platforms. However, research has demonstrated that language models are susceptible to potential security...
Dating and romantic relationships not only play a huge role in our personal lives but also collectively influence and shape society. Today, many romantic partnerships originate from the Internet, signifying the importance of technology and the web in modern dating. In this paper, we present a text-based computational approach for estimating the relationship compatibility of two users on social media. Unlike previous works that propose reciprocal recommender systems for online dating websites, we devise a distant supervision heuristic to...
Learning English grammar is a very challenging task for many students, especially non-native speakers. To learn grammar well, it is important to understand its concepts with lots of practice on exercise questions. Previous recommendation systems for language learning mainly focused on recommending reading materials and vocabulary. Different from reading material and vocabulary recommendations, grammar question recommendation should recommend questions that have grammatical structure and usage similar to those of interest. The content similarity calculation...