Xian-Ling Mao

ORCID: 0000-0001-6795-2311
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • Natural Language Processing Techniques
  • Multimodal Machine Learning Applications
  • Speech and dialogue systems
  • Advanced Image and Video Retrieval Techniques
  • Advanced Graph Neural Networks
  • Advanced Text Analysis Techniques
  • Text and Document Classification Technologies
  • Recommender Systems and Techniques
  • Video Surveillance and Tracking Methods
  • Image Retrieval and Classification Techniques
  • Sentiment Analysis and Opinion Mining
  • Domain Adaptation and Few-Shot Learning
  • Biomedical Text Mining and Ontologies
  • Web Data Mining and Analysis
  • Data Quality and Management
  • Expert finding and Q&A systems
  • Video Analysis and Summarization
  • Advanced Bandit Algorithms Research
  • Information Retrieval and Search Behavior
  • Text Readability and Simplification
  • Speech Recognition and Synthesis
  • Algorithms and Data Compression
  • Complex Network Analysis Techniques
  • Advanced Neural Network Applications

Beijing Institute of Technology
2015-2024

Beijing Computing Center
2021-2023

Microsoft (Finland)
2021-2022

Microsoft Research (India)
2021

Beijing Haidian Hospital
2020

University of Science and Technology of China
2020

Zhejiang Lab
2020

Microsoft Research (United Kingdom)
2019

China Waterborne Transport Research Institute
2017

Peking University
2010-2012

Zewen Chi, Li Dong, Furu Wei, Nan Yang, Saksham Singhal, Wenhui Wang, Xia Song, Xian-Ling Mao, Heyan Huang, Ming Zhou. Proceedings of the 2021 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2021.

10.18653/v1/2021.naacl-main.280 article EN cc-by Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2021-01-01

In this work we focus on transferring supervision signals of natural language generation (NLG) tasks between multiple languages. We propose to pretrain the encoder and decoder a sequence-to-sequence model under both monolingual cross-lingual settings. The pre-training objective encourages represent different languages in shared space, so that can conduct zero-shot transfer. After procedure, use data fine-tune pre-trained downstream NLG tasks. Then trained single be directly evaluated beyond...

10.1609/aaai.v34i05.6256 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Knowledge graph (KG) plays an increasingly important role in recommender systems. Recently, neural networks (GNNs) based model has gradually become the theme of knowledge-aware recommendation (KGR). However, there is a natural deficiency for GNN-based KGR models, that is, sparse supervised signal problem, which may make their actual performance drop to some extent. Inspired by recent success contrastive learning mining signals from data itself, this paper, we focus on exploring KG-aware and...

10.1145/3477495.3532025 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2022-07-06

Joint entity and relation extraction is an essential task in natural language processing knowledge graph construction. Existing approaches usually decompose the joint into several basic modules or steps to make it easy conduct. However, such a paradigm ignores fact that three elements of triple are interdependent indivisible. Therefore, previous methods suffer from problems cascading errors redundant information. To address these issues, this paper, we propose novel model, named OneRel,...

10.1609/aaai.v36i10.21379 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

Session-based recommendation (SBR) is a challenging task, which aims at recommending items based on anonymous behavior sequences. Almost all the existing solutions for SBR model user preference only current session without exploiting other sessions, may contain both relevant and irrelevant item-transitions to session. This paper proposes novel approach, called Global Context Enhanced Graph Neural Networks (GCE-GNN) exploit item transitions over sessions in more subtle manner better inferring...

10.1145/3397271.3401142 preprint EN 2020-07-25

Bundle recommendation aims to recommend the user a bundle of items as whole. Previous models capture user’s preferences on both and association items. Nevertheless, they usually neglect diversity intents adopting fail disentangle in representations. In real scenario recommendation, intent may be naturally distributed different bundles that (Global view). And contain multiple (Local Each view has its advantages for disentangling: 1) global view, more are involved present each intent, which...

10.1609/aaai.v36i4.20359 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

Chinese Spell Checking (CSC) aims to detect and correct erroneous characters for usergenerated text in language.Most of the spelling errors are misused semantically, phonetically or graphically similar characters.Previous attempts notice this phenomenon try utilize similarity relationship task.However, these methods use either heuristics handcrafted confusion sets predict character.In paper, we propose a spell checker called REALISE, by directly leveraging multimodal information...

10.18653/v1/2021.findings-acl.64 article EN cc-by 2021-01-01

Zewen Chi, Shaohan Huang, Li Dong, Shuming Ma, Bo Zheng, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Furu Wei. Proceedings of the 60th Annual Meeting Association for Computational Linguistics (Volume 1: Long Papers). 2022.

10.18653/v1/2022.acl-long.427 article EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2022-01-01

Incorporating Knowledge Graphs (KG) into recommeder system as side information has attracted considerable attention. Recently, the technical trend of Knowledge-aware Recommendation (KGR) is to develop end-to-end models based on graph neural networks (GNNs). However, extremely sparse user-item interactions significantly degrade performance GNN-based models, from following aspects: 1) interaction, itself, means inadequate supervision signals and limits supervised models; 2) combination (CF...

10.1145/3511808.3557358 article EN Proceedings of the 31st ACM International Conference on Information & Knowledge Management 2022-10-16

Aspect-based sentiment analysis (ABSA) is a fine-grained task that aims to align aspects and corresponding sentiments for aspect-specific polarity inference. It challenging because sentence may contain multiple or complicated (e.g., conditional, coordinating, adversative) relations. Recently, exploiting dependency syntax information with graph neural networks has been the most popular trend. Despite its success, methods heavily rely on tree pose challenges in accurately modeling alignment of...

10.18653/v1/2022.findings-acl.144 article EN cc-by Findings of the Association for Computational Linguistics: ACL 2022 2022-01-01

Cross-modal hashing has been widely used in multimedia retrieval tasks due to its fast speed and low storage cost. Recently, many deep unsupervised cross-modal methods have proposed deal the unlabeled datasets. These usually construct an instance similarity matrix by fusing image text modality-specific matrices as guiding information train networks. However, most of them directly use cosine similarities between bag-of-words (BoW) vectors datapoints define matrix, which fails mine semantic...

10.1109/tmm.2023.3243608 article EN IEEE Transactions on Multimedia 2023-01-01

Aspect Sentiment Triplet Extraction (ASTE) has become an emerging task in sentiment analysis research, aiming to extract triplets of the aspect term, its corresponding opinion and associated polarity from a given sentence. Recently, many neural networks based models with different tagging schemes have been proposed, but almost all them their limitations: heavily relying on 1) prior assumption that each word is only single role (e.g., or etc. ) 2) word-level interactions treating...

10.1609/aaai.v37i11.26547 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

In this work, we present an information-theoretic framework that formulates cross-lingual language model pre-training as maximizing mutual information between multilingual-multi-granularity texts. The unified view helps us to better understand the existing methods for learning representations. More importantly, inspired by framework, propose a new task based on contrastive learning. Specifically, regard bilingual sentence pair two views of same meaning and encourage their encoded...

10.48550/arxiv.2007.07834 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Most of the unsupervised hashing methods usually map images into semantic similarity-preserving hash codes by constructing local similarity structure as guiding information, i.e., treating each point similar to its k nearest neighbours. However, for an image, some neighbours may be dissimilar it, they are noisy datapoints which will damage retrieval performance. Thus, tackle this problem, in paper, we propose a novel deep method, called MLS3RDUH, can reduce further enhance Specifically,...

10.24963/ijcai.2020/479 article EN 2020-07-01

The task of table structure recognition aims to recognize the internal a table, which is key step make machines understand tables. Currently, there are lots studies on this for different file formats such as ASCII text and HTML. It also attracts attention structures in PDF files. However, it hard existing methods accurately complicated tables contain spanning cells occupy at least two columns or rows. To address issue, we propose novel graph neural network recognizing files, named GraphTSR....

10.48550/arxiv.1908.04729 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Due to their high retrieval efficiency and low storage cost, cross-modal hashing methods have attracted considerable attention. Generally, compared with shallow methods, deep can achieve a more satisfactory performance by integrating feature learning hash codes optimizing into same framework. However, most existing either cannot learn unified code for the two correlated data-points of different modalities in database instance or guide feedback function procedure, enhance accuracy. To address...

10.1109/tkde.2020.2987312 article EN IEEE Transactions on Knowledge and Data Engineering 2020-04-16

Zewen Chi, Li Dong, Shuming Ma, Shaohan Huang, Saksham Singhal, Xian-Ling Mao, Heyan Xia Song, Furu Wei. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 2021.

10.18653/v1/2021.emnlp-main.125 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021-01-01

Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, Heyan Furu Wei. Proceedings of the 59th Annual Meeting Association for Computational Linguistics and 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.

10.18653/v1/2021.acl-long.265 article EN cc-by 2021-01-01

Sequential recommendation (SR) aims to predict the subsequent behaviors of users by understanding their successive historical behaviors. Recently, some methods for SR are devoted alleviating data sparsity problem (i.e., limited supervised signals training), which take account contrastive learning incorporate self-supervised into SR. Despite achievements, it is far from enough learn informative user/item embeddings due inadequacy modeling complex collaborative information and co-action...

10.1145/3511808.3557404 article EN Proceedings of the 31st ACM International Conference on Information & Knowledge Management 2022-10-16

Recently, deep supervised hashing methods have shown state-of-the-art performance by integrating feature learning and hash codes into an end-to-end network to generate high-quality codes. However, it is still a challenge learn discriminative for preserving the label information of images efficiently. To overcome this difficulty, in paper, we propose novel Partial-Softmax Loss based Deep Hashing, called PSLDH, Specifically, PSLDH first trains category code each category, will preserve...

10.1145/3442381.3449825 article EN 2021-04-19
Coming Soon ...