NFDI4DS | UHH-SEMS - Publication Details

A Novel Multiresolution Spatiotemporal Saliency Detection Model and Its Applications in Image and Video Compression

OPENALEX - Publications

Chenlei Guo Liming Zhang

Salient areas in natural scenes are generally regarded as which the human eye will typically focus on, and finding these is key step object detection. In computer vision, many models have been proposed to simulate behavior of eyes such SaliencyToolBox (STB), Neuromorphic Vision Toolkit (NVT), others, but they demand high computational cost computing useful results mostly relies on their choice parameters. Although some region-based approaches were reduce complexity feature maps, still not...

10.1109/tip.2009.2030969 article EN IEEE Transactions on Image Processing 2009-08-25

Spatio-temporal Saliency detection using phase spectrum of quaternion fourier transform

OPENALEX - Publications

Chenlei Guo Qi Ma Liming Zhang

Salient areas in natural scenes are generally regarded as the candidates of attention focus human eyes, which is key stage object detection. In computer vision, many models have been proposed to simulate behavior eyes such SaliencyToolBox (STB), neuromorphic vision toolkit (NVT) and etc., but they demand high computational cost their remarkable results mostly rely on choice parameters. Recently a simple fast approach based Fourier transform called spectral residual (SR) was proposed, used SR...

10.1109/cvpr.2008.4587715 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2008-06-01

Knowledge Distillation from Internal Representations

OPENALEX - Publications

Gustavo Aguilar Ling Yuan Yu Zhang Benjamin Yao Xing Fan and 1 more

Knowledge distillation is typically conducted by training a small model (the student) to mimic large and cumbersome teacher). The idea compress the knowledge from teacher using its output probabilities as soft-labels optimize student. However, when considerably large, there no guarantee that internal of will be transferred into student; even if student closely matches soft-labels, representations may different. This mismatch can undermine generalization capabilities originally intended In...

10.1609/aaai.v34i05.6229 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Overcoming Catastrophic Forgetting During Domain Adaptation of Seq2seq Language Generation

OPENALEX - Publications

Dingcheng Li Zheng Chen Eunah Cho Jie Hao Xiaohu Liu and 3 more

Dingcheng Li, Zheng Chen, Eunah Cho, Jie Hao, Xiaohu Liu, Fan Xing, Chenlei Guo, Yang Liu. Proceedings of the 2022 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2022.

10.18653/v1/2022.naacl-main.398 article EN cc-by Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2022-01-01

A Self-Learning Framework for Large-Scale Conversational AI Systems

OPENALEX - Publications

Xiaohu Liu Chenlei Guo Benjamin Yao Ruhi Sarikaya

10.1109/mci.2024.3363971 article EN IEEE Computational Intelligence Magazine 2024-04-05

Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

OPENALEX - Publications

Pragaash Ponnusamy Alireza Roshan Ghias Chenlei Guo Ruhi Sarikaya

Today, most of the large-scale conversational AI agents such as Alexa, Siri, or Google Assistant are built using manually annotated data to train different components system including Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Entity Resolution (ER). Typically, accuracy machine learning models in these improved by transcribing annotating data. As scope systems increase cover more scenarios domains, manual annotation improve becomes prohibitively costly time...

10.1609/aaai.v34i08.7022 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Automatic identification and delineation of germ layer components in H&E stained images of teratomas derived from human and nonhuman primate embryonic stem cells

OPENALEX - Publications

Ramamurthy Bhagavatula Matthew Fickus W.S. Kelly Chenlei Guo John A. Ozolek and 2 more

We present a methodology for the automatic identification and delineation of germ-layer components in H&E stained images teratomas derived from human nonhuman primate embryonic stem cells. A knowledge understanding biology these cells may lead to advances tissue regeneration repair, treatment genetic developmental syndromes, drug testing discovery. As teratoma is chaotic organization tissues three primary germ layers, often multiple tissues, each having complex unpredictable positions,...

10.1109/isbi.2010.5490168 article EN 2010-04-01

Feedback‐based self‐learning in large‐scale conversational AI agents

OPENALEX - Publications

Pragaash Ponnusamy Alireza Roshan Ghias Yi Yi Benjamin Yao Chenlei Guo and 1 more

Abstract Today, most of the large‐scale conversational AI agents such as Alexa, Siri, or Google Assistant are built using manually annotated data to train different components system including automatic speech recognition (ASR), natural language understanding (NLU), and entity resolution (ER). Typically, accuracy machine learning models in these improved by transcribing annotating data. As scope systems increase cover more scenarios domains, manual annotation improve becomes prohibitively...

10.1609/aaai.12025 article EN cc-by AI Magazine 2021-12-01

Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

OPENALEX - Publications

Pragaash Ponnusamy Alireza Ghias Yi Yi Benjamin Yao Chenlei Guo and 1 more

Today, most of the large-scale conversational AI agents such as Alexa, Siri, or Google Assistant are built using manually annotated data to train different components system including automatic speech recognition (ASR), natural language understanding (NLU), and entity resolution (ER). Typically, accuracy machine learning models in these improved by transcribing annotating data. As scope systems increase cover more scenarios domains, manual annotation improve becomes prohibitively costly time...

10.1609/aimag.v42i4.15102 article EN AI Magazine 2022-01-12

KG-ECO: Knowledge Graph Enhanced Entity Correction For Query Rewriting

OPENALEX - Publications

Jinglun Cai Mingda Li Ziyan Jiang Eunah Cho Zheng Chen and 3 more

Query Rewriting (QR) plays a critical role in large-scale dialogue systems for reducing frictions. When there is an entity error, it imposes extra challenges system to produce satisfactory responses. In this work, we propose KG-ECO: Knowledge Graph enhanced Entity COrrection query rewriting, correction with corrupt span detection and retrieval/re-ranking functionalities.To boost the model performance, incorporate (KG) provide structural information (neighboring entities encoded by graph...

10.1109/icassp49357.2023.10096826 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Personalized Search-based Query Rewrite System for Conversational AI

OPENALEX - Publications

Eunah Cho Ziyan Jiang Jie Hao Zheng Chen Saurabh Gupta and 2 more

Query rewrite (QR) is an emerging component in conversational AI systems, reducing user defect. User defect caused by various reasons, such as errors the spoken dialogue system, users' slips of tongue or their abridged language. Many defects stem from personalized factors, user's speech pattern, dialect, preferences. In this work, we propose a search-based QR framework, which focuses on automatic reduction We build index for each user, encompasses diverse affinity layers to reflect personal...

10.18653/v1/2021.nlp4convai-1.17 article EN cc-by 2021-01-01

CGF: Constrained Generation Framework for Query Rewriting in Conversational AI

OPENALEX - Publications

Jie Hao Yang Liu Xing Fan Saurabh Gupta Saleh Soltan and 4 more

Jie Hao, Yang Liu, Xing Fan, Saurabh Gupta, Saleh Soltan, Rakesh Chada, Pradeep Natarajan, Chenlei Guo, Gokhan Tur. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2022.

10.18653/v1/2022.emnlp-industry.48 article EN cc-by 2022-01-01

Real-Time Robust Signal Space Separation for Magnetoencephalography

OPENALEX - Publications

Chenlei Guo Xin Li Samu Taulu Wei Wang Doug Weber

In this paper, we develop a robust signal space separation (rSSS) algorithm for real-time magnetoencephalography (MEG) data processing. rSSS is based on the spatial (SSS) method and it applies regression to automatically detect remove bad MEG channels so that results of SSS are not distorted. We extend existing via three important new contributions: 1) low-rank solver efficiently performs matrix operations; 2) subspace iteration scheme selects using low-order spherical harmonic functions; 3)...

10.1109/tbme.2010.2043358 article EN IEEE Transactions on Biomedical Engineering 2010-02-19

Knowledge Distillation from Internal Representations

OPENALEX - Publications

Gustavo Aguilar Ling Yuan Yu Zhang Benjamin Yao Xing Fan and 1 more

Knowledge distillation is typically conducted by training a small model (the student) to mimic large and cumbersome teacher). The idea compress the knowledge from teacher using its output probabilities as soft-labels optimize student. However, when considerably large, there no guarantee that internal of will be transferred into student; even if student closely matches soft-labels, representations may different. This mismatch can undermine generalization capabilities originally intended In...

10.48550/arxiv.1910.03723 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Graph Enhanced Query Rewriting for Spoken Language Understanding System

OPENALEX - Publications

Siyang Yuan Saurabh Gupta Xing Fan Xiaohu Liu Yang Liu and 1 more

Query rewriting (QR) is an increasingly important component in voice assistant systems to reduce customer friction caused by errors a spoken language understanding pipeline. These originate from various sources such as Automatic Speech Recognition (ASR) and Natural Language Understanding (NLU) modules. In this work, we construct user interaction graph their queries using data mined Markov Chain Model [1], introduce self-supervised pre-training process for learning query embeddings leveraging...

10.1109/icassp39728.2021.9413840 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021-05-13

Contextual Rephrase Detection for Reducing Friction in Dialogue Systems

OPENALEX - Publications

Zhuoyi Wang Saurabh Gupta Jie Hao Xing Fan Dingcheng Li and 2 more

For voice assistants like Alexa, Google Assistant, and Siri, correctly interpreting users’ intentions is of utmost importance. However, users sometimes experience friction with these assistants, caused by errors from different system components or user such as slips the tongue. Users tend to rephrase their queries until they get a satisfactory response. Rephrase detection used identify rephrases has long been treated task pairwise input, which does not fully utilize contextual information...

10.18653/v1/2021.emnlp-main.143 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021-01-01

Improving Contextual Query Rewrite for Conversational AI Agents through User-preference Feedback Learning

OPENALEX - Publications

Zhongkai Sun Yingxue Zhou Jie Hao Xing Fan Yanbin Lu and 3 more

Zhongkai Sun, Yingxue Zhou, Jie Hao, Xing Fan, Yanbin Lu, Chengyuan Ma, Wei Shen, Chenlei Guo. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2023.

10.18653/v1/2023.emnlp-industry.41 article EN cc-by 2023-01-01

Personalized Query Rewriting in Conversational AI Agents

OPENALEX - Publications

Alireza Roshan Ghias Clint Solomon Mathialagan Pragaash Ponnusamy Lambert Mathias Chenlei Guo

Spoken language understanding (SLU) systems in conversational AI agents often experience errors the form of misrecognitions by automatic speech recognition (ASR) or semantic gaps natural (NLU). These easily translate to user frustrations, particularly so recurrent events e.g. regularly toggling an appliance, calling a frequent contact, etc. In this work, we propose query rewriting approach leveraging users' historically successful interactions as memory. We present neural retrieval model and...

10.48550/arxiv.2011.04748 preprint EN other-oa arXiv (Cornell University) 2020-01-01

An Attention Selection Model with Visual Memory and Online Learning

OPENALEX - Publications

Chenlei Guo Liming Zhang

In this paper, an attention selection model with visual memory and online learning is proposed, which has three parts: Sensory Mapping (SM), Cognitive (CM) Motor (MM). CM the novelty of our incorporates learning. order to mimic memory, we put forward Amnesic Incremental Hierachical Discriminant Regression (AIHDR) Tree amnesic function guide deletion redundant information tree. Experimental results show that AIHDR tree better performance in retrieval speed accuracy than IHDR/HDR...

10.1109/ijcnn.2007.4371145 article EN IEEE International Conference on Neural Networks/IEEE ... International Conference on Neural Networks 2007-08-01

A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning

OPENALEX - Publications

Md Mofijul Islam Gustavo Aguilar Pragaash Ponnusamy Clint Solomon Mathialagan Chengyuan Ma and 1 more

Subword tokenization is a commonly used input pre-processing step in most recent NLP models. However, it limits the models’ ability to leverage end-to-end task learning. Its frequency-based vocabulary creation compromises low-resource languages, leading models produce suboptimal representations. Additionally, dependency on fixed subword adaptability across languages and domains. In this work, we propose vocabulary-free neural tokenizer by distilling segmentation information from...

10.18653/v1/2022.repl4nlp-1.10 article EN cc-by 2022-01-01

PENTATRON: PErsonalized coNText-Aware Transformer for Retrieval-based cOnversational uNderstanding

OPENALEX - Publications

Niranjan Uma Naresh Ziyan Jiang Ankit Ankit Sung‐Jin Lee Jie Hao and 2 more

Niranjan Uma Naresh, Ziyan Jiang, Ankit Ankit, Sungjin Lee, Jie Hao, Xing Fan, Chenlei Guo. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2022.

10.18653/v1/2022.emnlp-industry.7 article EN cc-by 2022-01-01

VAE based Text Style Transfer with Pivot Words Enhancement Learning

OPENALEX - Publications

Haoran Xu Sixing Lu Zhongkai Sun Chengyuan Ma Chenlei Guo

Text Style Transfer (TST) aims to alter the underlying style of source text another specific while keeping same content. Due scarcity high-quality parallel training data, unsupervised learning has become a trending direction for TST tasks. In this paper, we propose novel VAE based with pivOt Words Enhancement leaRning (VT-STOWER) method which utilizes Variational AutoEncoder (VAE) and external embeddings learn semantics distribution jointly. Additionally, introduce pivot words learning, is...

10.48550/arxiv.2112.03154 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Pre-Training for Query Rewriting in A Spoken Language Understanding System

OPENALEX - Publications

Zheng Chen Xing Fan Ling Yuan Lambert Mathias Chenlei Guo

Query rewriting (QR) is an increasingly important technique to reduce customer friction caused by errors in a spoken language understanding pipeline, where the originate from various sources such as speech recognition errors, or entity resolution errors. In this work, we first propose neural-retrieval based approach for query rewriting. Then, inspired wide success of pre-trained contextual embeddings, and also way compensate insufficient QR training data, language-modeling (LM) pre-train...

10.48550/arxiv.2002.05607 preprint EN cc-by-nc-sa arXiv (Cornell University) 2020-01-01