NFDI4DS | UHH-SEMS - Publication Details

Xin Liu

ORCID: 0000-0003-2802-594X

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100352321

Research Areas

Topic Modeling
Natural Language Processing Techniques
Advanced Text Analysis Techniques
Multimodal Machine Learning Applications
Speech and dialogue systems
Semantic Web and Ontologies
Machine Learning in Healthcare
Speech Recognition and Synthesis
Sentiment Analysis and Opinion Mining
Quantum Information and Cryptography
Text and Document Classification Technologies
Software Engineering Research
Biomedical Text Mining and Ontologies
Advanced Graph Neural Networks
Text Readability and Simplification
Quantum and electron transport phenomena
Handwritten Text Recognition Techniques
Radiomics and Machine Learning in Medical Imaging
Computational and Text Analysis Methods
Educational Technology and Pedagogy
Advanced Computational Techniques and Applications
Artificial Intelligence in Healthcare and Education
Quantum optics and atomic interactions
Explainable Artificial Intelligence (XAI)
Phonetics and Phonology Research

Micron (United States)
2025

Institute of Technology of Cambodia
2025

Xiamen University
2021-2025

Peng Cheng Laboratory
2020-2024

Hohai University
2024

University of Hong Kong
2023

Hong Kong University of Science and Technology
2023

National Institute of Advanced Industrial Science and Technology
2023

Beijing Information Science & Technology University
2022

China Electronics Technology Group Corporation
2022

A re-examination of text categorization methods

OPENALEX - Publications

Yiming Yang Xin Liu

Article Free Access Share on A re-examination of text categorization methods Authors: Yiming Yang School Computer Science, Carnegie Mellon University, Pittsburgh, PA PAView Profile , Xin Liu Authors Info & Claims SIGIR '99: Proceedings the 22nd annual international ACM conference Research and development in information retrievalAugust 1999Pages 42–49https://doi.org/10.1145/312624.312647Published:01 August 1999Publication History 1,646citation8,975DownloadsMetricsTotal Citations1,646Total...

10.1145/312624.312647 article EN 1999-08-01

Generic text summarization using relevance measure and latent semantic analysis

OPENALEX - Publications

Yihong Gong Xin Liu

In this paper, we propose two generic text summarization methods that create summaries by ranking and extracting sentences from the original documents. The first method uses standard IR to rank sentence relevances, while second latent semantic analysis technique identify semantically important sentences, for summary creations. Both strive select are highly ranked different each other. This is an attempt a with wider coverage of document's main content less redundancy. Performance evaluations...

10.1145/383952.383955 article EN 2001-09-01

The BQ Corpus: A Large-scale Domain-specific Chinese Corpus For Sentence Semantic Equivalence Identification

OPENALEX - Publications

Chen Jing Qingcai Chen Xin Liu Haijun Yang Daohe Lu and 1 more

This paper introduces the Bank Question (BQ) corpus, a Chinese corpus for sentence semantic equivalence identification (SSEI). The BQ contains 120,000 question pairs from 1-year online bank custom service logs. To efficiently process and annotate questions such large scale of logs, this proposes clustering based annotation method to achieve with same intent. First, deduplicated answer are clustered into stacks by Word Mover’s Distance (WMD) Affinity Propagation (AP) algorithm. Then,...

10.18653/v1/d18-1536 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2018-01-01

ChatGPT Evaluation on Sentence Level Relations: A Focus on Temporal, Causal, and Discourse Relations

OPENALEX - Publications

Chunkit Chan Jay J. Cheng Weiqi Wang Yuxin Jiang Tianqing Fang and 2 more

This paper aims to quantitatively evaluate the performance of ChatGPT, an interactive large language model, on inter-sentential relations such as temporal relations, causal and discourse relations. Given ChatGPT's promising across various tasks, we proceed carry out thorough evaluations whole test sets 11 datasets, including PDTB2.0-based, dialogue-based To ensure reliability our findings, employ three tailored prompt templates for each task, zero-shot template, engineering (PE) in-context...

10.48550/arxiv.2304.14827 preprint EN other-oa arXiv (Cornell University) 2023-01-01

A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine

OPENALEX - Publications

Hanguang Xiao Feizhong Zhou Xingyue Liu Tianqi Liu Жипенг Ли and 2 more

10.2139/ssrn.5031720 preprint EN 2024-01-01

An automatic system to identify heart disease risk factors in clinical texts over time

OPENALEX - Publications

Qingcai Chen Haodi Li Buzhou Tang Xiaolong Wang Xin Liu and 7 more

Despite recent progress in prediction and prevention, heart disease remains a leading cause of death. One preliminary step prevention is risk factor identification. Many studies have been proposed to identify factors associated with disease; however, none attempted all factors. In 2014, the National Center Informatics for Integrating Biology Beside (i2b2) issued clinical natural language processing (NLP) challenge that involved track (track 2) identifying texts over time. This aimed...

10.1016/j.jbi.2015.09.002 article EN cc-by-nc-nd Journal of Biomedical Informatics 2015-09-08

Stroke Sequence-Dependent Deep Convolutional Neural Network for Online Handwritten Chinese Character Recognition

OPENALEX - Publications

Xin Liu Baotian Hu Qingcai Chen Xiangping Wu Jinghan You

We propose a novel model, called stroke sequence-dependent deep convolutional neural network (SSDCNN), which uses the sequence information and eight-directional features of Chinese characters for online handwritten character recognition (OLHCCR). SSDCNN learns representation OLHCCs by incorporating natural strokes. Furthermore, it naturally incorporates features. First, inputs transforms into stacks feature maps following writing order Second, fixed-length, representations OLHCC are derived...

10.1109/tnnls.2019.2956965 article EN IEEE Transactions on Neural Networks and Learning Systems 2020-01-03

Attention-Driven Contextual Feature Fusion Network for Facial Videos-Based Depression Recognition

OPENALEX - Publications

Hanguang Xiao Xin Liu Tingting Zhou Lingling Qian Xiaoxuan Huang

10.2139/ssrn.5181599 preprint EN 2025-01-01

A Comprehensive Overhaul of Multimodal Assistant with Small Language Models

OPENALEX - Publications

Minjie Zhu Yichen Zhu Ning Liu Xin Liu Zhiyuan Xu and 2 more

Multimodal Large Language Models (MLLMs) have showcased impressive skills in tasks related to visual understanding and reasoning. Yet, their widespread application faces obstacles due the high computational demands during both training inference phases, restricting use a limited audience within research user communities. In this paper, we investigate design aspects of Small (MSLMs) propose an efficient multimodal assistant named Mipha, which is designed create synergy among various aspects:...

10.1609/aaai.v39i10.33194 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

FishDetectLLM: Multimodal instruction tuning with large language models for fish detection

OPENALEX - Publications

Jiaxin Zhu Shibai Yin Xin Liu Xingyang Wang Yee‐Hong Yang

10.1016/j.knosys.2025.113418 article EN Knowledge-Based Systems 2025-04-01

Programmable and Spatial Stiffness Gradient Substrates for Highly Robust Artificial Skins

OPENALEX - Publications

Qibin Zhuang Yiyi Zhang Lianjie Lu Xin Liu Xiao Wei and 9 more

Stretchable artificial skins have garnered great interest for their potential applications in real-time human-machine interaction and equipment operation status monitoring. The local stiffer structure areas on the substrates functional elements been verified to improve robustness of skins, but it remains challenging achieve robust sensing performance under mechanical deformation due large mismatch intricate fabrication process. Herein, we propose an easy strategy fabricating a substrate with...

10.1021/acssensors.4c03584 article EN ACS Sensors 2025-04-23

State and parameter estimation of the heat shock response system using Kalman and particle filters

OPENALEX - Publications

Xin Liu Mahesan Niranjan

Traditional models of systems biology describe dynamic biological phenomena as solutions to ordinary differential equations, which, when parameters in them are set correct values, faithfully mimic observations. Often parameter values tweaked by hand until desired results achieved, or computed from biochemical experiments carried out vitro. Of interest this article, is the use probabilistic modelling tools with which and unobserved variables, modelled hidden states, can be estimated limited...

10.1093/bioinformatics/bts161 article EN Bioinformatics 2012-04-26

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation

OPENALEX - Publications

Junying Chen Dongfang Li Qingcai Chen Wenxiu Zhou Xin Liu

Automatic diagnosis has attracted increasing attention but remains challenging due to multi-step reasoning. Recent works usually address it by reinforcement learning methods. However, these methods show low efficiency and require task-specific reward functions. Considering the conversation between doctor patient allows doctors probe for symptoms make diagnoses, process can be naturally seen as generation of a sequence including diagnoses. Inspired this, we reformulate automatic Sequence...

10.1609/aaai.v36i4.20365 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

A comprehensive survey of large language models and multimodal large language models in medicine

OPENALEX - Publications

Hanguang Xiao Feizhong Zhou Xingyue Liu Tianqi Liu Жипенг Ли and 2 more

10.1016/j.inffus.2024.102888 article EN Information Fusion 2024-12-01

LCSegNet: An Efficient Semantic Segmentation Network for Large-Scale Complex Chinese Character Recognition

OPENALEX - Publications

Xiangping Wu Qingcai Chen Yulun Xiao Wei Li Xin Liu and 1 more

Complex scene character recognition is a challenging yet important task in machine learning, especially for languages with large sets, such as Chinese, which composed of hieroglyphics large-scale categories and similar glyphs. Recently, state-of-the-art methods based on semantic segmentation have achieved great success parsing been applied text recognition. However, because limitations terms memory computation, they are only the small category tasks, tasks involving English alphabets digits....

10.1109/tmm.2020.3025696 article EN IEEE Transactions on Multimedia 2020-09-22

Recognition and extraction of named entities in online medical diagnosis data based on a deep neural network

OPENALEX - Publications

Xin Liu Yanju Zhou Zongrun Wang

10.1016/j.jvcir.2019.02.001 article EN Journal of Visual Communication and Image Representation 2019-02-01

DiscoPrompt: Path Prediction Prompt Tuning for Implicit Discourse Relation Recognition

OPENALEX - Publications

Chunkit Chan Xin Liu Jiayang Cheng Z. Z. Li Yangqiu Song and 2 more

Implicit Discourse Relation Recognition (IDRR) is a sophisticated and challenging task to recognize the discourse relations between arguments with absence of connectives. The sense labels for each relation follow hierarchical classification scheme in annotation process (Prasad et al., 2008), forming hierarchy structure. Most existing works do not well incorporate structure but focus on syntax features prior knowledge connectives manner pure text classification. We argue that it more...

10.18653/v1/2023.findings-acl.4 article EN cc-by Findings of the Association for Computational Linguistics: ACL 2022 2023-01-01

FolkScope: Intention Knowledge Graph Construction for E-commerce Commonsense Discovery

OPENALEX - Publications

Changlong Yu Weiqi Wang Xin Liu Jiaxin Bai Yangqiu Song and 4 more

Understanding users' intentions in e-commerce platforms requires commonsense knowledge. In this paper, we present FolkScope, an intention knowledge graph construction framework, to reveal the structure of humans' minds about purchasing items. As is usually ineffable and not expressed explicitly, it challenging perform information extraction. Thus, propose a new approach that leverages generation power large language models (LLMs) human-in-the-loop annotation semi-automatically construct...

10.18653/v1/2023.findings-acl.76 article EN cc-by Findings of the Association for Computational Linguistics: ACL 2022 2023-01-01

ASER: Towards Large-scale Commonsense Knowledge Acquisition via Higher-order Selectional Preference over Eventualities

OPENALEX - Publications

Hongming Zhang Xin Liu Haojie Pan Haowen Ke Jiefu Ou and 2 more

Commonsense knowledge acquisition and reasoning have long been a core artificial intelligence problem. However, in the past, there has lack of scalable methods to collect commonsense knowledge. In this paper, we propose develop principles for collecting based on selectional preference. We generalize definition preference from one-hop linguistic syntactic relations higher-order over graphs. Unlike previous (e.g., ConceptNet), (SP) only relies statistical distribution graphs, which can be...

10.48550/arxiv.2104.02137 preprint EN cc-by arXiv (Cornell University) 2021-01-01

OncoGPT: A Medical Conversational Model Tailored with Oncology Domain Expertise on a Large Language Model Meta-AI (LLaMA)

OPENALEX - Publications

Fujian Jia Xin Liu Lixi Deng Jiwen Gu Chunchao Pu and 4 more

In the past year, there has been a growing trend in applying Large Language Models (LLMs) to field of medicine, particularly with advent advanced language models such as ChatGPT developed by OpenAI. However, is limited research on LLMs specifically addressing oncology-related queries. The primary aim this was develop specialized model that demonstrates improved accuracy providing advice related oncology. We performed an extensive data collection online question-answer interactions centered...

10.48550/arxiv.2402.16810 preprint EN arXiv (Cornell University) 2024-02-26

Deep neural network-based recognition of entities in Chinese online medical inquiry texts

OPENALEX - Publications

Xin Liu Yanju Zhou Zongrun Wang

10.1016/j.future.2020.08.022 article EN Future Generation Computer Systems 2020-08-24

Condensed Convolution Neural Network by Attention over Self-attention for Stance Detection in Twitter

OPENALEX - Publications

Shengping Zhou Junjie Lin Lianzhi Tan Xin Liu

In the era of Web 2.0, people have become accustomed to expressing their attitudes and exchanging opinions on social media sites such as Twitter. It is critical for security business related applications make sense public implied in users' texts. Stance detection aims classify stances users hold towards certain targets FAVOR, AGAINST or NONE. literature, many efforts been paid neural network based stance avoid hand-crafted features. As a widely used structure, convolutional (CNN) can mine...

10.1109/ijcnn.2019.8851965 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2019-07-01

Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation

OPENALEX - Publications

Xin Liu Baosong Yang Dayiheng Liu Haibo Zhang Weihua Luo and 3 more

Xin Liu, Baosong Yang, Dayiheng Haibo Zhang, Weihua Luo, Min Haiying Jinsong Su. Proceedings of the 59th Annual Meeting Association for Computational Linguistics and 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.

10.18653/v1/2021.acl-long.468 article EN cc-by 2021-01-01

EAI-SIM: An Open-Source Embodied AI Simulation Framework with Large Language Models

OPENALEX - Publications

Guocai Liu Tao Sun Weihua Li Xiaohui Li Xin Liu and 1 more

10.1109/icca62789.2024.10591865 article EN 2024-06-18

Coming Soon ...