NFDI4DS | UHH-SEMS - Publication Details

Hierarchical Label-Enhanced Contrastive Learning for Chinese NER

OPENALEX - Publications

Chengyu Wang Shan Zhao Tianwei Yan Shezheng Song Wentao Ma and 2 more

Recently, character-word lattice structures have achieved promising results for Chinese named entity recognition (NER), reducing word segmentation errors and increasing boundary information character sequences. However, constructing the structure is complex time-consuming, thus these lattice-based models usually suffer from low inference speed. Moreover, quality of lexicon affects accuracy NER model. Since noise words can potentially confuse NER, limited coverage cause to degenerate into...

10.1109/tnnls.2025.3528416 article EN IEEE Transactions on Neural Networks and Learning Systems 2025-01-01

FRCL-MNER: A Finer Grained Rank-Based Contrastive Learning Framework for Multimodal NER

OPENALEX - Publications

Tianwei Yan Shan Zhao Wentao Ma Shezheng Song Chengyu Wang and 4 more

Multimodal named entity recognition (MNER) is an emerging field that aims to automatically detect entities and classify their categories, utilizing input text auxiliary resources such as images. While previous studies have leveraged object detectors preprocess images fuse textual semantics with corresponding image features, these methods often overlook the potential finer grained information within each modality may exacerbate error propagation due predetection. To address issues, we propose...

10.1109/tnnls.2025.3528567 article EN IEEE Transactions on Neural Networks and Learning Systems 2025-01-01

Enhancing Text-Based Person Search with Re-Ranking and Advanced Cross-Modal Alignment Techniques

OPENALEX - Publications

Yu Bai Wentao Ma Shan Zhao Tianwei Yan Shezheng Song and 2 more

10.2139/ssrn.5211784 preprint EN 2025-01-01

A Dual-Way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking

OPENALEX - Publications

Shezheng Song Shan Zhao Chengyu Wang Tianwei Yan Shasha Li and 2 more

Multimodal Entity Linking (MEL) aims at linking ambiguous mentions with multimodal information to entity in Knowledge Graph (KG) such as Wikipedia, which plays a key role many applications. However, existing methods suffer from shortcomings, including modality impurity noise raw image and textual representation, puts obstacles MEL. We formulate neural text matching problem where each (text image) is treated query, the model learns mapping query relevant candidate entities. This paper...

10.1609/aaai.v38i17.29867 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

LTACL: long-tail awareness contrastive learning for distantly supervised relation extraction

OPENALEX - Publications

Tianwei Yan Xiang Zhang Zhigang Luo

Abstract Distantly supervised relation extraction is an automatically annotating method for large corpora by classifying a bound of sentences with two same entities and the relation. Recent works exploit sound performance adopting contrastive learning to efficiently obtain instance representations under multi-instance framework. Though these methods weaken impact noisy labels, it ignores long-tail distribution problem in distantly sets fails capture mutual information different parts. We are...

10.1007/s40747-023-01226-w article EN cc-by Complex & Intelligent Systems 2023-09-28

HCL: A Hierarchical Contrastive Learning Framework for Zero-Shot Relation Extraction

OPENALEX - Publications

Tianwei Yan Shan Zhao Minghao Hu Mengzhu Wang Xiang Zhang and 2 more

Zero-shot relation extraction (ZSRE) is shown to become more significant in the current information system, which aims at predicting classes that lack annotations or have just never appeared during training. Previous works focus on projecting sentences with their corresponding descriptions an intermediate semantic space and searching nearest for unseen classes. Though these methods can achieve sound performance, they only obtain inferior via a trivial distance metric neglect interaction...

10.1109/tnnls.2024.3379527 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-04-02

A novel deep residual network-based incomplete information competition strategy for four-players Mahjong games

OPENALEX - Publications

Mingyan Wang Tianwei Yan Mingyuan Luo Wei Huang

10.1007/s11042-019-7682-5 article EN Multimedia Tools and Applications 2019-05-04

DWE+: Dual-Way Matching Enhanced Framework for Multimodal Entity Linking

OPENALEX - Publications

Shezheng Song Shasha Li Shan Zhao Xiaopeng Li Chengyu Wang and 5 more

Multimodal entity linking (MEL) aims to utilize multimodal information (usually textual and visual information) link ambiguous mentions unambiguous entities in knowledge base. Current methods facing main issues: (1)treating the entire image as input may contain redundant information. (2)the insufficient utilization of entity-related information, such attributes images. (3)semantic inconsistency between base its representation. To this end, we propose DWE+ for linking. could capture finer...

10.48550/arxiv.2404.04818 preprint EN arXiv (Cornell University) 2024-04-07

Towards Addressing Heterogeneity Of Data In Federated Learning

OPENALEX - Publications

Mingyu Sun Zhihua Chen Tianwei Yan Yan Yang Sikun Liu and 1 more

10.1109/icccs61882.2024.10603332 article EN 2022 7th International Conference on Computer and Communication Systems (ICCCS) 2024-04-19

MOSABench: Multi-Object Sentiment Analysis Benchmark for Evaluating Multimodal Large Language Models Understanding of Complex Image

OPENALEX - Publications

Shezheng Song Chao He Shasha Li Shan Zhao Chengyu Wang and 6 more

Multimodal large language models (MLLMs) have shown remarkable progress in high-level semantic tasks such as visual question answering, image captioning, and emotion recognition. However, despite advancements, there remains a lack of standardized benchmarks for evaluating MLLMs performance multi-object sentiment analysis, key task understanding. To address this gap, we introduce MOSABench, novel evaluation dataset designed specifically analysis. MOSABench includes approximately 1,000 images...

10.48550/arxiv.2412.00060 preprint EN arXiv (Cornell University) 2024-11-25

A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking

OPENALEX - Publications

Shezheng Song Shan Zhao Chengyu Wang Tianwei Yan Shasha Li and 2 more

Multimodal Entity Linking (MEL) aims at linking ambiguous mentions with multimodal information to entity in Knowledge Graph (KG) such as Wikipedia, which plays a key role many applications. However, existing methods suffer from shortcomings, including modality impurity noise raw image and textual representation, puts obstacles MEL. We formulate neural text matching problem where each (text image) is treated query, the model learns mapping query relevant candidate entities. This paper...

10.48550/arxiv.2312.11816 preprint EN cc-by-nc-nd arXiv (Cornell University) 2023-01-01