NFDI4DS | UHH-SEMS - Publication Details

Personalized Hashtag Recommendation for Micro-videos

OPENALEX - Publications

Yinwei Wei Zhiyong Cheng Xuzheng Yu Zhou Zhao Lei Zhu and 1 more

Personalized hashtag recommendation methods aim to suggest users hashtags annotate, categorize, and describe their posts. The hashtags, that a user provides post (e.g., micro-video), are the ones which in her mind can well content where she is interested in. It means we should consider both users' preferences on contents personal understanding hashtags. Most existing rely modeling either interactions between posts or for recommendation. These have not explored complicated among users,...

10.1145/3343031.3350858 article EN Proceedings of the 30th ACM International Conference on Multimedia 2019-10-15

Personalized Item Recommendation for Second-hand Trading Platform

OPENALEX - Publications

Xuzheng Yu Tian Gan Yinwei Wei Zhiyong Cheng Liqiang Nie

With rising awareness of environment protection and recycling, second-hand trading platforms have attracted increasing attention in recent years. The interaction data on platforms, consisting sufficient interactions per user but rare item, is different from what they are traditional platforms. Therefore, building successful recommendation systems the requires balancing modeling items? users? preference, mitigating adverse effects sparsity, which makes especially challenging. Accordingly, we...

10.1145/3394171.3413640 article EN Proceedings of the 30th ACM International Conference on Multimedia 2020-10-12

SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval

OPENALEX - Publications

Xuzheng Yu Chen Jiang Xingning Dong Tian Gan Ming Yang and 1 more

10.1109/tcsvt.2025.3543840 article EN IEEE Transactions on Circuits and Systems for Video Technology 2025-01-01

Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning

OPENALEX - Publications

Chen Jiang Hong Liu Xuzheng Yu Qing Wang Yuan Cheng and 6 more

In recent years, the explosion of web videos makes text-video retrieval increasingly essential and popular for video filtering, recommendation, search. Text-video aims to rank relevant text/video higher than irrelevant ones. The core this task is precisely measure cross-modal similarity between texts videos. Recently, contrastive learning methods have shown promising results retrieval, most which focus on construction positive negative pairs learn text representations. Nevertheless, they do...

10.1145/3581783.3612006 preprint EN 2023-10-26

Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition

OPENALEX - Publications

Xuzheng Yu Chen Jiang Wei Zhang Tian Gan Linlin Chao and 4 more

With the explosive growth of video data in real-world applications, a comprehensive representation videos becomes increasingly important. In this paper, we address problem scene recognition, whose goal is to learn high-level classify scenes videos. Due diversity and complexity contents realistic scenarios, task remains challenge. Most existing works identify for only from visual or textual information temporal perspective, ignoring valuable hidden single frames, while several earlier studies...

10.48550/arxiv.2401.04354 preprint EN other-oa arXiv (Cornell University) 2024-01-01

M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval

OPENALEX - Publications

Xingning Dong Zipeng Feng Chunluan Zhou Xuzheng Yu Ming Yang and 1 more

We present a Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards effective and efficient zero-shot video-text retrieval, dubbed M2-RAAP. Upon popular image-text models like CLIP, most current adaptation-based pre-training methods are confronted by three major issues, i.e., noisy data corpus, time-consuming pre-training, limited performance gain. Towards this end, we conduct comprehensive study including four critical steps in pre-training. Specifically, investigate 1)...

10.48550/arxiv.2401.17797 preprint EN arXiv (Cornell University) 2024-01-31

SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval

OPENALEX - Publications

Xuzheng Yu Chen Jiang Xingning Dong Tian Gan Ming Yang and 1 more

The user base of short video apps has experienced unprecedented growth in recent years, resulting a significant demand for content analysis. In particular, text-video retrieval, which aims to find the top matching videos given text descriptions from vast corpus, is an essential function, primary challenge bridge modality gap. Nevertheless, most existing approaches treat texts merely as discrete tokens and neglect their syntax structures. Moreover, abundant spatial temporal clues are often...

10.48550/arxiv.2404.14066 preprint EN arXiv (Cornell University) 2024-04-22

M 2 -RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval

OPENALEX - Publications

Xingning Dong Zipeng Feng Chunluan Zhou Xuzheng Yu Ming Yang and 1 more

10.1145/3626772.3657833 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2024-07-10

Personalized Hashtag Recommendation for Micro-videos

OPENALEX - Publications

Yinwei Wei Zhiyong Cheng Xuzheng Yu Zhou Zhao Lei Zhu and 1 more

Personalized hashtag recommendation methods aim to suggest users hashtags annotate, categorize, and describe their posts. The hashtags, that a user provides post (e.g., micro-video), are the ones which in her mind can well content where she is interested in. It means we should consider both users' preferences on contents personal understanding hashtags. Most existing rely modeling either interactions between posts or for recommendation. These have not explored complicated among users,...

10.48550/arxiv.1908.09987 preprint EN other-oa arXiv (Cornell University) 2019-01-01