NFDI4DS | UHH-SEMS - Publication Details

Ye Xiang

ORCID: 0000-0003-1945-7433

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5101831667

Research Areas

Human Pose and Action Recognition
Anomaly Detection Techniques and Applications
Advanced Graph Neural Networks
Video Surveillance and Tracking Methods
Recommender Systems and Techniques
Multimodal Machine Learning Applications
Gait Recognition and Analysis
Face and Expression Recognition
Topic Modeling
Context-Aware Activity Recognition Systems
Advanced Neural Network Applications
Sentiment Analysis and Opinion Mining
Image Retrieval and Classification Techniques
Advanced Image and Video Retrieval Techniques
Advanced Vision and Imaging
Domain Adaptation and Few-Shot Learning
Image and Video Stabilization
Plasmonic and Surface Plasmon Research
Thermal Radiation and Cooling Technologies
Computer Graphics and Visualization Techniques
3D Shape Modeling and Analysis
Semantic Web and Ontologies
Expert finding and Q&A systems
Text and Document Classification Technologies
Fire Detection and Safety Systems

Beijing University of Technology
2021-2024

Beijing University of Posts and Telecommunications
2020-2023

Learning to compose diversified prompts for image emotion classification

OPENALEX - Publications

Sinuo Deng Lifang Wu Ge Shi Lehao Xing Meng Jian and 2 more

Abstract Image emotion classification (IEC) aims to extract the abstract emotions evoked in images. Recently, language-supervised methods such as contrastive language-image pretraining (CLIP) have demonstrated superior performance image understanding. However, underexplored task of IEC presents three major challenges: a tremendous training objective gap between and IEC, shared suboptimal prompts, invariant prompts for all instances. In this study, we propose general framework that...

10.1007/s41095-023-0389-6 article EN cc-by Computational Visual Media 2024-04-26

Learning Label Semantics for Weakly Supervised Group Activity Recognition

OPENALEX - Publications

Lifang Wu Meng Tian Ye Xiang Ke Gu Ge Shi

Weakly supervised group activity recognition deals with the dependence on individual-level annotations during understanding scenes involving multiple individuals, which is a challenging task. Existing methods either take trained detectors to extract individual features or utilize attention mechanisms for partial context encoding, followed by integration form final group-level representations. However, require training phase and have mis-detection issue, contexts extracted immediately from...

10.1109/tmm.2024.3349923 article EN IEEE Transactions on Multimedia 2024-01-01

Learning to Compose Diversified Prompts for Image Emotion Classification

OPENALEX - Publications

Sinuo Deng Lifang Wu Ge Shi Lehao Xing Meng Jian and 2 more

Image Emotion Classification (IEC) aims to extract abstract emotions evoked in images. The language-supervised method has recently shown superior power image understanding, e.g., CLIP. However, the underexplored IEC task three significant challenges: tremendous training objective gap between pre-training and IEC, shared suboptimal invariant prompts for all instances. In this paper, we propose a general framework that shows how CLIP can be effectively exploited on task. We first introduce...

10.2139/ssrn.4279935 article EN SSRN Electronic Journal 2022-01-01

Simple But Powerful, a Language-Supervised Method for Image Emotion Classification

OPENALEX - Publications

Sinuo Deng Lifang Wu Ge Shi Lehao Xing Wenjin Hu and 2 more

Image emotion classification is an important computer vision task to extract emotions from images. The methods for image (IEC) are primarily based on label or distribution as a supervision signal, which neither has enough accessibility nor diversity, limiting the development of IEC research. Inspired by psychology research and recent booming large-scale pretrained language models. We figure out language-supervised paradigm, can cleverly combine features visual drive model gain stronger...

10.1109/taffc.2022.3225049 article EN IEEE Transactions on Affective Computing 2022-11-28

Graph Contrastive Learning on Complementary Embedding for Recommendation

OPENALEX - Publications

Meishan Liu Meng Jian Ge Shi Ye Xiang Lifang Wu

Previous works build interest learning via mining deeply on interactions. However, the interactions come incomplete and insufficient to support modeling, even bringing severe bias into recommendations. To address interaction sparsity consequent challenges, we propose a graph contrastive complementary embedding (GCCE), which introduces negative interests assist positive of for modeling. embed interest, design perturbed convolution by preventing distribution from bias. Since samples are not...

10.1145/3591106.3592222 article EN 2023-06-08

GLOCAL: A self-supervised learning framework for global and local motion estimation

OPENALEX - Publications

Yihao Zheng Kunming Luo Shuaicheng Liu Zun Li Ye Xiang and 3 more

10.1016/j.patrec.2023.12.024 article EN Pattern Recognition Letters 2024-01-05

Active Spatial Positions Based Hierarchical Relation Inference for Group Activity Recognition

OPENALEX - Publications

Lifang Wu Xianglong Lang Ye Xiang Chang Wen Chen Zun Li and 1 more

Group activity recognition aims to recognize behaviors characterized by multiple individuals within a scene. Existing schemes rely on individual relation inference and usually take the as tokens. Essentially they select most relevant region of group from entire image while filtering out irrelevant background noises. However, these require bounding box labeling in both training testing stages. Since have been presented at one scale, multi-scale cannot be combined an effective way. In this...

10.1109/tcsvt.2022.3228731 article EN IEEE Transactions on Circuits and Systems for Video Technology 2022-12-12

Multi-scale motion-based relational reasoning for group activity recognition

OPENALEX - Publications

Yihao Zheng Zhuming Wang Ke Gu Lifang Wu Zun Li and 1 more

10.1016/j.engappai.2024.109570 article EN Engineering Applications of Artificial Intelligence 2024-11-07

Latent label mining for group activity recognition in basketball videos

OPENALEX - Publications

Lifang Wu Zeyu Li Ye Xiang Meng Jian Jialie Shen

Abstract Motion information has been widely exploited for group activity recognition in sports video. However, order to model and extract the various motion between adjacent frames, existing algorithms only use coarse video‐level labels as supervision cues. This may lead ambiguity of extracted features omission changing rules patterns that are also important video recognition. In this paper, a latent label mining strategy basketball videos is proposed. The authors' novel allows them obtain...

10.1049/ipr2.12265 article EN cc-by IET Image Processing 2021-07-23

Multi-Perspective Representation to Part-Based Graph for Group Activity Recognition

OPENALEX - Publications

Lifang Wu Xianglong Lang Ye Xiang Qi Wang Meng Tian

Group activity recognition that infers the of a group people is challenging task and has received great deal interest in recent years. Different from individual action recognition, needs to model not only visual cues individuals but also relationships between them. The existing approaches inferred relations based on holistic features individual. However, parts human body, such as head, hands, legs, their relationships, are critical most activities. In this paper, we establish part-based...

10.3390/s22155521 article EN cc-by Sensors 2022-07-24

GLM-Net

OPENALEX - Publications

Yuchen Yang Ye Xiang Shuaicheng Liu Lifang Wu Boxuan Simen Zhao and 1 more

In this work, we study the problem of separating global camera motion and local dynamic from an optical flow. Previous methods either estimate motions by a parametric model, such as homography, or both them flow field. However, none these can directly through end-to-end manner. addition, two accurately hybrid field is challenging. Because one easily confuse other when they are compounded together. To end, propose estimation network GLM-Net. We design encoder-decoder structures for separation...

10.1145/3474085.3475556 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

Hypersphere anchor loss for K-Nearest neighbors

OPENALEX - Publications

Ye Xiang Zihang He Heng Wang Yong Li

10.1007/s10489-023-05148-5 article EN Applied Intelligence 2023-11-15

Exploring Spatio-Temporal Discriminative Cues for Group Activity Recognition Via Contrastive Learning

OPENALEX - Publications

Meng Tian Ye Xiang Lifang Wu

Group activity recognition is a challenging task that involves multiple moving actors within cluttered scene. Existing methods often rely on object detector to avoid individual bounding box labeling during testing, but are prone false detections due factors such as occlusion and background clutter. In addition, existing detector-free method based Transformer attends attention map too sparse, resulting in the loss of some important foreground information. this paper, we introduce...

10.1109/icassp48485.2024.10448174 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Enhanced amplified spontaneous emission via splitted strong coupling mode in large-area plasmonic cone lattices

OPENALEX - Publications

Jiazhi Yuan Jiang Hu Yan Zheng Hao Wei Jiamin Xiao and 5 more

10.29026/oes.2025.240021 article EN cc-by Opto-Electronic Science 2024-12-20

Part Based Interaction Learning for Group Activity Recognition

OPENALEX - Publications

Meng Tian Xianglong Lang Ye Xiang Yan Huang Lifang Wu and 1 more

Group activity recognition is a subject with broad applications, and its main challenge to model the interactions between individuals. Existing algorithms mostly merely based on holistic features of persons, which completely ignore local details that could be significant for recognition. In this paper, we propose novel part interaction learning algorithm group Our proposed introduces both physical structural information fine-grained contextual into representations, through exploring intraand...

10.1109/mmsp55362.2022.9950045 article EN 2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP) 2022-09-26

Scaling loss: updating gradient of loss for accurate object detection

OPENALEX - Publications

Jiahao Hu Zihang He Ye Xiang Gaoxin Zhang Yong Li

L<sub>1</sub> loss function and Intersection over Union (IoU) are commonly used in object detection. However, minimizing the through training process does not necessarily amount to maximizing IoUs. simply assigns equal weights difference of width, height, center point between a prediction box ground truth but pays less attention contribution each shape property. Observing this, we propose scaling which can be easily embedded convolutional neural networks for mitigating gap IoU function. The...

10.1117/12.2557236 article EN 2020-01-03

Counterfactual Embedding Learning for Debiased Recommendation

OPENALEX - Publications

Meng Jian Jingjing Guo Ye Xiang Lifang Wu

Recently, recommender system suffers extremely from both interaction bias and sparsity. The conventional unified embedding learning policies fail to consider the imbalanced issue produce suboptimal representations of users items for recommendation. Towards end, this work dedicates bias-aware in a decomposed manner proposes Counterfactual Embedding Learning (CEL) debiased Instead debiasing with sampling uniform interactions, we follow capitalize natural distribution model counterfactual...

10.1109/bigmm52142.2021.00019 article EN 2021-11-01

Coming Soon ...