NFDI4DS | UHH-SEMS - Publication Details

Wei Xia

ORCID: 0000-0001-8988-8381

About

Contact & Profiles

OpenAlex author ID: A5067662547

Research Areas

Face and Expression Recognition
Advanced Image and Video Retrieval Techniques
Face recognition and analysis
Video Surveillance and Tracking Methods
Text and Document Classification Technologies
Advanced Computing and Algorithms
Biometric Identification and Security
Domain Adaptation and Few-Shot Learning
Advanced Graph Neural Networks
Anomaly Detection Techniques and Applications
Advanced Neural Network Applications
Tensor decomposition and applications
Software Engineering Research
Software Reliability and Analysis Research
Complex Network Analysis Techniques
Advanced Clustering Algorithms Research
Generative Adversarial Networks and Image Synthesis
Natural Language Processing Techniques
Software Engineering Techniques and Practices
Human Pose and Action Recognition
Remote-Sensing Image Classification
Machine Learning and Data Classification
Sparse and Compressive Sensing Techniques
Topic Modeling
Adversarial Robustness in Machine Learning

Xidian University
2019-2024

Harbin University of Science and Technology
2024

University of Jinan
2023

Anhui Jianzhu University
2022

Seattle University
2022

Amazon (United States)
2020-2022

Institute of Computing Technology
2021-2022

Hunan University of Technology
2022

Amazon (Germany)
2020-2021

Hefei University of Technology
2020-2021

Learning Self-Consistency for Deepfake Detection

OPENALEX - Publications

Tianchen Zhao Xiang Xu Mingze Xu Hui Ding Yuanjun Xiong and 1 more

We propose a new method to detect deepfake images using the cue of source feature inconsistency within forged images. It is based on the hypothesis that images' distinct source features can be preserved and extracted after going through state-of-the-art deepfake generation processes. We introduce a novel representation learning approach, called pair-wise self-consistency learning (PCL), for training ConvNets to extract these source features and detect deepfake images. It is accompanied by a new image synthesis approach, called inconsistency image generator (I2G), to provide richly annotated training data for PCL. Experimental results...

10.1109/iccv48922.2021.01475 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Tensor-SVD Based Graph Learning for Multi-View Subspace Clustering

OPENALEX - Publications

Quanxue Gao Wei Xia Zhizhen Wan Deyan Xie Pu Zhang

Low-rank representation based on tensor-Singular Value Decomposition (t-SVD) has achieved impressive results for multi-view subspace clustering, but it does not deal well with noise and illumination changes embedded in multi-view data. The major reason is that all the singular values have the same contribution in the tensor-nuclear norm based on t-SVD, which does not make sense in the existence of noise and illumination change. To improve the robustness and clustering performance, we study the weighted tensor-nuclear norm based on t-SVD and develop an efficient algorithm to optimize the weighted tensor-nuclear norm minimization (WTNNM)...

10.1609/aaai.v34i04.5807 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Tensorized Bipartite Graph Learning for Multi-View Clustering

OPENALEX - Publications

Wei Xia Quanxue Gao Qianqian Wang Xinbo Gao Chris Ding and 1 more

Despite the impressive clustering performance and efficiency in characterizing both the relationship between the data and the cluster structure, most existing graph-based multi-view clustering methods still have the following drawbacks. They suffer from an expensive time burden due to both the construction of graphs and the eigen-decomposition of the Laplacian matrix. Moreover, none of them simultaneously considers the similarity of inter-view and the similarity of intra-view. In this article, we propose a variance-based de-correlation anchor selection strategy for bipartite...

10.1109/tpami.2022.3187976 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-07-04

MeMOT: Multi-Object Tracking with Memory

OPENALEX - Publications

Jiarui Cai Mingze Xu Wei Li Yuanjun Xiong Wei Xia and 2 more

We propose an online tracking algorithm that performs object detection and data association under a common framework, capable of linking objects after a long time span. This is realized by preserving a large spatio-temporal memory to store the identity embeddings of the tracked objects, and by adaptively referencing and aggregating useful information from the memory as needed. Our model, called MeMOT, consists of three main modules that are all Transformer-based: 1) Hypothesis Generation that produces object proposals in the current video frame; 2)...

10.1109/cvpr52688.2022.00792 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Enhanced Tensor RPCA and its Application

OPENALEX - Publications

Quanxue Gao Pu Zhang Wei Xia Deyan Xie Xinbo Gao and 1 more

Despite the promising results, tensor robust principal component analysis (TRPCA), which aims to recover the underlying low-rank structure of clean tensor data corrupted with noise/outliers by shrinking all singular values equally, cannot well preserve the salient content of an image. The major reason is that, in real applications, there is a salient difference in information between all singular values of a tensor image, and the larger singular values are generally associated with some salient parts in the image. Thus, the singular values should be treated differently. Inspired by this observation, we investigate whether...

10.1109/tpami.2020.3017672 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2020-08-18

Segmenting Objects in Day and Night: Edge-Conditioned CNN for Thermal Image Semantic Segmentation

OPENALEX - Publications

Chenglong Li Wei Xia Yan Yan Bin Luo Jin Tang

Despite much research progress in image semantic segmentation, it remains challenging under adverse environmental conditions caused by imaging limitations of the visible spectrum, while thermal infrared cameras have several advantages over cameras of the visible spectrum, such as operating in total darkness, insensitivity to illumination variations, robustness to shadow effects, and a strong ability to penetrate haze and smog. These advantages make it possible to segment semantic objects in day and night. In this article, we propose a novel network architecture, called...

10.1109/tnnls.2020.3009373 article EN IEEE Transactions on Neural Networks and Learning Systems 2020-07-29

Self-Supervised Graph Convolutional Network for Multi-View Clustering

OPENALEX - Publications

Wei Xia Qianqian Wang Quanxue Gao Xiangdong Zhang Xinbo Gao

Despite the promising preliminary results, existing graph convolutional network (GCN) based multi-view learning methods directly use the graph structure as the view descriptor, which may inhibit the ability of multi-view learning for multimedia data. The major reason is that, in real multimedia applications, the graph structure may contain outliers. Moreover, they fail to take advantage of the information embedded in the inaccurate clustering labels obtained from their proposed methods, resulting in inferior clustering results. These observations motivate us to study whether there is a better...

10.1109/tmm.2021.3094296 article EN IEEE Transactions on Multimedia 2021-07-02

Multiview Subspace Clustering by an Enhanced Tensor Nuclear Norm

OPENALEX - Publications

Wei Xia Xiangdong Zhang Quanxue Gao Xiaochuang Shu Jungong Han and 1 more

Despite the promising preliminary results, tensor-singular value decomposition (t-SVD)-based multiview subspace clustering is incapable of dealing with real problems, such as noise and illumination changes. The major reason is that the tensor-nuclear norm minimization (TNNM) used in t-SVD regularizes each singular value equally, which does not make sense in matrix completion and coefficient matrix learning. In this case, the singular values represent different perspectives and should be treated differently. To well exploit the significant difference...

10.1109/tcyb.2021.3052352 article EN IEEE Transactions on Cybernetics 2021-02-26

Tensor Completion-Based Incomplete Multiview Clustering

OPENALEX - Publications

Wei Xia Quanxue Gao Qianqian Wang Xinbo Gao

Incomplete multiview clustering is a challenging problem in the domain of unsupervised learning. However, the existing incomplete multiview clustering methods only consider the similarity structure of intraview while neglecting the similarity structure of interview. Thus, they cannot take advantage of both the complementary information and the spatial structure embedded in the similarity matrices of different views. To this end, we complete the incomplete graph with missing data by referring to tensor completion and present a novel and effective model to handle the incomplete multiview clustering task. To be specific, we consider the similarity of the interview graphs via the Schatten p-norm-based...

10.1109/tcyb.2021.3140068 article EN IEEE Transactions on Cybernetics 2022-01-25

Multi-view graph embedding clustering network: Joint self-supervision and block diagonal representation

OPENALEX - Publications

Wei Xia Sen Wang Ming Yang Quanxue Gao Jungong Han and 1 more

10.1016/j.neunet.2021.10.006 article EN Neural Networks 2021-10-25

Adversarial Multiview Clustering Networks With Adaptive Fusion

OPENALEX - Publications

Qianqian Wang Zhiqiang Tao Wei Xia Quanxue Gao Xiaochun Cao and 1 more

The existing deep multiview clustering (MVC) methods are mainly based on autoencoder networks, which seek common latent variables to reconstruct the original input of each view individually. However, due to the view-specific reconstruction loss, it is challenging to extract consistent latent representations over multiple views for clustering. To address this challenge, we propose adversarial MVC (AMvC) networks in this article. The proposed AMvC generates each view's samples conditioning on the fused latent representations among different views to encourage a...

10.1109/tnnls.2022.3145048 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-02-03

Multiview Spectral Clustering With Bipartite Graph

OPENALEX - Publications

Haizhou Yang Quanxue Gao Wei Xia Ming Yang Xinbo Gao

Multi-view spectral clustering has become appealing due to its good performance in capturing the correlations among all views. However, on one hand, many existing methods usually require quadratic or cubic complexity for graph construction or eigenvalue decomposition of the Laplacian matrix; on the other hand, they are inefficient and impose an unbearable burden when applied to large-scale data sets, which can be easily obtained in the era of big data. Moreover, the existing methods cannot encode the complementary information between adjacency matrices, i.e....

10.1109/tip.2022.3171411 article EN IEEE Transactions on Image Processing 2022-01-01

Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering

OPENALEX - Publications

Wei Xia Tianxiu Wang Quanxue Gao Ming Yang Xinbo Gao

Multi-modal clustering (MMC) aims to explore complementary information from diverse modalities to improve clustering performance. This article studies challenging problems in MMC methods based on deep neural networks. On one hand, most existing methods lack a unified objective to simultaneously learn the inter- and intra-modality consistency, resulting in a limited representation learning capacity. On the other hand, most existing processes are modeled for a finite sample set and cannot handle out-of-sample data. To handle the above two challenges, we propose...

10.1109/tip.2023.3240863 article EN IEEE Transactions on Image Processing 2023-01-01

Novel architecture for long short-term memory used in question classification

OPENALEX - Publications

Wei Xia Wen Zhu Bo Liao Min Chen Lijun Cai and 1 more

10.1016/j.neucom.2018.03.020 article EN Neurocomputing 2018-03-18

Low-rank tensor constrained co-regularized multi-view spectral clustering

OPENALEX - Publications

Huiling Xu Xiangdong Zhang Wei Xia Quanxue Gao Xinbo Gao

10.1016/j.neunet.2020.08.019 article EN Neural Networks 2020-09-06

Towards Backward-Compatible Representation Learning

OPENALEX - Publications

Yantao Shen Yuanjun Xiong Wei Xia Stefano Soatto

We propose a way to learn visual features that are compatible with previously computed ones even when they have different dimensions and are learned via different neural network architectures and loss functions. Compatible means that, if such features are used to compare images, then "new" features can be compared directly to "old" features, so they can be used interchangeably. This enables visual search systems to bypass computing new features for all previously seen images when updating the embedding models, a process known as backfilling. Backward compatibility is critical to quickly...

10.1109/cvpr42600.2020.00640 preprint EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Long Short-Term Transformer for Online Action Detection

OPENALEX - Publications

Mingze Xu Yuanjun Xiong Hao Chen Xinyu Li Wei Xia and 2 more

We present Long Short-term TRansformer (LSTR), a temporal modeling algorithm for online action detection, which employs a long- and short-term memory mechanism to model prolonged sequence data. It consists of an LSTR encoder that dynamically leverages coarse-scale historical information from an extended temporal window (e.g., 2048 frames spanning up to 8 minutes), together with an LSTR decoder that focuses on a short time window (e.g., 32 frames spanning 8 seconds) to model the fine-scale characteristics of the data. Compared to prior work, LSTR provides an effective and efficient method...

10.48550/arxiv.2107.03377 preprint EN cc-by arXiv (Cornell University) 2021-01-01

A Performance Evaluation of Classic Convolutional Neural Networks for 2D and 3D Palmprint and Palm Vein Recognition

OPENALEX - Publications

Wei Jia Jian Gao Wei Xia Yang Zhao Hai Min and 1 more

Palmprint recognition and palm vein recognition are two emerging biometrics technologies. In the past decades, many traditional methods have been proposed for palmprint recognition and palm vein recognition, and have achieved impressive results. However, the research on deep learning-based palmprint recognition and palm vein recognition is still very preliminary. In this paper, in order to investigate the problem of deep learning-based 2D and 3D palmprint recognition and palm vein recognition in depth, we conduct a performance evaluation of seventeen representative and classic convolutional neural networks (CNNs) on one 3D palmprint database, five 2D palmprint databases...

10.1007/s11633-020-1257-9 article EN cc-by International Journal of Automation and Computing 2020-12-29

A survey on dorsal hand vein biometrics

OPENALEX - Publications

Wei Jia Wei Xia Bob Zhang Yang Zhao Lunke Fei and 3 more

10.1016/j.patcog.2021.108122 article EN Pattern Recognition 2021-06-28

Tensorized Label Learning on Anchor Graph

OPENALEX - Publications

Jing Li Quanxue Gao Qianqian Wang Wei Xia

Graph-based multimedia data clustering has attracted much attention due to its impressive clustering performance for arbitrarily shaped multimedia data. However, existing graph-based clustering methods need post-processing to get labels for multimedia data, with high computational complexity. Moreover, it is sub-optimal for label learning due to the fact that they exploit the complementary information embedded in data with different types pixel by pixel. To handle these problems, we present a novel label learning model with good interpretability for clustering. To be specific, our model decomposes anchor...

10.1609/aaai.v38i12.29257 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

OPENALEX - Publications

Wei Li Yuanjun Xiong Shuo Yang Mingze Xu Yongxin Wang and 1 more

Online tracking of multiple objects in videos requires strong capacity for modeling and matching object appearances. Previous methods for learning appearance embedding mostly rely on instance-level matching without considering the temporal continuity provided by videos. We design a new instance-to-track matching objective to learn appearance embedding that compares a candidate detection to the embedding of the tracks persisted in the tracker. It enables us to learn not only from videos labeled with complete tracks, but also from unlabeled or partially labeled videos. We implement this learning objective in a unified form...

10.48550/arxiv.2107.02396 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Graph embedding clustering: Graph attention auto-encoder with cluster-specificity distribution

OPENALEX - Publications

Huiling Xu Wei Xia Quanxue Gao Jungong Han Xinbo Gao

10.1016/j.neunet.2021.05.008 article EN Neural Networks 2021-05-08

Learning Hierarchical Graph Neural Networks for Image Clustering

OPENALEX - Publications

Yifan Xing Tong He Tianjun Xiao Yongxin Wang Yuanjun Xiong and 4 more

We propose a hierarchical graph neural network (GNN) model that learns how to cluster a set of images into an unknown number of identities using a training set of images annotated with labels belonging to a disjoint set of identities. Our hierarchical GNN uses a novel approach to merge connected components predicted at each level of the hierarchy to form a new graph at the next level. Unlike fully unsupervised hierarchical clustering, the choice of grouping and complexity criteria stems naturally from supervision in the training set. The resulting method, Hi-LANDER, achieves an average of 49%...

10.1109/iccv48922.2021.00345 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Self-Consistent Contrastive Attributed Graph Clustering With Pseudo-Label Prompt

OPENALEX - Publications

Wei Xia Qianqian Wang Quanxue Gao Ming Yang Xinbo Gao

Attributed graph clustering, which learns node representation from node attributes and the topological graph for clustering, is a fundamental and challenging task for multimedia network-structured data analysis. Recently, graph contrastive learning (GCL)-based methods have obtained impressive clustering performance on this task. Nevertheless, there still remain some limitations to be solved: 1) most existing methods fail to consider the self-consistency between latent representations and cluster structures; and 2) most existing methods require a post-processing operation to get...

10.1109/tmm.2022.3213208 article EN IEEE Transactions on Multimedia 2022-10-10
