Wenjun Zeng

ORCID: 0000-0003-2531-3137
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Video Surveillance and Tracking Methods
  • Human Pose and Action Recognition
  • Advanced Vision and Imaging
  • Advanced Steganography and Watermarking Techniques
  • Advanced Data Compression Techniques
  • Advanced Neural Network Applications
  • Advanced Image and Video Retrieval Techniques
  • Video Coding and Compression Technologies
  • Domain Adaptation and Few-Shot Learning
  • Multimodal Machine Learning Applications
  • Gait Recognition and Analysis
  • Chaos-based Image/Signal Encryption
  • Digital Media Forensic Detection
  • Image and Signal Denoising Methods
  • Advanced Image Processing Techniques
  • Image and Video Quality Assessment
  • Anomaly Detection Techniques and Applications
  • Face recognition and analysis
  • Hand Gesture Recognition Systems
  • Video Analysis and Summarization
  • Generative Adversarial Networks and Image Synthesis
  • Peer-to-Peer Network Technologies
  • Network Traffic and Congestion Control
  • Image Enhancement Techniques
  • Multimedia Communication and Technology

China National GeneBank
2020-2025

Princeton University
1994-2025

East China Normal University
2024-2025

Botswana Geoscience Institute
2025

Hunan University of Science and Technology
2025

Beibu Gulf University
2025

Nanchang University
2025

Nanchang Hangkong University
2025

Eastern Institute of Technology, Ningbo
2023-2024

Shanghai Jiao Tong University
2024

In this paper, we present a comprehensive study and evaluation of existing single image dehazing algorithms, using new large-scale benchmark consisting both synthetic real-world hazy images, called REalistic Single Image DEhazing (RESIDE). RESIDE highlights diverse data sources contents, is divided into five subsets, each serving different training or purposes. We further provide rich variety criteria for algorithm evaluation, ranging from full-reference metrics, to no-reference subjective...

10.1109/tip.2018.2867951 article EN publisher-specific-oa IEEE Transactions on Image Processing 2018-08-30

The huge success of the Internet allows for transmission, wide distribution, and access electronic data in an effortless manner. Content providers are faced with challenge how to protect their data. This problem has generated a flurry research activity area digital watermarking content copyright protection. here is introduce watermark that does not alter perceived quality content, while being extremely robust attack. For instance, case image data, editing picture or illegal tampering should...

10.1109/49.668975 article EN IEEE Journal on Selected Areas in Communications 1998-05-01

Skeleton based action recognition distinguishes human actions using the trajectories of skeleton joints, which provide a very good representation for describing actions. Considering that recurrent neural networks (RNNs) with Long Short-Term Memory (LSTM) can learn feature representations and model long-term temporal dependencies automatically, we propose an end-to-end fully connected deep LSTM network recognition. Inspired by observation co-occurrences joints intrinsically characterize...

10.1609/aaai.v30i1.10451 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2016-03-05

Human action recognition is an important task in computer vision. Extracting discriminative spatial and temporal features to model the evolutions of different actions plays a key role accomplishing this task. In work, we propose end-to-end attention for human from skeleton data. We build our on top Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM), which learns selectively focus joints within each frame inputs pays levels outputs frames. Furthermore, ensure effective...

10.1609/aaai.v31i1.11212 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2017-02-12

Observing that Semantic features learned in an image classification task and Appearance a similarity matching complement each other, we build twofold Siamese network, named SA-Siam, for real-time object tracking. SA-Siam is composed of semantic branch appearance branch. Each similaritylearning network. An important design choice to separately train the two branches keep heterogeneity types features. In addition, propose channel attention mechanism Channel-wise weights are computed according...

10.1109/cvpr.2018.00508 preprint EN 2018-06-01

Skeleton-based human action recognition has recently attracted increasing attention due to the popularity of 3D skeleton data. One main challenge lies in large view variations captured actions. We propose a novel adaptation scheme automatically regulate observation viewpoints during occurrence an action. Rather than re-positioning skeletons based on defined prior criterion, we design adaptive recurrent neural network (RNN) with LSTM architecture, which enables itself adapt most suitable from...

10.1109/iccv.2017.233 article EN 2017-10-01

For person re-identification (re-id), attention mechanisms have become attractive as they aim at strengthening discriminative features and suppressing irrelevant ones, which matches well the key of re-id, i.e., feature learning. Previous approaches typically learn using local convolutions, ignoring mining knowledge from global structure patterns. Intuitively, affinities among spatial positions/nodes in map provide clustering-like information are helpful for inferring semantics thus...

10.1109/cvpr42600.2020.00325 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Machine learning systems generally assume that the training and testing distributions are same. To this end, a key requirement is to develop models can generalize unseen distributions. Domain generalization (DG), i.e., out-of-distribution generalization, has attracted increasing interests in recent years. deals with challenging setting where one or several different but related domain(s) given, goal learn model an test domain. Great progress been made area of domain for This paper presents...

10.1109/tkde.2022.3178128 article EN IEEE Transactions on Knowledge and Data Engineering 2022-01-01

Skeleton-based human action recognition has attracted great interest thanks to the easy accessibility of skeleton data. Recently, there is a trend using very deep feedforward neural networks model 3D coordinates joints without considering computational efficiency. In this paper, we propose simple yet effective semantics-guided network (SGN) for skeleton-based recognition. We explicitly introduce high level semantics (joint type and frame index) into enhance feature representation capability....

10.1109/cvpr42600.2020.00119 preprint EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Human action recognition is an important task in computer vision. Extracting discriminative spatial and temporal features to model the evolutions of different actions plays a key role accomplishing this task. In work, we propose end-to-end attention for human from skeleton data. We build our on top Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM), which learns selectively focus joints within each frame inputs pays levels outputs frames. Furthermore, ensure effective...

10.48550/arxiv.1611.06067 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Skeleton-based human action recognition has recently attracted increasing attention thanks to the accessibility and popularity of 3D skeleton data. One key challenges in lies large variations representations when they are captured from different viewpoints. In order alleviate effects view variations, this paper introduces a novel adaptation scheme, which automatically determines virtual observation viewpoints over course an learning based data driven manner. Instead re-positioning skeletons...

10.1109/tpami.2019.2896631 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2019-01-31

Abstract With the application and development of high-throughput sequencing technology in life health sciences, massive multi-omics data brings problem efficient management utilization. Database biocuration are prerequisites for reuse these big data. Here, relying on China National GeneBank (CNGB), we present CNGB Sequence Archive (CNSA) archiving omics data, including raw its further analyzed results which organized into six objects, namely Project, Sample, Experiment, Run, Assembly...

10.1093/database/baaa055 article EN cc-by Database 2020-01-01

Existing fully-supervised person re-identification (ReID) methods usually suffer from poor generalization capability caused by domain gaps. The key to solving this problem lies in filtering out identity-irrelevant interference and learning domain-invariant representations. In paper, we aim design a generalizable ReID framework which trains model on source domains yet is able generalize/perform well target domains. To achieve goal, propose simple effective Style Normalization Restitution...

10.1109/cvpr42600.2020.00321 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

We propose a densely semantically aligned person re-identification (re-ID) framework. It fundamentally addresses the body misalignment problem caused by pose/viewpoint variations, imperfect detection, occlusion, etc.. By leveraging estimation of dense semantics image, we construct set part images (DSAP-images), where same spatial positions have across different images. design two-stream network that consists main full image stream (MF-Stream) and semantically-aligned guiding (DSAG-Stream)....

10.1109/cvpr.2019.00076 preprint EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

The greatest challenge facing visual object tracking is the simultaneous requirements on robustness and discrimination power. In this paper, we propose a SiamFC-based tracker, named SPM-Tracker, to tackle challenge. basic idea address two in separate matching stages. Robustness strengthened coarse (CM) stage through generalized training while power enhanced fine (FM) distance learning network. stages are connected series as input proposals of FM generated by CM stage. They also parallel...

10.1109/cvpr.2019.00376 preprint EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Time-frequency (T-F) domain masking is a mainstream approach for single-channel speech enhancement. Recently, focuses have been put to phase prediction in addition amplitude prediction. In this paper, we propose phase-and-harmonics-aware deep neural network (DNN), named PHASEN, task. Unlike previous methods which directly use complex ideal ratio mask supervise the DNN learning, design two-stream network, where stream and are dedicated We discover that two streams should communicate with each...

10.1609/aaai.v34i05.6489 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Human actions in videos are three-dimensional (3D) signals. Recent attempts use 3D convolutional neural networks (CNNs) to explore spatio-temporal information for human action recognition. Though promising, CNNs have not achieved high performance on this task with respect their well-established two-dimensional (2D) counterparts visual recognition still images. We argue that the training complexity of fusion and huge memory cost convolution hinder current CNNs, which stack convolutions layer...

10.1109/cvpr.2018.00054 article EN 2018-06-01

Skeleton based action recognition distinguishes human actions using the trajectories of skeleton joints, which provide a very good representation for describing actions. Considering that recurrent neural networks (RNNs) with Long Short-Term Memory (LSTM) can learn feature representations and model long-term temporal dependencies automatically, we propose an end-to-end fully connected deep LSTM network recognition. Inspired by observation co-occurrences joints intrinsically characterize...

10.48550/arxiv.1603.07772 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Human action analytics has attracted a lot of attention for decades in computer vision. It is important to extract discriminative spatio-temporal features model the spatial and temporal evolutions different actions. In this paper, we propose explore human recognition detection from skeleton data. We build our networks based on recurrent neural with long short-term memory units. The learned capable selectively focusing joints skeletons within each input frame paying levels outputs frames. To...

10.1109/tip.2018.2818328 article EN IEEE Transactions on Image Processing 2018-03-22

We present an approach to recover absolute 3D human poses from multi-view images by incorporating geometric priors in our model. It consists of two separate steps: (1) estimating the 2D and (2) recovering poses. First, we introduce a cross-view fusion scheme into CNN jointly estimate for multiple views. Consequently, pose estimation each view already benefits other Second, recursive Pictorial Structure Model gradually improves accuracy with affordable computational cost. test method on...

10.1109/iccv.2019.00444 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

We consider the tracking problem as a special type of object detection problem, which we call instance detection. With proper initialization, detector can be quickly converted into tracker by learning new from single image. find that model-agnostic meta-learning (MAML) offers strategy to initialize satisfies our needs. propose principled three-step approach build high-performance tracker. First, pick any modern trained with gradient descent. Second, conduct offline training (or...

10.1109/cvpr42600.2020.00632 preprint EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

We propose a pseudo-sequence-based scheme for light field image compression. In our scheme, the raw captured by camera is decomposed into multiple views according to lenslet array of that camera. These constitute pseudo sequence like video, and redundancy between exploited video encoder. The specific coding order views, prediction structure, rate allocation have been investigated encoding sequence. Experimental results show superior performance which achieves as high 6.6 dB gain compared...

10.1109/icmew.2016.7574674 article EN 2016-07-01
Coming Soon ...