NFDI4DS | UHH-SEMS - Publication Details

Shengwu Xiong

ORCID: 0000-0002-3836-0664

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5101598103

Research Areas

Image Enhancement Techniques
Advanced Vision and Imaging
Generative Adversarial Networks and Image Synthesis
Face recognition and analysis
Image and Signal Denoising Methods
Robotics and Sensor-Based Localization
Advanced Image Fusion Techniques
Autonomous Vehicle Technology and Safety
Human Pose and Action Recognition
Advanced Neural Network Applications
Speech and Audio Processing
Video Surveillance and Tracking Methods
Topic Modeling
Speech Recognition and Synthesis
Natural Language Processing Techniques
Face and Expression Recognition
Video Analysis and Summarization
Image Processing Techniques and Applications
Adhesion, Friction, and Surface Interactions
Surface Roughness and Optical Measurements
Enhanced Oil Recovery Techniques
Computational and Text Analysis Methods
Computer Graphics and Visualization Techniques
Biometric Identification and Security
Precipitation Measurement and Analysis

Wuhan College
2025

Sanya University
2021-2025

Wuhan University of Technology
2011-2025

Shanghai Artificial Intelligence Laboratory
2023

Beijing Academy of Artificial Intelligence
2023

Institute of Porous Flow and Fluid Mechanics
2019

University of Chinese Academy of Sciences
2019

Research Institute of Petroleum Exploration and Development
2019

Learned active contours via transformer-based deep convolutional neural network using canny edge detection algorithm

OPENALEX - Publications

Johnas Omanwa Maranga Justine John Nnko Shengwu Xiong

10.1007/s11760-024-03795-w article EN Signal Image and Video Processing 2025-01-17

Multiscale Digital Porous Rock Reconstruction Using Template Matching

OPENALEX - Publications

Wei Lin Xiangwen Li Zhengming Yang Michael Manga Xiaojing Fu and 8 more

Abstract Rocks are heterogeneous multiscale porous media: two rock samples with identical bulk properties can vary widely in microstructure. The advent of digital technology and modern 3‐D printing provides new opportunities to replicate rocks. However, the inherent trade‐off between imaging resolution sample size limits scales over which microstructure macrostructure be identified related each other. Here, we develop a construction strategy by combining X‐ray computed microtomography...

10.1029/2019wr025219 article EN Water Resources Research 2019-07-30

A Spatial and Semantic Alignment Fusion Network for Sea-Land Port Segmentation

OPENALEX - Publications

Bo Zhang Yaxiong Chen Weiqin Dang Shengwu Xiong Xiaoqiang Lu

10.1109/jstars.2025.3544317 article EN cc-by IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2025-01-01

Enhancing Zero-Shot Relation Extraction through Staged Interaction with Large Language Models

OPENALEX - Publications

Yifang Zhang Pengfei Duan Yiwen Yang Shengwu Xiong

10.1109/icassp49660.2025.10887575 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

Face Anti-Spoofing Based on Dynamic Color Texture Analysis Using Local Directional Number Pattern

OPENALEX - Publications

Junwei Zhou Ke Shu Peng Liu Jianwen Xiang Shengwu Xiong

Face anti-spoofing is becoming increasingly indispensable for face recognition systems, which are vulnerable to various spoofing attacks performed using fake photos and videos. In this paper, a novel "LDN-TOP representation followed by ProCRC classification" pipeline proposed. We use local directional number pattern (LDN) with the derivative-Gaussian mask capture detailed appearance information resisting illumination variations noises, can influence texture distribution. To further motion...

10.1109/icpr48806.2021.9412323 article EN 2022 26th International Conference on Pattern Recognition (ICPR) 2021-01-10

CF-VTON: Multi-Pose Virtual Try-on with Cross-Domain Fusion

OPENALEX - Publications

Chenghu Du Shengwu Xiong

The multi-pose virtual try-on technology aims to seamlessly fit an in-shop garment onto a reference person in various poses. This has attracted considerable attention from researchers due its potential commercial and practical applications. Previous works this field have encountered issues such as unnatural alignment difficulty preserving the person's identity, arising weak mapping relationship between different feature crosses. To address these challenges, paper proposes novel network named...

10.1109/icassp49357.2023.10095176 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Accurate and Robust Stereo Direct Visual Odometry for Agricultural Environment

OPENALEX - Publications

Yu Tao Junwei Zhou Liangliang Wang Shengwu Xiong

Vision-based localization and mapping in the agricultural environment is challenging due to unstructured scene with unstable features, illumination variations, bumpy roads, dynamic environmental objects. To address these challenges, we propose an accurate robust stereo direct visual odometry system modifications on Stereo-DSO. We firstly select some well-matched static points latest keyframe improve accuracy of inverse depth calculation for tracking. The can further distinguish close objects...

10.1109/icra48506.2021.9561074 article EN 2021-05-30

Reversible data-hiding exploiting huffman encoding in dual image using weighted matrix and generalized exploiting modification direction (GEMD)

OPENALEX - Publications

Nada Hussien Abd El Salam Shengwu Xiong Xuan Liu

10.1007/s00371-023-03058-8 article EN The Visual Computer 2023-10-05

DLFusion: Painting-Depth Augmenting-LiDAR for Multimodal Fusion 3D Object Detection

OPENALEX - Publications

Junyin Wang Chenghu Du Hui Li Shengwu Xiong

Surround-view cameras combined with image depth transformation to 3D feature space and fusion point cloud features are highly regarded. The of 2D into by means predefined sampling points distribution happens throughout the scene, this process generates a large number redundant features. In addition, multimodal unified in often previous step downstream task, ignoring interactive between different scales. To end, we design new framework, focusing on that can give geometric perception...

10.1145/3581783.3612344 article EN 2023-10-26

Residual Deformable Convolution for better image de-weathering

OPENALEX - Publications

Huikai Liu Ao Zhang Wenqian Zhu Bin Fu Bingjian Ding and 1 more

10.1016/j.patcog.2023.110093 article EN Pattern Recognition 2023-11-04

IFNET: Integrating Data Augmentation and Decoupled Attention Fusion for 3D Object Detection

OPENALEX - Publications

Zhenchang Xia Guanqun Zheng Shengwu Xiong Jia Wu Junyin Wang and 1 more

LiDAR is a key sensor for accurately sensing of the environment in autonomous driving. While existing 3D object detection methods generally rely on data augmentation and feature fusion to improve performance, challenge dealing with sample imbalance often overlooked. We design novel network, IFNet, that tackles these issues by introducing mutually reinforcing enhancement strategies. It aims achieve dual purpose: 1) correcting category directly enhancing pedestrian samples using mixed...

10.1109/icassp48485.2024.10446519 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Short Utterance Speaker Recognition Based on Speech High Frequency Information Compensation and Dynamic Feature Enhancement Methods

OPENALEX - Publications

Yunfei Zi Shengwu Xiong

This work aims to further compensate for the weaknesses of feature sparsity and insufficient discriminative acoustic features in existing short-duration speaker recognition. To address this issue, we propose Bark-scaled Gauss linear filter bank superposition cepstral coefficients (BGLCC), multidimensional central difference (MDCD) extracted method. The focuses on low-frequency information, while filtering is uniformly distributed, therefore, can obtain more richer audio signals. In addition,...

10.24425/aoa.2024.148768 article EN cc-by Archives of Acoustics 2024-03-19

CycleVTON: A Cycle Mapping Framework for Parser-Free Virtual Try-On

OPENALEX - Publications

Chenghu Du Junyin Wang Yi Rong Shuqing Liu Kai Liu and 1 more

Image-based virtual try-on aims to transfer a target clothing onto specific person. A significant challenge is arbitrarily matched and person lack corresponding ground truth supervised learning. recent pioneering work leveraged an improved cycleGAN enable one network generate the desired image for another during training. However, there no difference in result distribution before after changes. Therefore, using two different networks unnecessary may even increase difficulty of convergence....

10.1609/aaai.v38i2.27928 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Self-supervised flow field decoupling for Controllable face reenactment

OPENALEX - Publications

Xianwei Kong Shengwu Xiong

Abstract Face reenactment is a face image generation method. Its main task to generate new given source and driving image, which has the facial motion information of while retaining content image. Existing flow-based approaches have demonstrated high-quality results, but these works regard head movement as whole, cannot achieve more flexible control, often suffer from loss identity information. In this paper, we propose novel Controllable multi-identity reenactment(CFReenet), uses prior...

10.1088/1742-6596/2253/1/012034 article EN Journal of Physics Conference Series 2022-04-01

Directional Regularized Tensor Modeling for Video Rain Streaks Removal

OPENALEX - Publications

Zhaoyang Sun Shengwu Xiong Ryan Wen Liu

Outdoor videos sometimes contain unexpected rain streaks due to the rainy weather, which bring negative effects on subsequent computer vision applications, e.g., video surveillance, object recognition and tracking, etc. In this paper, we propose a directional regularized tensor-based deraining model by taking into consideration arbitrary direction of streaks. particular, sparsity in spatial derivative domains, spatiotemporal low-rank property background are incorporated proposed method....

10.48550/arxiv.1902.07090 preprint EN other-oa arXiv (Cornell University) 2019-01-01

The Image Texture Analysis of the Turning Workpieces Based on FBM Model for TCM

OPENALEX - Publications

Shi Ming Ji L.B. Zhang Li Zhang Shengwu Xiong Yang-Rong Ye and 3 more

10.4028/www.scientific.net/kem.259-260.702 article EN Key engineering materials 2004-03-01

SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal

OPENALEX - Publications

Zhaoyang Sun Yaxiong Chen Shengwu Xiong

Makeup transfer is not only to extract the makeup style of reference image, but also render semantic corresponding position target image. However, most existing methods focus on former and ignore latter, resulting in a failure achieve desired results. To solve above problems, we propose unified Symmetric Semantic-Aware Transformer (SSAT) network, which incorporates correspondence learning realize removal simultaneously. In SSAT, novel Semantic Corresponding Feature Transfer (SSCFT) module...

10.48550/arxiv.2112.03631 preprint EN other-oa arXiv (Cornell University) 2021-01-01

KTR2DN: Knowledge Transfer with Residual in Residual Dehazing Network

OPENALEX - Publications

Yihua Lu Pengfei Duan Xiongbo Lu Lei Zhou Shengwu Xiong

Single-image dehazing is an essential but challenging computer vision problem. Due to the lack of nonhomogeneous haze datasets, most existing image methods are only applicable homogeneous rather than tasks. In addition, results always blurred in detail. Thus, a novel network structure, Knowledge Transfer with Residual Dehazing Network, KTR2DN, proposed, which consists two parts: knowledge transfer and super-resolution using (R2) block. The former aims solve problem lacking datasets it...

10.1145/3580219.3580230 article EN 2023-01-28

Residual Deformable Convolution for Better Image De-Weathering

OPENALEX - Publications

Huikai Liu Ao Zhang Wenqian Zhu Bin Fu Bingjian Ding and 1 more

Adverse weather conditions pose great challenges to computer vision tasks in the wild. Image de-weathering, which aims at removing degradations from videos and images, has hence accumulated huge popularity as a significant component of image restoration. Considering computational efficiency for on-device applications, Autoencoder-based deep models are widely adopted degradation removal due its excellent generalization high efficiency. However, most these models, parts high-frequency...

10.2139/ssrn.4431494 preprint EN 2023-01-01

Lane-Aware Transformers for Multi-Agent Trajectory Prediction

OPENALEX - Publications

Tao Yang Shengwu Xiong

For autonomous driving vehicles, accurately predicting the future trajectories of interactive road agents and planning a trajectory that complies with societal requirements resembles human-like behavior is extremely important. Existing multi-vehicle prediction methods have redundancy when dealing multi-agent scenarios, is, they repeatedly encode invariant scenes around each vehicle, such as lane lines, which leads to increased delays in model's reasoning. To solve this problem, we propose...

10.1145/3653081.3653192 article EN 2023-11-24

Robust Facial Landmark Localization Based on Texture and Pose Correlated Initialization

OPENALEX - Publications

Yiyun Pan Junwei Zhou Y. S. Gao Shengwu Xiong

Robust facial landmark localization remains a challenging task when faces are partially occluded. Recently, the cascaded pose regression has attracted increasing attentions, due to it's superior performance in and occlusion detection. However, such an approach is sensitive initialization, where improper initialization can severly degrade performance. In this paper, we propose Initialization for Cascaded Pose Regression (RICPR) by providing texture correlated initial shapes testing face. By...

10.48550/arxiv.1805.05612 preprint EN other-oa arXiv (Cornell University) 2018-01-01

ATVNet: Adaptive Thin Cost Volume for Stereo Disparity Estimation

OPENALEX - Publications

Wang Xiao-nan Shengwu Xiong Tao Yu

Recently, stereo models based on coarse-to-fine approaches have drastically alleviated the memory footprint and speed limitations of complex network models. However, previous designs used a uniformly defined range for disparity estimation, which ignored difficulty pixel matching in different regions introduced many unnecessary candidates. In this paper, we construct Adaptive Thin Volume Network (ATVNet) to improve accuracy reduce computation time. Firstly, multi-scale feature maps are...

10.1109/ctisc54888.2022.9849728 article EN 2022 4th International Conference on Advances in Computer Technology, Information Science and Communications (CTISC) 2022-04-22

Coming Soon ...