NFDI4DS | UHH-SEMS - Publication Details

Xi Zhang

ORCID: 0000-0002-0760-2843

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100430822

Research Areas

Advanced Vision and Imaging
Advanced Image Processing Techniques
Handwritten Text Recognition Techniques
Advanced Neural Network Applications
Advanced Image and Video Retrieval Techniques
Robotics and Sensor-Based Localization
Image and Signal Denoising Methods
Video Surveillance and Tracking Methods
Generative Adversarial Networks and Image Synthesis
Image Processing and 3D Reconstruction
Human Pose and Action Recognition
Optical measurement and interference techniques
Autonomous Vehicle Technology and Safety
Image Retrieval and Classification Techniques
3D Shape Modeling and Analysis
3D Surveying and Cultural Heritage
Industrial Vision Systems and Defect Detection
Vehicle License Plate Recognition
Advanced Data Compression Techniques
Sparse and Compressive Sensing Techniques
Face and Expression Recognition
Power Line Inspection Robots
Gaze Tracking and Assistive Technology
Grey System Theory Applications
Energy Load and Power Forecasting

China Tobacco
2025

State Grid Corporation of China (China)
2020-2024

Shanghai Jiao Tong University
2020-2024

China University of Mining and Technology
2022-2023

Ministry of Transport
2022-2023

Kunming University of Science and Technology
2023

Amazon (Germany)
2022-2023

Huawei Technologies (United Kingdom)
2022

Shaanxi Normal University
2022

Guangdong Institute of Intelligent Manufacturing
2021

ABO: Dataset and Benchmarks for Real-World 3D Object Understanding

OPENALEX - Publications

Jasmine Collins Shubham Goel Kenan Deng Achleshwar Luthra Leon L. Xu and 7 more

We introduce Amazon Berkeley Objects (ABO), a new large-scale dataset designed to help bridge the gap between real and virtual 3D worlds. ABO contains product catalog images, metadata, artist-created models with com-plex geometries physically-based materials that cor-respond real, household objects. derive challenging benchmarks exploit unique properties of measure current limits state-of-the-art on three open problems for real-world object understanding: single-view reconstruction, material...

10.1109/cvpr52688.2022.02045 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

NTIRE 2023 Challenge on Image Denoising: Methods and Results

OPENALEX - Publications

Yawei Li Yulun Zhang Radu Timofte Luc Van Gool Zhijun Tu and 76 more

This paper reviews the NTIRE 2023 challenge on image denoising (σ = 50) with a focus proposed solutions and results. The aim is to obtain network design capable produce high-quality results best performance measured by PSNR for denoising. Independent additive white Gaussian noise (AWGN) assumed level 50. had 225 registered participants, 16 teams made valid submissions. They gauge state-of-the-art

10.1109/cvprw59228.2023.00188 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023-06-01

CT-Net: An Efficient Network for Low-Altitude Object Detection Based on Convolution and Transformer

OPENALEX - Publications

Tao Ye Jun Zhang Yunwang Li Xi Zhang Zongyang Zhao and 1 more

With the growing popularity of civilian unmanned aerial vehicles (UAVs), unauthorized flights are on rise accordingly. Therefore, it is critical to detect low-altitude UAVs for protecting personal privacy and public safety. Though substantial progress has been made in UAV detection, existing detection methods still have problems balancing accuracy, model size, speed. To address these limitations, this article proposes a novel deep learning method named convolution–transformer network...

10.1109/tim.2022.3165838 article EN IEEE Transactions on Instrumentation and Measurement 2022-01-01

Improving Small Object Detection in Tobacco Strands Using Optimized Anchor Boxes

OPENALEX - Publications

Shaolong Han Wenqi Liu Shangrong Wang Xi Zhang Songjin Zheng

10.1109/access.2025.3531050 article EN cc-by-nc-nd IEEE Access 2025-01-01

A Novel Mid- and Long-term Time-Series Forecasting Framework for Electricity Price Based on Hierarchical Recurrent Neural Networks

OPENALEX - Publications

Weiwu Yan Peng Wang Renchao Xu Rui Han Enze Chen and 2 more

10.1016/j.jfranklin.2025.107590 article EN Journal of the Franklin Institute 2025-02-01

A novel face recognition method based on fusion of LBP and HOG

OPENALEX - Publications

Ting Chen Tao Gao Shuying Li Xi Zhang Jinpei Cao and 2 more

Abstract As one of the hot topics in field computer vision research, face recognition technology has received significant attention due to its potentiality for a wide range applications government as well commercial purposes. In practical applications, although several existing methods have achieved good performances specific scenes, they easily suffer from sharp decline rate if affected by different conditions light, expression, posture and occlusion. Among many factors, influences complex...

10.1049/ipr2.12192 article EN cc-by IET Image Processing 2021-04-06

Design of IIR orthogonal wavelet filter banks using lifting scheme

OPENALEX - Publications

Xi Zhang Wei Wang Toshinori Yoshikawa Yoshinori Takei

The lifting scheme is well known to be an efficient tool for constructing second generation wavelets and often used design a class of biorthogonal wavelet filter banks. For its efficiency, the implementation has been adopted in international standard JPEG2000. It that orthogonality important property many applications. This paper presents how implement infinite-impulse-response (IIR) orthogonal banks by using with two steps. shown IIR can realized allpass filters Then, proposed discussed....

10.1109/tsp.2006.874791 article EN IEEE Transactions on Signal Processing 2006-06-21

A study on automatic on-machine inspection system for 3D modeling and measurement of cutting tools

OPENALEX - Publications

Xi Zhang Wai-Ming Tsang Kazuo Yamazaki M. Mori

10.1007/s10845-011-0540-6 article EN Journal of Intelligent Manufacturing 2011-05-23

Pose-aware Multi-feature Fusion Network for Driver Distraction Recognition

OPENALEX - Publications

Mingyan Wu Xi Zhang Linlin Shen Hang Yu

Traffic accidents caused by distracted driving have gradually increased in recent years. In this work, we propose a novel multi-feature fusion network based on pose estimation, for image detection. Since hand is the most important part of driver to infer actions, our proposed method firstly detects hands using human body posture information. addition features extracted from whole image, also include information and posture. The global feature, are finally fused weighted combination...

10.1109/icpr48806.2021.9413337 article EN 2022 26th International Conference on Pattern Recognition (ICPR) 2021-01-10

Learning Graph Variational Autoencoders with Constraints and Structured Priors for Conditional Indoor 3D Scene Generation

OPENALEX - Publications

Aditya Chattopadhyay Xi Zhang David Wipf Himanshu Arora René Vidal

We present a graph variational autoencoder with structured prior for generating the layout of indoor 3D scenes. Given room type (e.g., living or library) and elements such as floor walls), our architecture generates collection objects furniture items sofa, table chairs) that is consistent layout. This challenging problem because generated scene needs to satisfy multiple constrains, e.g., each object should lie inside two not occupy same volume. To address these challenges, we propose deep...

10.1109/wacv56688.2023.00085 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023-01-01

Handwritten word image matching based on Heat Kernel Signature

OPENALEX - Publications

Xi Zhang Chew Lim Tan

10.1016/j.patcog.2014.10.028 article EN Pattern Recognition 2014-10-28

Segmentation-Free Keyword Spotting for Bangla Handwritten Documents

OPENALEX - Publications

Xi Zhang Umapada Pal Chew Lim Tan

In this paper, a segmentation-free keyword spotting method is proposed for Bangla handwritten documents. order to tolerate large variations in scenarios, we extracted key points based on SIFT point detector, and the end intersection found by morphological operations. Heat Kernel signature (HKS) used present local characteristics of detected points. Instead using same size patch all points, apply dynamically deciding size. Furthermore, our reduces scope searching document only considering...

10.1109/icfhr.2014.70 article EN 2014-09-01

Segmented handwritten text recognition with recurrent neural network classifiers

OPENALEX - Publications

Bolan Su Xi Zhang Shijian Lu Chew Lim Tan

Recognition of handwritten text is a useful technique that can be applied in different applications, such as signature recognition, bank check etc. However, the off-line recognition an unconstrained situation still very challenging task due to high complexity strokes and image background. This paper presents novel segmented ensembles recurrent neural network (RNN) classifiers. Two RNN models are first trained take advantage widely used geometrical feature Histogram Oriented Gradient (HOG)...

10.1109/icdar.2015.7333789 article EN 2015-08-01

Summary on calibration method of line-structured light sensor

OPENALEX - Publications

Xi Zhang Jian Zhang

Line-structured light sensor (LLS) can provide the capability of three-dimensional point acquisition for robotics. As one type 3D scanners, LLS has been widely used in many filed robotics its strong anti-interference, fast scanning speed and high measuring accuracy. Researchers have studying methods to improve accuracy, operability years. In this paper, calibration are reviewed, which covers target plane methods. What's more, some potential improvements discussed analyzed. This review paper...

10.1109/robio.2017.8324571 article EN 2021 IEEE International Conference on Robotics and Biomimetics (ROBIO) 2017-12-01

Upsampling Matters for Road Marking Segmentation of Autonomous Driving

OPENALEX - Publications

Ye Liu Xi Zhang Lei Liu Lei Zhang

Although autonomous driving have become applicable to the industry, prevalent application of key techniques vehicles still needs be refined. For instance, how fast and accurately segment road markings in order assist next pedestrian path prediction creation high-definition (HD) map respectively is useful for more practical. Current marking segmentation mainly rely on semantic computer vision with encoder-decoder architecture. However, as demonstrated this paper, upsampling layer...

10.1016/j.ifacol.2021.04.102 article EN IFAC-PapersOnLine 2020-01-01

Layered Optical Flow Estimation Using a Deep Neural Network with a Soft Mask

OPENALEX - Publications

Xi Zhang Di Ma Xu Ouyang Shanshan Jiang Lin Gan and 1 more

Using a layered representation for motion estimation has the advantage of being able to cope with discontinuities and occlusions. In this paper, we learn estimate optical flow by combining deep learning. Instead pre-segmenting image layers, proposed approach automatically generates using soft-mask module. The essential components module are maxout fuse operations, which enable disjoint more accurate estimation. We show that masks results in quadratic function input features output layer. can...

10.24963/ijcai.2018/163 article EN 2018-07-01

Unconstrained Handwritten Word Recognition Based on Trigrams Using BLSTM

OPENALEX - Publications

Xi Zhang Chew Lim Tan

To get high recognition accuracy, we should train the recognizer with sufficient training data to capture characteristics of various handwriting styles and all possible occurring words. However, in most cases, available are not satisfactory enough, especially for unseen data. In this paper, try improve accuracy randomly selected data, by splitting into two parts based on trigrams recognizers separately. We also propose a modified version token passing algorithm, which makes use outputs accuracy.

10.1109/icpr.2014.502 article EN 2014-08-01

Augmented Reality in Telecom Industry: Concepts, Technologies and Applications

OPENALEX - Publications

Zhiqiang Tian Peng Gao Lu Yang Junjian Liu Xi Zhang and 3 more

Augmented reality, as an important end of merging virtual objects and the real world, has been widely used in Internet, games, e-commerce, other fields. The maturity AR technology brought a broad market space gained attention researchers companies. As ICT industry, tele-com sector also needs for its complex business scenarios issues. Unfortunately, few studies focused on application telecommunication field. This paper starts from concept technology, introduces key technologies details cases...

10.1109/ismar-adjunct57072.2022.00012 article EN 2022 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct) 2022-10-01

FLLIC: Functionally Lossless Image Compression

OPENALEX - Publications

Xi Zhang Xiaolin Wu

Recently, DNN models for lossless image coding have surpassed their traditional counterparts in compression performance, reducing the bit rate by about ten percent natural color images. But even with these advances, mathematically (MLLIC) ratios images still fall short of bandwidth and cost-effectiveness requirements most practical imaging vision systems at present beyond. To break bottleneck MLLIC we question necessity MLLIC, as almost all digital sensors inherently introduce acquisition...

10.48550/arxiv.2401.13616 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Path planning of substation inspection robot based on high-precision positioning and navigation technology

OPENALEX - Publications

Zexu Du Guoliang Zhang Yi Zhang Jiangqi Chen Xi Zhang

Abstract Outdoor substation is an important part of power system. Substation inspection robot based on intelligent autonomous system has become the research focus unmanned inspection. In order to improve positioning accuracy and speed system, a high-precision algorithm transformer detection proposed in this paper. Tikhonov regularization used correct pathological problem localization model. The observation amount receiver increased by using four signals single base station with double...

10.1093/ijlct/ctae125 article EN cc-by International Journal of Low-Carbon Technologies 2024-01-01

Hybrid Attention-based Multi-task Vehicle Motion Prediction Using Non-Autoregressive Transformer and Mixture of Experts

OPENALEX - Publications

Hao Jiang Chuan Hu Yixun Niu Biao Yang Hao Chen and 1 more

10.1109/tiv.2024.3523318 article EN IEEE Transactions on Intelligent Vehicles 2024-01-01

Asymmetric Risk-Field Based Spatio-Temporal Trajectory Planning for Autonomous Driving Considering Game Interaction

OPENALEX - Publications

Zihao Chen Hui Pang Chuan Hu Xi Zhang

10.1109/cdc56724.2024.10886599 article EN 2024-12-16

FEGNet: A feature enhancement and guided network for infrared object detection in underground mines

OPENALEX - Publications

Lisha Huang Xi Zhang Miao Yu Songyue Yang Cao Xiao and 1 more

Object detection plays an important role in underground intelligent vehicles and transportation systems. Due to the uneven light mining scenarios, infrared cameras are one of typical onboard sensors for environmental perception. Although object has been studied decades, it still confronts challenge detecting objects mines. The contributing factors include weak small images similar environments scenarios. In this paper, a Feature Enhancement Guided Network (FEGNet) is proposed address these...

10.1177/09544070231165627 article EN Proceedings of the Institution of Mechanical Engineers Part D Journal of Automobile Engineering 2023-04-01

Prior-YOLO: Enhancing Intelligent Vehicle Small Object Detection with Driving Status-Informed YOLOv8

OPENALEX - Publications

Shuang Hu Baixuan Zhao Taojun Ding Hao Jiang Chuan Hu and 1 more

In the realm of intelligent vehicles, evolution object detection algorithms is paramount importance. Current deep learning-based methodologies excel in identifying medium to large-sized objects but often falter with smaller entities. A notable research gap exists integrating key vehicular state data, such as velocity and steering angle, into generally designed frameworks. To bridge these gaps, we present Prior-YOLO, a novel modification YOLO v8, marked by advanced network structure refined...

10.1109/cei60616.2023.10528134 article EN 2023-12-15

Coming Soon ...