NFDI4DS | UHH-SEMS - Publication Details

Xiaoming Huang

ORCID: 0000-0003-4254-2820

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100668774

Research Areas

Visual Attention and Saliency Detection
Advanced Image and Video Retrieval Techniques
Advanced Neural Network Applications
Image and Video Quality Assessment
Advanced Vision and Imaging
Advanced Image Processing Techniques
Image Enhancement Techniques
Face recognition and analysis
Image Retrieval and Classification Techniques
Olfactory and Sensory Function Studies
Handwritten Text Recognition Techniques
Asian Culture and Media Studies
Generative Adversarial Networks and Image Synthesis
Image Processing Techniques and Applications
Image and Signal Denoising Methods
Vehicle License Plate Recognition
Image and Object Detection Techniques
Diversity and Impact of Dance
Adversarial Robustness in Machine Learning
Concrete Corrosion and Durability
High-Voltage Power Transmission Systems
Vacuum and Plasma Arcs
Anomaly Detection Techniques and Applications
Advanced Image Fusion Techniques
Multimodal Machine Learning Applications

Sun Yat-sen Memorial Hospital
2024

Sun Yat-sen University
2024

Beijing Information Science & Technology University
2019-2024

Tencent (China)
2021-2022

Southeast University
2009-2022

Guangxi Normal University
2012-2021

Tsinghua University
2016-2018

Chongqing University of Posts and Telecommunications
2014-2015

State Key Laboratory of Digital Multimedia Chip Technology
2014

State Key Laboratory of Digital Publishing Technology
2014

IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation

OPENALEX - Publications

Lingtong Kong Boyuan Jiang Donghao Luo Wenqing Chu Xiaoming Huang and 3 more

Prevailing video frame interpolation algorithms, that generate the intermediate frames from consecutive inputs, typically rely on complex model architectures with heavy parameters or large delay, hindering them diverse real-time applications. In this work, we devise an efficient encoder-decoder based network, termed IFRNet, for fast in-termediate synthesizing. It first extracts pyramid features given and then refines bilateral flow fields together a powerful intermedi-ate feature until...

10.1109/cvpr52688.2022.00201 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Structural damage-causing concrete cracking detection based on a deep-learning method

OPENALEX - Publications

Xiaojian Han Zhicheng Zhao Lingkun Chen Xiaolun Hu Yuan Tian and 3 more

10.1016/j.conbuildmat.2022.127562 article EN Construction and Building Materials 2022-04-25

300-FPS Salient Object Detection via Minimum Directional Contrast

OPENALEX - Publications

Xiaoming Huang Yu‐Jin Zhang

Global contrast considers the color difference between a target region or pixel and rest of image. It is frequently used to measure saliency pixel. In previous global contrast-based methods, usually measured by sum from entire We find that spatial distribution one important cue neglected works. Foreground has high all directions, since it surrounded background. Background often shows low in at least direction, as connect Motivated this intuition, we first compute directional different...

10.1109/tip.2017.2710636 article EN IEEE Transactions on Image Processing 2017-06-01

Convolutional Dynamically Convergent Differential Neural Network for Brain Signal Classification

OPENALEX - Publications

Zhijun Zhang Yu He W.Y. Mai Yamei Luo Xiaoli Li and 3 more

The brain signal classification is the basis for implementation of brain-computer interfaces (BCIs). However, most existing methods are based on processing technology, which require a significant amount manual intervention, such as channel selection and dimensionality reduction, often struggle to achieve satisfactory accuracy. To high accuracy little intervention possible, convolutional dynamically convergent differential neural network (ConvDCDNN) proposed solving electroencephalography...

10.1109/tnnls.2024.3437676 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-01-01

Weakly supervised salient object detection via bounding-box annotation and SAM model

OPENALEX - Publications

Xiangquan Liu Xiaoming Huang

<abstract><p>Salient object detection (SOD) aims to detect the most attractive region in an image. Fully supervised SOD based on deep learning usually needs a large amount of data with human annotation. Researchers have gradually focused task using weakly annotation such as category, scribble, and bounding-box, while these existing methods achieve limited performance demonstrate huge gap fully methods. In this work, we proposed one novel two-stage method bounding-box recent...

10.3934/era.2024074 article EN cc-by Electronic Research Archive 2024-01-01

Water flow driven salient object detection at 180 fps

OPENALEX - Publications

Xiaoming Huang Yu‐Jin Zhang

10.1016/j.patcog.2017.10.027 article EN Pattern Recognition 2017-10-23

An O (1) disparity refinement method for stereo matching

OPENALEX - Publications

Xiaoming Huang Yu‐Jin Zhang

10.1016/j.patcog.2016.01.025 article EN Pattern Recognition 2016-02-05

Developing an Optical Image-Based Method for Bridge Deformation Measurement Considering Camera Motion

OPENALEX - Publications

Vahid Abolhasannejad Xiaoming Huang N.M. Namazi

Since deformation estimation may lead to errors occurring when the camera vibrates, it is necessary remove image global motion before computing real bridge deformation. In this study, a combination of correction algorithm and 2D image-based measurement technique was utilized address issue during data acquisition for measurement. Based on proposed methodology, parameters were estimated by defining an effective sub-image in using Iterative Affine Motion Estimator. Then applied all pixels each...

10.3390/s18092754 article EN cc-by Sensors 2018-08-21

ASFD: Automatic and Scalable Face Detector

OPENALEX - Publications

Jian Li Bin Zhang Yabiao Wang Ying Tai Zhenyu Zhang and 4 more

Along with current multi-scale based detectors, Feature Aggregation and Enhancement (FAE) modules have shown superior performance gains for cutting-edge object detection. However, these hand-crafted FAE show inconsistent improvements on face detection, which is mainly due to the significant distribution difference between its training applying corpus, i.e. COCO vs. WIDER Face. To tackle this problem, we essentially analyse effect of data distribution, consequently propose search an effective...

10.1145/3474085.3475372 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

Text detection and recognition in natural scene images

OPENALEX - Publications

Xiaoming Huang Tao Shen Run Wang Chenqiang Gao

Text detection and recognition in natural scene images plays an important role content analysis of images. In this paper, based on the characteristics text, we propose a robust text method using Maximally Stable Extremal Regions (MSER) Support Vector Machine (SVM). Different from end to recognition, split problem into procedure. Firstly, stage, order extract potential as much possible, use MSER color clustering connected component. Then, for obtained candidate component, visual saliency some...

10.1109/icedif.2015.7280160 article EN 2015-01-01

Remote Sensing Image Super-Resolution Adversarial Network Based on Reverse Feature Fusion and Residual Feature Dilation

OPENALEX - Publications

Rui Han Bingxiao Mei Xiaoming Huang Hanbo Xue Xiongwei Jiang and 1 more

Enhancing the resolution of images by super-resolution reconstruction algorithm is less costly compared with way upgrading hardware devices. At present, image can recover texture details well, but performance relatively general on remote sensing lower contrast and more complex texture, reconstructed are prone to noise, missing checkerboard effect. This paper proposes a adversarial network based inverse feature fusion for characteristics images, which combines high-level semantics low-level...

10.1109/access.2023.3304050 article EN cc-by IEEE Access 2023-01-01

Fast Video Saliency Detection via Maximally Stable Region Motion and Object Repeatability

OPENALEX - Publications

Xiaoming Huang Yu‐Jin Zhang

Motion information is one important cue in unsupervised video salient object detection. In order to estimate motion videos, most of the methods adopt time-consuming algorithms such as large displacement optical flow estimation (needs more than 8-40s with 640X480 size per frame), which leads saliency detection only 0.01-0.1 FPS speed and limits its application. human visual system, usually considered a whole. Therefore, we need not compute each pixel. Instead, it desirable probability pixel...

10.1109/tmm.2021.3094356 article EN IEEE Transactions on Multimedia 2021-07-02

50 FPS Object-Level Saliency Detection via Maximally Stable Region

OPENALEX - Publications

Xiaoming Huang Yin Zheng Junzhou Huang Yu‐Jin Zhang

The human visual system tends to consider saliency of an object as a whole. Some object-level detection methods have been proposed by leveraging proposals in bounding boxes, and regarding the entire box one candidate salient region. However, boxes can not provide exact position lot pixels belong background. Consequently, background also show high saliency. Besides, acquiring needs time cost. In order compute saliency, we region growing from some seed superpixels, find surrounding which...

10.1109/tip.2019.2941663 article EN IEEE Transactions on Image Processing 2019-09-20

Table frame line detection in low quality document images based on Hough transform

OPENALEX - Publications

Yangyang Tian Chenqiang Gao Xiaoming Huang

Table detection is of importance in the field document images analysis and processing, especially table frame line detection. Although a great success has been achieved for high quality during past decade, low still remains challenge. To address this problem, we proposed neoteric method to detect automatically images. Firstly, Radon transform adopted skew then correct it. Secondly, run length smoothing algorithm (RLSA) used extract lines longer than predefined threshold. Thirdly, locate...

10.1109/icsai.2014.7009397 article EN 2014-11-01

Physically-guided Disentangled Implicit Rendering for 3D Face Modeling

OPENALEX - Publications

Zhenyu Zhang Yanhao Ge Ying Tai Weijian Cao Renwang Chen and 6 more

This paper presents a novel Physically-guided Disentangled Implicit Rendering (PhyDIR) framework for highfidelity 3D face modeling. The motivation comes from two observations: Widely-used graphics renderers yield excessive approximations against photo-realistic imaging, while neural rendering methods produce superior appearances but are highly entangled to perceive 3D-aware operations. Hence, we learn disentangle the implicit via explicit physical guidance, guaranteeing properties of: (1)...

10.1109/cvpr52688.2022.01971 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Character recognition in low quality document images using local and global features

OPENALEX - Publications

Chenqiang Gao Xiaoming Huang

Although a great success has been achieved for the situation of high quality images during past decades, Character recognition in low still remains challenge. To tackle this challenge, paper novel method SVM framework is proposed to recognize characters document by using local and global features. Firstly, multi-scale sliding window strategy with pruning character traits adopted generate potential sub-regions. Then, conventional feature state-of-art feature, namely histogram oriented...

10.1109/cisp.2014.7003864 article EN 2014-10-01

A fast non-local disparity refinement method for stereo matching

OPENALEX - Publications

Xiaoming Huang Guoqin Cui Yundong Zhang

This paper presents a fast non-local disparity refinement method based on belief propagation. The propagated minimum spanning tree only need two sequential passes, first from leaf nodes to root, then root nodes. Computational complexity of each pixel at all levels is O(1). Performance evaluation standard Middlebury data sets shows that the proposed outperforms local both in accuracy and speed. Compared with existing nonlocal method, about maximum 15× faster speed almost same accuracy.

10.1109/icip.2014.7025776 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2014-10-01

A neuron image segmentation method based Deep Boltzmann Machine and CV model

OPENALEX - Publications

Fuyun He Xiaoming Huang Xun Wang Senhui Qiu Frank Jiang and 1 more

10.1016/j.compmedimag.2021.101871 article EN Computerized Medical Imaging and Graphics 2021-02-23

Weakly supervised salient object detection via image category annotation

OPENALEX - Publications

Ruoqi Zhang Xiaoming Huang Qiang Zhu

<abstract><p>The rapid development of deep learning has made a great progress in salient object detection task. Fully supervised methods need large number pixel-level annotations. To avoid laborious and consuming annotation, weakly consider low-cost annotations such as category, bounding-box, scribble, etc. Due to simple annotation existing large-scale classification datasets, the category based have received more attention while still suffering from inaccurate detection. In this...

10.3934/mbe.2023945 article EN cc-by Mathematical Biosciences & Engineering 2023-01-01

Analysis of different test techniques of DC circuit breaker

OPENALEX - Publications

Ruike Zhang Feng Xu Ziyue Yang Yifei Wu Wenbo Wu and 3 more

Abstract Amidst the escalating demand for energy and proliferation of renewable sources, direct current (DC) systems have garnered increasing interest. As pivotal safety reliability DC systems, testing circuit breakers is a critical step to evaluate their performance dependability. This study focuses on breaking test method medium high-voltage breakers. It compares evaluates waveform, recovery voltage, dissipation across various methodologies during process assesses suitability methods...

10.1088/1742-6596/2850/1/012002 article EN Journal of Physics Conference Series 2024-09-01

Safety Helmet Detection Based On YOLOV3N

OPENALEX - Publications

Li Liu Rui Han Xiaoming Huang Xiongwei Jiang Qiancheng Hong and 1 more

The YOLOv3 algorithm is widely used in the industry due to its high speed and precision. Aiming at problem of low detection accuracy slow rate wearing helmets intelligent monitoring, a YOLOv3N based on improved (You Only Look Once) proposed. Improve network structure basis algorithm, replace Darknet-53 traditional convolution with fewer parameters, reduce model increase rate; order screen out required frames more reasonably, NMS optimized. Experimental results show that compared YOLOv3,...

10.1109/cisp-bmei53629.2021.9624363 article EN 2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) 2021-10-23

A Minimum Barrier Distance based Saliency Box for Object Proposals Generation

OPENALEX - Publications

Xiaoming Huang Yin Zheng Junzhou Huang Yu‐Jin Zhang

Object proposals generation plays an important role in computer vision. A good object model should assign obviously high and low objectness score to the window that contains complete objects incomplete objects, respectively. However, some existing methods such as local contrast based models usually fail satisfy this requirement. In letter, we propose MBDSal Box, a minimum barrier distance (MBD) saliency box for locating proposals. Box consists of three components: item: First, computation...

10.1109/lsp.2018.2844097 article EN IEEE Signal Processing Letters 2018-01-01

An Improved Filtering for Fast Stereo Matching

OPENALEX - Publications

Xiaoming Huang Guoqin Cui Yundong Zhang

This paper presents a novel full-image guided filtering based on eight-connected weight propagation for dense stereo matching. The proposed method has three main features: first, the is more approximate compared to previous approach, second, pixels employed into are all without constrained by one fixed window, last but not least, computational complexity of each pixel at disparity level 0(1), and implementation filter can efficiently parallelized hardware platform. Performance evaluation...

10.1109/icpr.2014.423 article EN 2014-08-01

Novice Cognition Pattern of Guide Sign Based on Eye Tracking and Physical Feedback Tests

OPENALEX - Publications

Yingying Cheng Xiaoming Huang Fei Chen Fang Wang

To ascertain the cognition pattern of novice driver, indoor tests were designed in this study. Subjects' reaction time during ongoing test was recorded. By using H6 Head Mounted Optics eye tracking system, subjects' movement data recorded; at same time, EEG recorded by BioGraph Infiniti physical feedback system. With field data, paper analyzed correlation between average and strokes Chinese character (Ns), total number information on sign (Nt) both placename searching placenames reading...

10.1061/41039(345)237 article EN 2009-07-29

Coming Soon ...