NFDI4DS | UHH-SEMS - Publication Details

King Ngi Ngan

ORCID: 0000-0003-1946-3235

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5062332580

Research Areas

Video Coding and Compression Technologies
Advanced Data Compression Techniques
Advanced Vision and Imaging
Image and Video Quality Assessment
Advanced Image and Video Retrieval Techniques
Advanced Image Processing Techniques
Visual Attention and Saliency Detection
Image and Signal Denoising Methods
Image Enhancement Techniques
Advanced Image Fusion Techniques
Image Retrieval and Classification Techniques
Video Surveillance and Tracking Methods
Advanced Neural Network Applications
Video Analysis and Summarization
Face recognition and analysis
Medical Image Segmentation Techniques
Domain Adaptation and Few-Shot Learning
Multimodal Machine Learning Applications
Image Processing Techniques and Applications
Face and Expression Recognition
Advanced Wireless Communication Techniques
Multimedia Communication and Technology
Digital Filter Design and Implementation
Human Pose and Action Recognition
Computer Graphics and Visualization Techniques

University of Electronic Science and Technology of China
2016-2025

Chinese University of Hong Kong
2013-2022

Australian National University
2019

University of Science and Technology of China
2019

University of Hong Kong
2008

Nanyang Technological University
2001-2005

National University of Singapore
1985-2005

The University of Western Australia
1995-2004

Applied Materials (United States)
1993-2003

Monash University
1991-2003

Face segmentation using skin-color map in videophone applications

OPENALEX - Publications

Douglas Chai King Ngi Ngan

This paper addresses our proposed method to automatically segment out a person's face from given image that consists of head-and-shoulders view the person and complex background scene. The involves fast, reliable, effective algorithm exploits spatial distribution characteristics human skin color. A universal skin-color map is derived used on chrominance component input detect pixels with appearance. Then, based detected their corresponding luminance values, employs set novel regularization...

10.1109/76.767122 article EN IEEE Transactions on Circuits and Systems for Video Technology 1999-06-01

Unsupervised extraction of visual attention objects in color images

OPENALEX - Publications

Junwei Han King Ngi Ngan Mingjing Li Hao Zhang

This paper proposes a generic model for unsupervised extraction of viewer's attention objects from color images. Without the full semantic understanding image content, formulates as Markov random field (MRF) by integrating computational visual mechanisms with object growing techniques. Furthermore, we describe MRF Gibbs an energy function. The minimization function provides practical way to obtain objects. Experimental results on 880 real images and user subjective evaluations 16 subjects...

10.1109/tcsvt.2005.859028 article EN IEEE Transactions on Circuits and Systems for Video Technology 2005-12-28

A Co-Saliency Model of Image Pairs

OPENALEX - Publications

Hongliang Li King Ngi Ngan

In this paper, we introduce a method to detect co-saliency from an image pair that may have some objects in common. The is modeled as linear combination of the single-image saliency map (SISM) and multi-image (MISM). first term designed describe local attention, which computed by using three detection techniques available literature. To compute MISM, co-multilayer graph constructed dividing into spatial pyramid representation. Each node described two types visual descriptors, are extracted...

10.1109/tip.2011.2156803 article EN IEEE Transactions on Image Processing 2011-05-20

Spatio-Temporal Just Noticeable Distortion Profile for Grey Scale Image/Video in DCT Domain

OPENALEX - Publications

Zhenyu Wei King Ngi Ngan

In image and video processing field, an effective compression algorithm should remove not only the statistical redundancy information but also perceptually insignificant component from pictures. Just-noticeable distortion (JND) profile is efficient model to represent those perceptual redundancies. Human eyes are usually sensitive below JND threshold. this paper, a DCT based for monochrome pictures proposed. This incorporates spatial contrast sensitivity function (CSF), luminance adaptation...

10.1109/tcsvt.2009.2013518 article EN IEEE Transactions on Circuits and Systems for Video Technology 2009-02-19

Image Quality Assessment by Separately Evaluating Detail Losses and Additive Impairments

OPENALEX - Publications

Songnan Li Fan Zhang Lin Ma King Ngi Ngan

In the research field of image processing, mean squared error (MSE) and peak signal-to-noise ratio (PSNR) are extensively adopted as objective visual quality metrics, mainly because their simplicity for calculation optimization. However, it has been well recognized that these pixel-based difference measures correlate poorly with human perception. Inspired by existing works <citerefgrp xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><citeref...

10.1109/tmm.2011.2152382 article EN IEEE Transactions on Multimedia 2011-05-10

Automatic segmentation of moving objects for video object plane generation

OPENALEX - Publications

Thomas Meier King Ngi Ngan

The new video coding standard MPEG-4 is enabling content-based functionalities. It takes advantage of a prior decomposition sequences into object planes (VOPs) so that each VOP represents one moving object. A comprehensive review summarizes some the most important motion segmentation and generation techniques have been proposed. Then, automatic sequence algorithm extracts objects presented. core this an tracker matches two-dimensional (2-D) binary model against subsequent frames using...

10.1109/76.718500 article EN IEEE Transactions on Circuits and Systems for Video Technology 1998-01-01

Admission control in IEEE 802.11e wireless LANs

OPENALEX - Publications

Deyun Gao Jianfei Cai King Ngi Ngan

Although IEEE 802.11 based wireless local area networks have become more and popular due to low cost easy deployment, they can only provide best effort services do not quality of service supports for multimedia applications. Recently, a new standard, 802.11e, has been proposed, which introduces so-called hybrid coordination function containing two medium access mechanisms: contention-based channel controlled access. In this article we first give brief tutorial on the various MAC-layer QoS...

10.1109/mnet.2005.1470677 article EN IEEE Network 2005-07-01

Blind Image Quality Assessment Based on Multichannel Feature Fusion and Label Transfer

OPENALEX - Publications

Qingbo Wu Hongliang Li Fanman Meng King Ngi Ngan Bing Luo and 2 more

In this paper, we propose an efficient blind image quality assessment (BIQA) algorithm, which is characterized by a new feature fusion scheme and k-nearest-neighbor (KNN)-based prediction model. Our goal to predict the perceptual of without any prior information its reference distortion type. Since inaccessible in many applications, BIQA quite desirable context. our method, first introduced combining image's statistical from multiple domains (i.e., discrete cosine transform, wavelet, spatial...

10.1109/tcsvt.2015.2412773 article EN IEEE Transactions on Circuits and Systems for Video Technology 2015-03-13

Reduced-Reference Image Quality Assessment Using Reorganized DCT-Based Image Representation

OPENALEX - Publications

Lin Ma Songnan Li Fan Zhang King Ngi Ngan

In this paper, a novel reduced-reference (RR) image quality assessment (IQA) is proposed by statistical modeling of the discrete cosine transform (DCT) coefficient distributions. order to reduce RR data rates and further exploit identical nature distributions between adjacent DCT subbands, coefficients are reorganized into three-level tree. Subsequently, generalized Gaussian density (GGD) employed model distribution each subband. The city-block distance measure difference two images....

10.1109/tmm.2011.2109701 article EN IEEE Transactions on Multimedia 2011-01-31

Object Co-Segmentation Based on Shortest Path Algorithm and Saliency Model

OPENALEX - Publications

Fanman Meng Hongliang Li Guanghui Liu King Ngi Ngan

Segmenting common objects that have variations in color, texture and shape is a challenging problem.In this paper, we propose new model efficiently segments from multiple images.We first segment each original image into number of local regions.Then, construct digraph based on region similarities saliency maps.Finally, formulate the co-segmentation problem as shortest path problem, use dynamic programming method to solve problem.The experimental results demonstrate proposed can group images...

10.1109/tmm.2012.2197741 article EN IEEE Transactions on Multimedia 2012-09-12

Image Retargeting Quality Assessment: A Study of Subjective Scores and Objective Metrics

OPENALEX - Publications

Lin Ma Weisi Lin Chenwei Deng King Ngi Ngan

This paper presents the result of a recent large-scale subjective study image retargeting quality on collection images generated by several representative methods. Owning to many approaches that have been developed, there is need for diverse independent public database retargeted and corresponding scores be freely available. We build an database, in which 171 (obtained from 57 natural source different contents) were created And perceptual each subjectively rated at least 30 viewers,...

10.1109/jstsp.2012.2211996 article EN IEEE Journal of Selected Topics in Signal Processing 2012-08-07

Unsupervised Salient Object Segmentation Based on Kernel Density Estimation and Two-Phase Graph Cut

OPENALEX - Publications

Zhi Liu Ran Shi Liquan Shen Yinzhu Xue King Ngi Ngan and 1 more

In this paper, we propose an unsupervised salient object segmentation approach based on kernel density estimation (KDE) and two-phase graph cut. A set of KDE models are first constructed the pre-segmentation result input image, then for each pixel, a likelihoods to fit all calculated accordingly. The color saliency spatial model evaluated its distinctiveness distribution, pixel-wise map is generated by integrating likelihood measures pixels models. phase segmentation, cut exploited obtain...

10.1109/tmm.2012.2190385 article EN IEEE Transactions on Multimedia 2012-03-08

Co-Salient Object Detection From Multiple Images

OPENALEX - Publications

Hongliang Li Fanman Meng King Ngi Ngan

In this paper, we propose a novel method to discover co-salient objects from group of images, which is modeled as linear fusion an intra-image saliency (IaIS) map and inter-image (IrIS) map. The first term measure the salient each image using multiscale segmentation voting. second designed detect images. To compute IrIS map, perform pairwise similarity ranking based on pyramid representation. A minimum spanning tree then constructed determine matching order. For region in image, design three...

10.1109/tmm.2013.2271476 article EN IEEE Transactions on Multimedia 2013-06-27

MVF-Net: Multi-View 3D Face Morphable Model Regression

OPENALEX - Publications

Fanzi Wu Linchao Bao Yajing Chen Yonggen Ling Yibing Song and 3 more

We address the problem of recovering 3D geometry a human face from set facial images in multiple views. While recent studies have shown impressive progress Morphable Model (3DMM) based reconstruction, settings are mostly restricted to single view. There is an inherent drawback single-view setting: lack reliable constraints can cause unresolvable ambiguities. this paper explore 3DMM-based shape recovery different setting, where multi-view given as input. A novel approach proposed regress 3DMM...

10.1109/cvpr.2019.00105 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Simultaneously Detecting and Counting Dense Vehicles From Drone Images

OPENALEX - Publications

Wei Li Hongliang Li Qingbo Wu Xiaoyu Chen King Ngi Ngan

Unmanned aerial vehicles are an essential component in the realization of Industry 4.0. With drones helping to improve industrial safety and efficiency utilities, construction, communication, there is urgent need for drone-based intelligent applications. In this paper, we develop a unified framework simultaneously detect count from drone images. We first explore why state-of-the-art detectors fail highly dense scenes, which provides more appropriate insights. Then, propose effective loss...

10.1109/tie.2019.2899548 article EN IEEE Transactions on Industrial Electronics 2019-02-21

Video segmentation for content-based coding

OPENALEX - Publications

Thomas Meier King Ngi Ngan

To provide multimedia applications with new functionalities, the video coding standard MPEG-4 relies on a content-based representation. This requires prior decomposition of sequences into semantically meaningful, physical objects. We formulate this problem as one separating foreground objects from background based motion information. For object interest, 2D binary model is derived and tracked throughout sequence. The points consist edge pixels detected by Canny operator. accommodate rotation...

10.1109/76.809155 article EN IEEE Transactions on Circuits and Systems for Video Technology 1999-01-01

Locating facial region of a head-and-shoulders color image

OPENALEX - Publications

Douglas Chai King Ngi Ngan

This paper addresses our proposed method to automatically locate the person's face from a given image that consists of head-and-shoulders view person and complex background scene. The involves fast, simple yet robust algorithm exploits spatial distribution characteristics human skin color. It first uses chrominance component input detect pixels with color appearance. Then, bused on detected skin-color their corresponding luminance values, employs some regularization processes reinforce...

10.1109/afgr.1998.670936 article EN 2002-11-27

Recent advances in rate control for video coding

OPENALEX - Publications

Zhenzhong Chen King Ngi Ngan

10.1016/j.image.2006.11.002 article EN Signal Processing Image Communication 2006-11-30

Adaptive cosine transform coding of images in perceptual domain

OPENALEX - Publications

King Ngi Ngan Wai Yie Leong Harminder Singh

An adaptive cosine transform coding scheme for color images which incorporates human visual properties into the is described. It employs quantization to exploit statistical nature of coefficients and block distortion equalization reduce edge structures inherent in schemes. Results show that subjective quality reconstructed at a bit rate 0.4 bit/pixel or compression ratio 60:1 very good.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>

10.1109/29.46556 article EN IEEE Transactions on Acoustics Speech and Signal Processing 1989-01-01

An Efficient Frame-Content Based Intra Frame Rate Control for High Efficiency Video Coding

OPENALEX - Publications

Miaohui Wang King Ngi Ngan Hongliang Li

Rate control plays an important role in the rapid development of high-fidelity video services. As High Efficiency Video Coding (HEVC) standard has been finalized, many rate algorithms are being developed to promote its commercial use. The HEVC encoder adopts a new R-lambda based model reduce bit estimation error. However, fails consider frame-content complexity that ultimately degrades performance control. In this letter, gradient (GRL) is proposed for intra frame control, where can...

10.1109/lsp.2014.2377032 article EN IEEE Signal Processing Letters 2014-12-04

Low-Delay Rate Control for Consistent Quality Using Distortion-Based Lagrange Multiplier

OPENALEX - Publications

Miaohui Wang King Ngi Ngan Hongliang Li

Video quality fluctuation plays a significant role in human visual perception, and hence, many rate control approaches have been widely developed to maintain consistent for video communication. This paper presents novel framework based on the Lagrange multiplier high-efficiency coding. With assumption of constant control, new relationship between distortion is established. Based proposed model buffer status, we obtain computationally feasible solution problem minimizing variation across...

10.1109/tip.2016.2552646 article EN IEEE Transactions on Image Processing 2016-04-11

Fast HEVC Inter CU Decision Based on Latent SAD Estimation

OPENALEX - Publications

Jian Xiong Hongliang Li Fanman Meng Qingbo Wu King Ngi Ngan

The emerging high efficiency video coding (HEVC) standard has improved compression performance significantly in comparison with H.264/AVC. However, more intensive computational complexity been introduced by adopting a number of new tools. In this paper, fast inter CU decision is proposed based on the latent sum absolute differences (SAD) estimation. Firstly, two-layer motion estimation (ME) method designed to take advantage SAD cost. ME can obtain costs for both upper and its sub-CUs....

10.1109/tmm.2015.2491018 article EN IEEE Transactions on Multimedia 2015-10-14

A2RMNet: Adaptively Aspect Ratio Multi-Scale Network for Object Detection in Remote Sensing Images

OPENALEX - Publications

Heqian Qiu Hongliang Li Qingbo Wu Fanman Meng King Ngi Ngan and 1 more

Object detection is a significant and challenging problem in the study area of remote sensing image analysis. However, most existing methods are easy to miss or incorrectly locate objects due various sizes aspect ratios objects. In this paper, we propose novel end-to-end Adaptively Aspect Ratio Multi-Scale Network (A 2 RMNet) solve problem. On one hand, design multi-scale feature gate fusion network adaptively integrate features This composed modules, refine blocks region proposal networks....

10.3390/rs11131594 article EN cc-by Remote Sensing 2019-07-04

A Perceptually Weighted Rank Correlation Indicator for Objective Image Quality Assessment

OPENALEX - Publications

Qingbo Wu Hongliang Li Fanman Meng King Ngi Ngan

In the field of objective image quality assessment (IQA), Spearman's $\rho$ and Kendall's $\tau$ are two most popular rank correlation indicators, which straightforwardly assign uniform weight to all levels assume each pair images sortable. They successful for measuring average accuracy an IQA metric in ranking multiple processed images. However, important perceptual properties ignored by them as well. Firstly, sorting (SA) high usually more than poor ones many real world applications, where...

10.1109/tip.2018.2799331 article EN IEEE Transactions on Image Processing 2018-01-29

High-Quality R-CNN Object Detection Using Multi-Path Detection Calibration Network

OPENALEX - Publications

Xiaoyu Chen Hongliang Li Qingbo Wu King Ngi Ngan Linfeng Xu

Object proposals are used in two-stage detectors, such as R-CNN, to generate detection results, including category predictions and refined bounding-boxes. As a result, classification scores assigned bounding-boxes rather than object proposals. However, this procedure ignores the discrepancy of data distribution between We consider could limit accuracy. Specifically, foreground/background imbalance on inaccurate information from low-IoU hinder prediction. In paper, we propose detector called...

10.1109/tcsvt.2020.2987465 article EN IEEE Transactions on Circuits and Systems for Video Technology 2020-04-14

Coming Soon ...