NFDI4DS | UHH-SEMS - Publication Details

Lai-Man Po

ORCID: 0000-0002-5185-1492

A5038133707

Research Areas

Advanced Vision and Imaging
Video Coding and Compression Technologies
Advanced Image Processing Techniques
Image and Signal Denoising Methods
Advanced Data Compression Techniques
Image and Video Quality Assessment
Image Retrieval and Classification Techniques
Advanced Image and Video Retrieval Techniques
Image Enhancement Techniques
Advanced Image Fusion Techniques
Face recognition and analysis
Biometric Identification and Security
Video Analysis and Summarization
Digital Filter Design and Implementation
Advanced Neural Network Applications
Generative Adversarial Networks and Image Synthesis
Domain Adaptation and Few-Shot Learning
Digital Media Forensic Detection
Human Pose and Action Recognition
Face and Expression Recognition
Non-Invasive Vital Sign Monitoring
Computer Graphics and Visualization Techniques
Remote-Sensing Image Classification
Visual Attention and Saliency Detection
Speech and Audio Processing

City University of Hong Kong
2015-2024

Ben-Gurion University of the Negev
2020

ETH Zurich
2019

University of Hong Kong
2000-2008

South China University of Technology
2005

Hong Kong Chu Hai College
2005

Hong Kong Polytechnic University
1991-1994

A novel four-step search algorithm for fast block motion estimation

OPENALEX - Publications

Lai-Man Po, Wing-Chung Ma

Based on the real world image sequence's characteristic of center-biased motion vector distribution, a new four-step search (4SS) algorithm with a center-biased checking point pattern for fast block motion estimation is proposed in this paper. A halfway-stop technique is employed in the new algorithm with searching steps of 2 to 4, and the total number of checking points is varied from 17 to 27. Simulation results show that the proposed 4SS performs better than the well-known three-step search and has similar performance to the new three-step search (N3SS) in terms of motion compensation errors. In addition, the 4SS also reduces the worst-case...

10.1109/76.499840 article EN IEEE Transactions on Circuits and Systems for Video Technology 1996-06-01
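
The search flow outlined in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes a nine-point pattern with spacing 2 for the first three steps and a final 3x3 pattern with spacing 1, which the truncated abstract does not spell out. The function name `four_step_search` and the `cost(dx, dy)` callable (standing in for a block distortion measure such as SAD) are hypothetical.

```python
def four_step_search(cost):
    """Return (best_vector, points_checked) for a cost(dx, dy) callable.

    Assumed pattern (not taken from the abstract): up to three steps on a
    nine-point grid with spacing 2, then one final 3x3 step with spacing 1.
    """
    cache = {}  # every displacement evaluated so far -> its cost

    def evaluate(point):
        if point not in cache:
            cache[point] = cost(*point)
        return cache[point]

    def best_around(center, step):
        # Ties favour the centre, so the halfway stop triggers early.
        best, best_cost = center, evaluate(center)
        for dy in (-step, 0, step):
            for dx in (-step, 0, step):
                point = (center[0] + dx, center[1] + dy)
                value = evaluate(point)
                if value < best_cost:
                    best, best_cost = point, value
        return best

    center = (0, 0)
    for _ in range(3):
        best = best_around(center, 2)
        if best == center:
            break  # halfway stop: minimum is at the centre
        center = best
    return best_around(center, 1), len(cache)


# Toy distortion surface with its minimum at displacement (3, -2).
vector, checked = four_step_search(lambda dx, dy: (dx - 3) ** 2 + (dy + 2) ** 2)
print(vector, checked)
```

Because already-evaluated points are cached, the number of distinct checking points in this sketch stays within the 17 to 27 range quoted in the abstract.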

Integration of image quality and motion cues for face anti-spoofing: A neural network approach

Litong Feng, Lai-Man Po, Yuming Li, Xuyuan Xu, Yuan Fang, and 2 more

10.1016/j.jvcir.2016.03.019 article EN Journal of Visual Communication and Image Representation 2016-04-02

NTIRE 2020 Challenge on Spectral Reconstruction from an RGB Image

Boaz Arad, Radu Timofte, Ohad Ben‐Shahar, Yi‐Tun Lin, Graham D. Finlayson, and 43 more

This paper reviews the second challenge on spectral reconstruction from RGB images, i.e., the recovery of whole-scene hyperspectral (HS) information from a 3-channel RGB image. As in the previous challenge, two tracks were provided: (i) a "Clean" track where HS images are estimated from noise-free RGBs, the RGB images themselves being calculated numerically using the ground-truth HS images and supplied spectral sensitivity functions; (ii) a "Real World" track, simulating capture by an uncalibrated and unknown camera, where the HS images are recovered from noisy JPEG-compressed RGB images. A new,...

10.1109/cvprw50498.2020.00231 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020-06-01

Large Separable Kernel Attention: Rethinking the Large Kernel Attention design in CNN

Kin Wai Lau, Lai-Man Po, Yasar Abbas Ur Rehman

10.1016/j.eswa.2023.121352 article EN Expert Systems with Applications 2023-09-01

A novel cross-diamond search algorithm for fast block motion estimation

Terence Cheung, Lai-Man Po

In block motion estimation, search patterns with different shapes or sizes and the center-biased characteristics of the motion-vector distribution have a large impact on the searching speed and quality of performance. We propose a novel algorithm using a cross-search pattern as the initial step and large/small diamond search (DS) patterns as the subsequent steps for fast block motion estimation. The initial cross-search pattern is designed to fit the cross-center-biased motion vector distribution characteristics of the real-world sequences by evaluating the nine relatively higher probable candidates located horizontally and vertically...

10.1109/tcsvt.2002.806815 article EN IEEE Transactions on Circuits and Systems for Video Technology 2002-12-01

Motion-Resistant Remote Imaging Photoplethysmography Based on the Optical Properties of Skin

Litong Feng, Lai-Man Po, Xuyuan Xu, Yuming Li, Ruiyi Ma

Remote imaging photoplethysmography (RIPPG) can achieve contactless monitoring of human vital signs. However, the robustness to a subject's motion is a challenging problem for RIPPG, especially in facial video-based RIPPG. The RIPPG signal originates from the radiant intensity variation of human skin with pulses of blood, and motions can modulate the radiant intensity of the skin. Based on the optical properties of human skin, we build an optical RIPPG signal model in which the origins of the RIPPG signal and motion artifacts can be clearly described. The region of interest (ROI) of the skin is regarded as a Lambertian radiator and the effect of ROI...

10.1109/tcsvt.2014.2364415 article EN IEEE Transactions on Circuits and Systems for Video Technology 2014-10-22

Novel cross-diamond-hexagonal search algorithms for fast block motion estimation

Terence Cheung, Lai-Man Po

We propose two cross-diamond-hexagonal search (CDHS) algorithms, which differ from each other by their sizes of hexagonal search patterns. These algorithms basically employ two cross-shaped search patterns consecutively in the very beginning steps and switch to using diamond-shaped patterns. To further reduce the checking points, two pairs of hexagonal search patterns are proposed in conjunction with candidates found located at diamond corners. Experimental results show that the proposed CDHSs perform faster than the diamond search (DS) by about 144% and the cross-diamond search (CDS) by about 73%, whereas similar...

10.1109/tmm.2004.840609 article EN IEEE Transactions on Multimedia 2005-01-24

Cuffless Blood Pressure Estimation Based on Photoplethysmography Signal and Its Second Derivative

Mengyang Liu, Lai-Man Po, Hong Fu

In personal healthcare, blood pressure (BP) is an important vital sign to be monitored frequently. However, traditional BP measurement devices require a cuff's inflation and deflation, which are very uncomfortable for many users. Cuffless and noninvasive BP estimation methods are attractive, especially those using the Photoplethysmography (PPG) approach for achieving continuous BP monitoring with minimal inconvenience to the user. From recent studies on the second derivative of PPG (SDPPG) for vascular aging, SDPPG contains information...

10.7763/ijcte.2017.v9.1138 article EN International Journal of Computer Theory and Engineering 2017-01-01

Hierarchical Regression Network for Spectral Reconstruction from RGB Images

Yuzhi Zhao, Lai-Man Po, Qiong Yan, Wei Liu, Ting-Yu Lin

Capturing visual images with a hyperspectral camera has been successfully applied to many areas due to its narrowband imaging technology. Hyperspectral reconstruction from RGB images denotes a reverse process of hyperspectral imaging by discovering an inverse response function. Current works mainly map RGB images directly to the corresponding spectrum but do not consider context information explicitly. Moreover, the use of an encoder-decoder pair in current algorithms leads to loss of information. To address these problems, we propose a 4-level...

10.1109/cvprw50498.2020.00219 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020-06-01

No-Reference Video Quality Assessment With 3D Shearlet Transform and Convolutional Neural Networks

Yuming Li, Lai-Man Po, Terence Cheung, Xuyuan Xu, Litong Feng, and 2 more

In this paper, we propose an efficient general-purpose no-reference (NR) video quality assessment (VQA) framework that is based on the 3D shearlet transform and a convolutional neural network (CNN). Taking video blocks as input, simple and efficient primary spatiotemporal features are extracted by the 3D shearlet transform, which are capable of capturing natural scene statistics properties. Then, a CNN and logistic regression are concatenated to exaggerate the discriminative parts of the primary features and predict a perceptual quality score. The resulting algorithm, which we name...

10.1109/tcsvt.2015.2430711 article EN IEEE Transactions on Circuits and Systems for Video Technology 2015-05-06

LiveNet: Improving features generalization for face liveness detection using convolution neural networks

Yasar Abbas Ur Rehman, Lai-Man Po, Mengyang Liu

10.1016/j.eswa.2018.05.004 article EN Expert Systems with Applications 2018-05-08

Normalized partial distortion search algorithm for block motion estimation

Chok-Kwan Cheung, Lai-Man Po

Many fast block-matching algorithms reduce computations by limiting the number of checking points. They can achieve high computation reduction, but often result in relatively higher matching error compared with the full-search algorithm. A novel fast block-matching algorithm named normalized partial distortion search is proposed. The proposed algorithm reduces computations by using a halfway-stop technique in the calculation of the block distortion measure. In order to increase the probability of early rejection of non-possible candidate motion vectors, accumulated and...

10.1109/76.836286 article EN IEEE Transactions on Circuits and Systems for Video Technology 2000-04-01
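
As a rough illustration of the halfway-stop idea in this abstract, the sketch below accumulates a sum of absolute differences (SAD) in partial groups and rejects a candidate as soon as the normalized partial sum exceeds the current minimum. It is an assumption-laden sketch, not the paper's method: the function name `sad_with_normalized_stop` is hypothetical, the groups are consecutive pixel runs rather than any particular sampling pattern, and the block size is assumed to be divisible by `groups`.

```python
def sad_with_normalized_stop(block, candidate, current_min, groups=16):
    """Return the full SAD, or None if the candidate is rejected early.

    block and candidate are equally sized 2D lists of pixel values.
    Assumption: the pixel count is divisible by `groups`.
    """
    a = [value for row in block for value in row]
    b = [value for row in candidate for value in row]
    size = len(a) // groups  # pixels contributing to each partial distortion
    accumulated = 0
    for p in range(1, groups + 1):
        start = (p - 1) * size
        accumulated += sum(
            abs(x - y) for x, y in zip(a[start:start + size], b[start:start + size])
        )
        # Normalized comparison: after p of `groups` partial sums, compare
        # against the fraction p/groups of the current minimum distortion.
        if accumulated * groups > current_min * p:
            return None
    return accumulated


reference = [[0] * 4 for _ in range(4)]
far_candidate = [[10] * 4 for _ in range(4)]
print(sad_with_normalized_stop(reference, far_candidate, current_min=100, groups=4))
```

In this toy case the candidate is dropped after the first group, whereas an unnormalized comparison against `current_min` would need three groups. The price of the normalized test is that it may occasionally reject a candidate whose full SAD would have been the minimum.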

Enhanced Hexagonal Search for Fast Block Motion Estimation

Ce Zhu, Xiao Lin, Lap‐Pui Chau, Lai-Man Po

Fast block motion estimation normally consists of a low-resolution coarse search and the following fine-resolution inner search. Most fast block motion estimation algorithms developed attempt to speed up the coarse search without considering accelerating the focused inner search. On top of the hexagonal search method recently developed, an enhanced hexagonal search algorithm is proposed to further improve the performance in terms of reducing the number of search points and distortion, where a novel fast inner search is employed by exploiting the distortion information of the evaluated points. Our experimental results substantially justify...

10.1109/tcsvt.2004.833166 article EN IEEE Transactions on Circuits and Systems for Video Technology 2004-09-28

Edge-Based Structural Similarity for Image Quality Assessment

Guan-hao Chen, Chunling Yang, Lai-Man Po, Shengli Xie

Objective quality assessment has been widely used in image processing for decades, and many researchers have been studying objective quality assessment methods based on the Human Visual System (HVS). Recently the Structural Similarity (SSIM) is proposed, under the assumption that the HVS is highly adapted for extracting structural information from a scene, and simulation results have proved that it is better than PSNR (or MSE). By deeply studying the SSIM, we find it fails on measuring badly blurred images. Based on this, we develop an improved method which is called Edge-based Structural Similarity (ESSIM)....

10.1109/icassp.2006.1660497 article EN 2006-08-02

No-reference image quality assessment with shearlet transform and deep neural networks

Yuming Li, Lai-Man Po, Xuyuan Xu, Litong Feng, Yuan Fang, and 2 more

10.1016/j.neucom.2014.12.015 article EN Neurocomputing 2014-12-15

No-reference image quality assessment with deep convolutional neural networks

Yuming Li, Lai-Man Po, Litong Feng, Yuan Fang

The state-of-the-art general-purpose no-reference image or video quality assessment (NR-I/VQA) algorithms usually rely on elaborated hand-crafted features which capture the Natural Scene Statistics (NSS) properties. However, designing these features is usually not an easy problem. In this paper, we describe a novel general-purpose NR-IQA framework which is based on deep Convolutional Neural Networks (CNN). Directly taking a raw image as input and outputting the image quality score, this new framework integrates the feature learning and regression into one optimization process,...

10.1109/icdsp.2016.7868646 article EN 2016-10-01

SCGAN: Saliency Map-Guided Colorization With Generative Adversarial Network

Yuzhi Zhao, Lai-Man Po, William K. Cheung, Wing-Yin Yu, Yasar Abbas Ur Rehman

Given a grayscale photograph, the colorization system estimates a visually plausible colorful image. Conventional methods often use semantics to colorize grayscale images. However, in these methods, only classification semantic information is embedded, resulting in semantic confusion and color bleeding in the final colorized image. To address these issues, we propose a fully automatic Saliency Map-guided Colorization with Generative Adversarial Network (SCGAN) framework. It jointly predicts the colorization and saliency map to minimize semantic confusion and color bleeding in the colorized image. Since the global features...

10.1109/tcsvt.2020.3037688 article EN IEEE Transactions on Circuits and Systems for Video Technology 2020-11-12

A Novel Patch Variance Biased Convolutional Neural Network for No-Reference Image Quality Assessment

Lai-Man Po, Mengyang Liu, Wilson Y. F. Yuen, Yuming Li, Xuyuan Xu, and 4 more

Deep convolutional neural networks (CNNs) have been successfully applied on no-reference image quality assessment (NR-IQA) with respect to human perception. Most of these methods deal with small image patches and use the average score of the test patches for predicting the whole image quality. We discovered that image patches from homogenous regions are unreliable for both network training and final quality score estimation. In addition, image patches with complex structures have much higher chances of achieving better image quality prediction. Based on these findings, we enhanced the conventional CNN-based NR-IQA...

10.1109/tcsvt.2019.2891159 article EN IEEE Transactions on Circuits and Systems for Video Technology 2019-01-09

AIM 2019 Challenge on RAW to RGB Mapping: Methods and Results

Andrey Ignatov, Radu Timofte, Sung-Jea Ko, Seungwook Kim, Kwang-Hyun Uhm, and 29 more

This paper reviews the first AIM challenge on mapping camera RAW to RGB images with the focus on proposed solutions and results. The participating teams were solving a real-world photo enhancement problem, where the goal was to map the original low-quality RAW images from the Huawei P20 device to the same photos captured with the Canon 5D DSLR camera. The considered problem embraced a number of computer vision subtasks, such as image demosaicing, denoising, gamma correction, image resolution and sharpness enhancement, etc. The target metric used in this...

10.1109/iccvw.2019.00443 article EN 2019-10-01

CSRNet: Cascaded Selective Resolution Network for real-time semantic segmentation

Jingjing Xiong, Lai-Man Po, Wing-Yin Yu, Chang Zhou, Pengfei Xian, and 1 more

10.1016/j.eswa.2022.118537 article EN Expert Systems with Applications 2022-08-17

VCGAN: Video Colorization With Hybrid Generative Adversarial Network

Yuzhi Zhao, Lai-Man Po, Wing-Yin Yu, Yasar Abbas Ur Rehman, Mengyang Liu, and 2 more

We propose a hybrid recurrent Video Colorization with Hybrid Generative Adversarial Network (VCGAN), an improved approach to video colorization using end-to-end learning. The VCGAN addresses two prevalent issues in the video colorization domain: temporal consistency and the unification of the colorization network and refinement network into a single architecture. To enhance colorization quality and spatiotemporal consistency, the mainstream of the generator in VCGAN is assisted by two additional networks, i.e., a global feature extractor and a placeholder feature extractor, respectively. The global feature extractor encodes...

10.1109/tmm.2022.3154600 article EN IEEE Transactions on Multimedia 2022-02-25

RMP-adapter: A region-based Multiple Prompt Adapter for multi-concept customization in text-to-image diffusion model

Zeyu Jiang, Lai-Man Po, Xuyuan Xu, Yexin Wang, Haoxuan Wu, and 2 more

10.1016/j.eswa.2025.126936 article EN Expert Systems with Applications 2025-02-01

Cofflow: Controllable Flow Field for High-Fidelity Virtual Try-On Using Diffusion Models

Kun Li, Lai-Man Po, Wenhao Yu, Yu Xue, Haoxuan Wu, and 4 more

10.2139/ssrn.5187559 preprint EN 2025-01-01

Adaptive motion tracking block matching algorithms for video coding

Jie-Bin Xu, Lai-Man Po, Chok-Kwan Cheung

In most block-based video coding systems, the fast block matching algorithms (BMAs) use the origin as the initial search center, which may not track the motion very well. To improve the accuracy of the fast BMAs, a new adaptive motion tracking search algorithm is proposed. Based on the spatial correlation of motion blocks, a predicted starting search point, which reflects the motion trend of the current block, is adaptively chosen. This predicted search center is found closer to the global minimum, and thus the center-biased BMAs can be used to find the motion vector more efficiently. Experimental results show that...

10.1109/76.795056 article EN IEEE Transactions on Circuits and Systems for Video Technology 1999-01-01

Adjustable partial distortion search algorithm for fast block motion estimation

Terence Cheung, Lai-Man Po

Quality control for video coding is usually absent from many traditional fast block motion estimators. A novel block-matching algorithm for fast block motion estimation named the adjustable partial distortion search (APDS) is proposed. It is a new normalized partial distortion comparison method capable of adjusting the prediction accuracy against searching speed by a quality factor k. With this adjustability, APDS could act as the normalized partial distortion search (NPDS) when k is equal to 0, and as the conventional partial distortion search (PDS) when k is equal to 1. In addition, it uses a halfway-stop technique with progressive partial distortions...

10.1109/tcsvt.2002.808091 article EN IEEE Transactions on Circuits and Systems for Video Technology 2003-01-01
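
The role of the factor k described in this abstract can be illustrated with a small rejection test. The linear blend below is an assumption chosen only because it reproduces the two endpoints the abstract states (normalized comparison at k = 0, conventional comparison at k = 1); it is not confirmed to be the paper's exact rule, and the function name `apds_rejects` is hypothetical.

```python
def apds_rejects(partial, p, current_min, k, groups=16):
    """Decide whether to drop a candidate after p of `groups` partial sums.

    partial      accumulated partial distortion so far
    current_min  smallest full distortion found so far
    k            adjustment factor in [0, 1]
    Assumed threshold: a linear blend between p/groups of current_min (k = 0)
    and the whole of current_min (k = 1).
    """
    threshold = (p + k * (groups - p)) * current_min
    return groups * partial > threshold


# One partial sum of 40 out of 4 groups, current minimum 100:
print(apds_rejects(40, 1, 100, k=0, groups=4))    # normalized comparison
print(apds_rejects(40, 1, 100, k=1, groups=4))    # conventional comparison
print(apds_rejects(40, 1, 100, k=0.5, groups=4))  # in between
```

Under this assumed blend, a smaller k rejects candidates sooner and so favours speed, while a larger k keeps candidates alive longer and favours prediction accuracy.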