Ngai‐Man Cheung

ORCID: 0000-0003-0135-3791
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Image and Video Retrieval Techniques
  • Video Coding and Compression Technologies
  • Generative Adversarial Networks and Image Synthesis
  • Video Surveillance and Tracking Methods
  • Domain Adaptation and Few-Shot Learning
  • Advanced Data Compression Techniques
  • Anomaly Detection Techniques and Applications
  • Image and Video Quality Assessment
  • Multimodal Machine Learning Applications
  • Advanced Vision and Imaging
  • Advanced Image Processing Techniques
  • Image Retrieval and Classification Techniques
  • Digital Media Forensic Detection
  • Adversarial Robustness in Machine Learning
  • Advanced Neural Network Applications
  • Robotics and Sensor-Based Localization
  • Multimedia Communication and Technology
  • Network Security and Intrusion Detection
  • Image Enhancement Techniques
  • Advanced Graph Neural Networks
  • Human Pose and Action Recognition
  • Video Analysis and Summarization
  • Sparse and Compressive Sensing Techniques
  • Wireless Communication Security Techniques
  • Image and Signal Denoising Methods

Singapore University of Technology and Design
2015-2024

Istituto Tecnico Industriale Alessandro Volta
2021

Weatherford College
2021

University of Dayton
2020

The University of Adelaide
2017-2019

Trường ĐH Nguyễn Tất Thành
2019

AT4 wireless (Spain)
2014

Mitsubishi Electric (United States)
2012

Stanford University
2010-2011

University of Southern California
2005-2011

The increasing application of Artificial Intelligence (AI) in health and medicine has attracted a great deal research interest recent decades. This study aims to provide global historical picture concerning AI medicine. A total 27,451 papers that were published between 1977 2018 (84.6% dated 2008⁻2018) retrieved from the Web Science platform. descriptive analysis examined publication volume, authors countries collaboration. network authors' keywords content related scientific literature...

10.3390/jcm8030360 article EN Journal of Clinical Medicine 2019-03-14

Mobile phones have evolved into powerful image and video processing devices equipped with high-resolution cameras, color displays, hardware-accelerated graphics. They are also increasingly a global positioning system connected to broadband wireless networks. All this enables new class of applications that use the camera phone initiate search queries about objects in visual proximity user (Figure 1). Such can be used, e.g., for identifying products, comparison shopping, finding information...

10.1109/msp.2011.940881 article EN IEEE Signal Processing Magazine 2011-06-20

Recent successes in Generative Adversarial Networks (GAN) have affirmed the importance of using more data GAN training. Yet it is expensive to collect many domains such as medical applications. Data Augmentation (DA) has been applied these In this work, we first argue that classical DA approach could mislead generator learn distribution augmented data, which be different from original data. We then propose a principled framework, termed Optimized for (DAG), enable use training improve...

10.1109/tip.2021.3049346 article EN IEEE Transactions on Image Processing 2021-01-01

In recent years Deep Neural Networks (DNNs) have been rapidly developed in various applications, together with increasingly complex architectures. The performance gain of these DNNs generally comes high computational costs and large memory consumption, which may not be affordable for mobile platforms. model quantization can used reducing the computation DNNs, deploying on equipment. this work, we propose an optimization framework deep quantization. First, a measurement to estimate effect...

10.1609/aaai.v32i1.11623 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-29

We survey popular data sets used in computer vision literature and point out their limitations for mobile visual search applications. To overcome many of the limitations, we propose Stanford Mobile Visual Search set. The set contains camera-phone images products, CDs, books, outdoor landmarks, business cards, text documents, museum paintings video clips. has several key characteristics lacking existing sets: rigid objects, widely varying lighting conditions, perspective distortion,...

10.1145/1943552.1943568 article EN 2011-02-15

We propose DGG: Deep clustering via a Gaussian-mixture variational autoencoder (VAE) with Graph embedding. To facilitate clustering, we apply Gaussian mixture model (GMM) as the prior in VAE. handle data complex spread, graph Our idea is that information which captures local structures an excellent complement to deep GMM. Combining them facilitates network learn powerful representations follow global and structural constraints. Therefore, our method unifies model-based similarity-based...

10.1109/iccv.2019.00654 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Nowadays, the occurrence of skin cancer cases has grown worldwide due to extended exposure harmful radiation from Sun. Most common approach detect malignancy moles is by visual inspection performed an expert dermatologist, using a set specific clinical rules. Computer-aided diagnosis, based on mole imaging, another concurrent method which experienced major advancements improvement imaging sensors and processing power. However, these schemes use hand-crafted features are difficult tune...

10.1109/icip.2016.7532834 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2016-09-01

Recently, the introduction of generative adversarial network (GAN) and its variants has enabled generation realistic synthetic samples, which been used for enlarging training sets. Previous work primarily focused on data augmentation semi-supervised supervised tasks. In this paper, we instead focus unsupervised anomaly detection propose a novel framework optimized task. By using GAN variant known as autoencoder (AAE), impose distribution latent space dataset systematically sample to generate...

10.1109/icdm.2018.00146 article EN 2021 IEEE International Conference on Data Mining (ICDM) 2018-11-01

Image blur and image noise are common distortions during acquisition. In this paper, we systematically study the effect of on deep neural network (DNN) classifiers. First, examine DNN classifier performance under four types distortions. Second, propose two approaches to alleviate distortion: re-training fine-tuning with noisy images. Our results suggest that, certain conditions, images can much due distorted inputs, is more practical than re-training.

10.1109/icassp.2017.7952349 article EN 2017-03-01

High-frame-rate (HFR) video is emerging in popular gaming applications to enhance the smooth experience perceived by end users. However, it challenging guarantee delivery quality of HFR mobile cloud scenarios because high transmission rate and limited wireless resources. To address this critical problem, we develop a novel scheduling framework dubbed AdaPtive vIdeo Streaming (APHIS). The term adaptive indicates scheme's capability dynamically adjusting traffic load forward error correction...

10.1109/tcsvt.2015.2441412 article EN IEEE Transactions on Circuits and Systems for Video Technology 2015-06-04

We address the vehicle detection and classification problems using Deep Neural Networks (DNNs) approaches. Here we answer to questions that are specific our application including how utilize DNN for detection, what features useful classification, extend a model trained on limited size dataset, cases of extreme lighting condition. Answering these propose approach outperforms state-of-the-art methods, achieves promising results image with conditions.

10.1109/icdsp.2016.7868561 article EN 2016-10-01

Few shot image classification aims at learning a classifier from limited labeled data. Generating the weights has been applied in many meta-learning methods for few due to its simplicity and effectiveness. In this work, we present Attentive Weights Generation via Information Maximization (AWGIM), which introduces two novel contributions: i) Mutual information maximization between generated data within task; enables retain of task specific query sample. ii) Self-attention cross-attention...

10.1109/cvpr42600.2020.01351 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

This paper presents a novel randomized algorithm for robust point cloud registration without correspondences. Most existing approaches require set of putative correspondences obtained by extracting invariant descriptors. However, such descriptors could become unreliable in noisy and contaminated settings. In these settings, methods that directly handle input sets are preferable. Without correspondences, however, conventional techniques very large number samples order to reach satisfactory...

10.1109/cvpr.2019.00021 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Person Re-Identification (ReID) requires comparing two images of person captured under different conditions. Existing work based on neural networks often computes the similarity feature maps from one single convolutional layer. In this work, we propose an efficient, end-to-end fully Siamese network that similarities at multiple levels. We demonstrate multi-level can improve accuracy considerably using low-complexity structures in ReID problem. Specifically, first, use several layers to...

10.1109/cvpr.2018.00248 article EN 2018-06-01

Modern GANs excel at generating high quality and diverse images. However, when transferring the pretrained on small target data (e.g., 10-shot), generator tends to replicate training samples. Several methods have been proposed address this few-shot image generation task, but there is a lack of effort analyze them under unified framework. As our first contribution, we propose framework existing during adaptation. Our analysis discovers that while some disproportionate focus diversity...

10.1109/cvpr52688.2022.00893 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

While much of multiview video coding focuses on the rate-distortion performance compressing all frames views for storage or non-interactive delivery over networks, we address problem designing a frame structure to enable interactive streaming, where clients can interactively switch during playback. Thus, as client is playing back successive (in time) given view, it send request server different view while continuing uninterrupted temporal Noting that standard tools random access (i.e.,...

10.1109/tip.2010.2070074 article EN IEEE Transactions on Image Processing 2010-08-27

In this paper, we propose a multimodal multi-stream deep learning framework to tackle the egocentric activity recognition problem, using both video and sensor data. First, experiment extend Convolutional Neural Network learn spatial temporal features from videos. Second, multistream Long Short-Term Memory architecture multiple streams (accelerometer, gyroscope, etc.). Third, use two-level fusion technique different pooling techniques compute prediction results. Experimental results dataset...

10.1109/cvprw.2016.54 article EN 2016-06-01

Delivering high-quality mobile video with the limited radio resources is challenging due to time-varying channel status and stringent Quality of Service (QoS) requirements. Multi-homing support enables terminals establish multiple simultaneous associations for enhancing transmission performance. In this paper, we study multi-homed communication delay-constrained High Definition (HD) in heterogeneous wireless networks. The low-delay encoded HD streaming consists exclusively Intra (I)...

10.1109/tmc.2015.2426710 article EN IEEE Transactions on Mobile Computing 2015-04-27

We investigate the design of an entire mobile imaging system for early detection melanoma. Different from previous work, we focus on smartphone-captured visible light images. Our addresses two major challenges. First, images acquired using a smartphone under loosely-controlled environmental conditions may be subject to various distortions, and this makes melanoma more difficult. Second, processing performed is stringent computation memory constraints. In our propose that optimized run...

10.1109/tmm.2018.2814346 article EN IEEE Transactions on Multimedia 2018-03-15

While Geerative Adversarial Networks (GANs) are fundamental to many generative modelling applications, they suffer from numerous issues. In this work, we propose a principled framework simultaneously mitigate two issues in GANs: catastrophic forgetting of the discriminator and mode collapse generator. We achieve by employing for GANs contrastive learning mutual information maximization approach, perform extensive analyses understand sources improvements. Our approach significantly stabilizes...

10.1109/wacv48630.2021.00399 article EN 2021-01-01

CNN-based generative modelling has evolved to produce synthetic images indistinguishable from real in the RGB pixel space. Recent works have observed that CNN-generated share a systematic shortcoming replicating high frequency Fourier spectrum decay attributes. Furthermore, these successfully exploited this detect reporting up 99% accuracy across multiple state-of-the-art GAN models.In work, we investigate validity of assertions claiming are unable achieve spectral consistency. We...

10.1109/cvpr46437.2021.00712 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Cross-domain few-shot classification task (CD-FSC) combines with the requirement to generalize across domains represented by datasets. This setup faces challenges originating from limited labeled data in each class and, additionally, domain shift between training and test sets. In this paper, we introduce a novel approach for existing FSC models. It leverages on explanation scores, obtained methods when applied predictions of models, computed intermediate feature maps Firstly, tailor...

10.1109/icpr48806.2021.9412941 article EN 2022 26th International Conference on Pattern Recognition (ICPR) 2021-01-10

Model inversion (MI) attacks aim to infer and reconstruct private training data by abusing access a model. MI have raised concerns about the leaking of sen-sitive information (e.g. face images used in recognition system). Recently, several algorithms for been proposed improve attack performance. In this work, we revisit MI, study two fundamental issues pertaining all state-of-the-art (SOTA) algorithms, propose solutions these which lead significant boost performance SOTA MI. particular, our...

10.1109/cvpr52729.2023.01572 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01
Coming Soon ...