- 3D Shape Modeling and Analysis
- Domain Adaptation and Few-Shot Learning
- Human Pose and Action Recognition
- Computer Graphics and Visualization Techniques
- Advanced Vision and Imaging
- Advanced Neural Network Applications
- 3D Surveying and Cultural Heritage
- Multimodal Machine Learning Applications
- Advanced Image and Video Retrieval Techniques
- Video Surveillance and Tracking Methods
- Robotics and Sensor-Based Localization
- Robot Manipulation and Learning
- Face and Expression Recognition
- Advanced Image Processing Techniques
- Advanced Numerical Analysis Techniques
- COVID-19 Diagnosis Using AI
- Face Recognition and Analysis
- Anomaly Detection Techniques and Applications
- Gait Recognition and Analysis
- Sparse and Compressive Sensing Techniques
- Generative Adversarial Networks and Image Synthesis
- Image and Signal Denoising Methods
- Adversarial Robustness in Machine Learning
- Remote Sensing and LiDAR Applications
- Diabetic Foot Ulcer Assessment and Management
First Affiliated Hospital of GuangXi Medical University (2024-2025)
Guangxi Medical University (2024-2025)
South China University of Technology (2016-2025)
Chinese University of Hong Kong, Shenzhen (2008-2025)
Peng Cheng Laboratory (2020-2023)
University of Macau (2014-2016)
University of Hong Kong (2008-2015)
Advanced Digital Sciences Center (2012-2015)
Shandong Institute of Automation (2014)
Chinese Academy of Sciences (2008-2012)
Single image haze removal is a challenging ill-posed problem. Existing methods use various constraints/priors to obtain plausible dehazing solutions. The key to achieving haze removal is to estimate a medium transmission map for an input hazy image. In this paper, we propose a trainable end-to-end system called DehazeNet for medium transmission estimation. DehazeNet takes a hazy image as input and outputs its medium transmission map, which is subsequently used to recover the haze-free image via the atmospheric scattering model. DehazeNet adopts a Convolutional Neural Network (CNN) based deep architecture, whose...
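For context on the atmospheric scattering model referenced above, here is a minimal NumPy sketch of recovering the scene radiance J from a hazy image I, a transmission map t, and atmospheric light A. The function name, variable layout, and the clipping threshold are illustrative assumptions, not DehazeNet's exact procedure.

```python
import numpy as np

def recover_haze_free(I, t, A, t_min=0.1):
    """Invert the atmospheric scattering model I = J * t + A * (1 - t).

    I: hazy image, H x W x 3, float in [0, 1]
    t: estimated medium transmission map, H x W, float in (0, 1]
    A: global atmospheric light, length-3 vector
    t_min: lower bound on transmission to avoid amplifying noise (assumed value)
    """
    t = np.clip(t, t_min, 1.0)[..., None]   # broadcast over color channels
    J = (I - A) / t + A                     # scene radiance estimate
    return np.clip(J, 0.0, 1.0)
```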
In this paper, we propose a very simple deep learning network for image classification that is based on basic data processing components: 1) cascaded principal component analysis (PCA); 2) binary hashing; and 3) blockwise histograms. In the proposed architecture, PCA is employed to learn multistage filter banks, followed by binary hashing and blockwise histograms for indexing and pooling. The resulting architecture, called PCANet, can thus be designed and learned extremely easily and efficiently. For comparison and to provide a better...
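A minimal sketch of how one stage of PCA filters could be learned from local image patches, in the spirit of the cascaded-PCA component described above. The patch size, filter count, and non-overlapping patch sampling are simplifying assumptions, not the paper's settings.

```python
import numpy as np

def learn_pca_filters(images, patch_size=7, num_filters=8):
    """Learn one stage of PCA filters as the top principal components
    of mean-removed local patches (non-overlapping patches for brevity).

    images: list of H x W grayscale arrays
    returns: num_filters filters of shape patch_size x patch_size
    """
    patches = []
    for img in images:
        H, W = img.shape
        for i in range(0, H - patch_size + 1, patch_size):
            for j in range(0, W - patch_size + 1, patch_size):
                p = img[i:i + patch_size, j:j + patch_size].ravel()
                patches.append(p - p.mean())          # remove patch mean
    X = np.stack(patches, axis=1)                     # d x N matrix of patches
    # Principal components of the patch scatter matrix give the filter bank.
    eigvals, eigvecs = np.linalg.eigh(X @ X.T)
    top = eigvecs[:, np.argsort(eigvals)[::-1][:num_filters]]
    return top.T.reshape(num_filters, patch_size, patch_size)
```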
Human actions in video sequences are three-dimensional (3D) spatio-temporal signals characterizing both the visual appearance and motion dynamics of the involved humans and objects. Inspired by the success of convolutional neural networks (CNN) for image classification, recent attempts have been made to learn 3D CNNs for recognizing human actions in videos. However, partly due to the high complexity of training 3D convolution kernels and the need for large quantities of training videos, only limited success has been reported. This triggered us to investigate in this paper a...
In this work, we propose a novel method termed Frustum ConvNet (F-ConvNet) for amodal 3D object detection from point clouds. Given 2D region proposals in an RGB image, our method first generates a sequence of frustums for each region proposal and uses the obtained frustums to group local points. F-ConvNet aggregates point-wise features as frustum-level feature vectors and arrays these feature vectors as a feature map for use by its subsequent component of a fully convolutional network (FCN), which spatially fuses frustum-level features and supports end-to-end, continuous estimation...
Unsupervised domain adaptation aims to learn a model or classifier for unlabeled samples on a target domain, given training data of labeled samples on a source domain. Impressive progress has been made recently by learning invariant features via domain-adversarial training of deep networks. In spite of this progress, such methods are still limited in achieving invariance of feature distributions at a finer category level. To this end, we propose in this paper a new method called Domain-Symmetric Networks (SymNets). The proposed SymNet is based on a symmetric...
This paper proposes a joint multi-task learning algorithm to better predict attributes in images using deep convolutional neural networks (CNN). We consider learning binary semantic attributes through a multi-task CNN model, where each task will predict one attribute. The multi-task learning allows CNN models to simultaneously share visual knowledge among different attribute categories. Each CNN generates attribute-specific feature representations, and we then apply multi-task learning on these features to predict their attributes. In our framework, we propose a method to decompose the overall model's parameters...
Unsupervised domain adaptation (UDA) is to make predictions for unlabeled data on a target domain, given labeled data on a source domain whose distribution shifts from the target one. Mainstream UDA methods learn aligned features between the two domains, such that a classifier trained on the source features can be readily applied to the target ones. However, such a transferring strategy has a potential risk of damaging the intrinsic discrimination of target data. To alleviate this risk, we are motivated by the assumption of structural domain similarity, and propose to directly uncover the intrinsic target discrimination via...
We propose a joint intrinsic-extrinsic prior model to estimate both illumination and reflectance from an observed image. The 2D image formed from a 3D object in the scene is affected by the intrinsic properties (shape and texture) and by the extrinsic property (illumination). Based on a novel structure-preserving measure called local variation deviation, a joint prior model is proposed for better representation. Compared with conventional Retinex models, it can preserve structure information through the shape prior, estimate the reflectance with fine details through the texture prior, and capture the luminous...
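For context on the Retinex-style decomposition underlying this work, a minimal sketch that splits an observed image S into illumination L and reflectance R under the multiplicative model S = R * L. The Gaussian-smoothed illumination estimate here is a classical baseline standing in for the paper's joint intrinsic-extrinsic prior; the function name and parameters are illustrative.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def naive_retinex_decompose(S, sigma=15.0, eps=1e-6):
    """Decompose an observed image under S = R * L (element-wise).

    S: observed grayscale image, H x W, float in (0, 1]
    Returns a smooth illumination estimate L and reflectance R = S / L.
    A Gaussian blur stands in for the illumination prior; the paper's
    joint intrinsic-extrinsic prior is a more principled estimator.
    """
    L = gaussian_filter(S, sigma=sigma)   # smooth illumination estimate
    L = np.maximum(L, eps)                # avoid division by zero
    R = np.clip(S / L, 0.0, 1.0)          # reflectance retains fine details
    return L, R
```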
This paper targets learning a robust image representation for single training sample per person face recognition. Motivated by the success of deep learning in image representation, we propose a supervised autoencoder, which is a new type of building block for deep architectures. Two features distinguish our supervised autoencoder from the standard autoencoder. First, we enforce faces with variants to be mapped to the canonical face of the person, for example, a frontal face with neutral expression and normal illumination; second, we enforce features corresponding to the same person to be similar. As...
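A tiny PyTorch sketch of the two properties stated above: variant faces are decoded toward the canonical face, and features of the same person are pulled together. Layer sizes, the loss weighting, and the module names are illustrative assumptions, not the paper's architecture.

```python
import torch
import torch.nn as nn

class SupervisedAutoencoderSketch(nn.Module):
    """Minimal supervised-autoencoder-style block (illustrative sizes)."""
    def __init__(self, dim=1024, hidden=256):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(dim, hidden), nn.Sigmoid())
        self.dec = nn.Sequential(nn.Linear(hidden, dim), nn.Sigmoid())

    def forward(self, x):
        h = self.enc(x)
        return h, self.dec(h)

def supervised_ae_loss(model, x_variant, x_canonical, lam=0.1):
    """Reconstruction toward the canonical face plus a feature-similarity
    term between variant and canonical inputs of the same person."""
    h_var, recon = model(x_variant)
    h_can, _ = model(x_canonical)
    recon_loss = ((recon - x_canonical) ** 2).mean()   # map variants to canonical
    sim_loss = ((h_var - h_can) ** 2).mean()           # same person, similar features
    return recon_loss + lam * sim_loss
```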
Human actions captured in video sequences are three-dimensional signals characterizing visual appearance and motion dynamics. To learn action patterns, existing methods adopt Convolutional and/or Recurrent Neural Networks (CNNs and RNNs). CNN-based methods are effective in learning spatial appearances but are limited in modeling long-term motion dynamics. RNNs, especially Long Short-Term Memory (LSTM), are able to learn temporal dynamics. However, naively applying RNNs in a convolutional manner implicitly assumes that motions in videos are stationary across...
Reconstructing the 3D mesh of a general object from a single image is now possible thanks to the latest advances in deep learning technologies. However, due to the nontrivial difficulty of generating a feasible mesh structure, state-of-the-art approaches often simplify the problem by learning the displacements of a template mesh that deform it toward the target surface. Though reconstructing a shape with complex topology can be achieved by deforming multiple mesh patches, it remains difficult to stitch the results so as to ensure a high meshing quality. In this paper, we present an...
Automatic 3D content creation has achieved rapid progress recently due to the availability of pre-trained, large language models and image diffusion models, forming the emerging topic of text-to-3D content creation. Existing methods commonly use implicit scene representations, which couple geometry and appearance via volume rendering and are suboptimal in terms of recovering finer geometries and achieving photorealistic rendering; consequently, they are less effective for generating high-quality 3D assets. In this work, we...
Given labeled instances on a source domain and unlabeled ones on a target domain, unsupervised domain adaptation aims to learn a task classifier that can well classify target instances. Recent advances rely on domain-adversarial training of deep networks to learn domain-invariant features. However, due to an issue of mode collapse induced by the separate design of task and domain classifiers, these methods are limited in aligning the joint distributions of feature and category across domains. To overcome it, we propose a novel adversarial learning method termed...
Arbitrary style transfer (AST) and domain generalization (DG) are important yet challenging visual learning tasks, which can be cast as a feature distribution matching problem. Under the assumption of Gaussian feature distributions, conventional methods usually match the mean and standard deviation of features. However, the feature distributions of real-world data are much more complicated than Gaussian and cannot be accurately matched using only the first-order and second-order statistics, while it is computationally prohibitive to use...
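A minimal sketch of the first- and second-order matching the abstract refers to: content features are normalized and re-scaled with the style features' per-channel mean and standard deviation (an AdaIN-style baseline, not this paper's matching method; the function name is illustrative).

```python
import torch

def match_mean_std(content, style, eps=1e-5):
    """Match per-channel mean and std of content features to style features.

    content, style: feature maps of shape (N, C, H, W).
    This is the Gaussian-assumption baseline; matching only these two
    statistics is what the abstract argues is insufficient in general.
    """
    c_mean = content.mean(dim=(2, 3), keepdim=True)
    c_std = content.std(dim=(2, 3), keepdim=True) + eps
    s_mean = style.mean(dim=(2, 3), keepdim=True)
    s_std = style.std(dim=(2, 3), keepdim=True) + eps
    return (content - c_mean) / c_std * s_std + s_mean
```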
3D LiDAR (light detection and ranging) semantic segmentation is important in scene understanding for many applications, such as autonomous driving and robotics. For example, for autonomous cars equipped with RGB cameras and LiDAR, it is crucial to fuse complementary information from the different sensors for robust and accurate segmentation. Existing fusion-based methods, however, may not achieve promising performance due to the vast difference between the two modalities. In this work, we investigate a collaborative fusion scheme...
Gait, as a promising biometric for recognizing human identities, can be nonintrusively captured as a series of acceleration signals using wearable or portable smart devices, and it can be used for access control. Most existing methods for accelerometer-based gait recognition require explicit step-cycle detection and thus suffer from cycle detection failures and intercycle phase misalignment. We propose a novel algorithm that avoids both of the above problems. It makes use of a type of salient points termed signature points (SPs), and has...
Promoting the spatial resolution of off-the-shelf hyperspectral sensors is expected to improve typical computer vision tasks, such as target tracking and image classification. In this paper, we investigate a scenario in which two cameras, one with a conventional RGB sensor and the other with a hyperspectral sensor, capture the same scene, attempting to extract redundant and complementary information. We propose a non-negative sparse promoting framework to integrate the two sources of data into a set of high-resolution hyperspectral data. The formulated problem takes the form of matrix factorization...
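A minimal sketch of the kind of factorization such a formulation can reduce to: a non-negative spectral basis is learned from the low-resolution hyperspectral data, high-resolution coefficients are estimated from the RGB data through the camera's spectral response, and their product gives the fused image. Plain NMF plus non-negative least squares is a generic stand-in here; the sparsity-promoting terms and the paper's actual optimization are omitted, and all names are illustrative.

```python
import numpy as np
from sklearn.decomposition import NMF
from scipy.optimize import nnls

def fuse_hsi_rgb(Y_hs, Y_rgb, R, n_basis=10):
    """Fuse a low-res hyperspectral image with a high-res RGB image.

    Y_hs:  (bands, n_low)  low-res hyperspectral pixels (non-negative)
    Y_rgb: (3, n_high)     high-res RGB pixels (non-negative)
    R:     (3, bands)      spectral response of the RGB camera
    Returns an estimated (bands, n_high) high-res hyperspectral image.
    """
    # 1) Learn a non-negative spectral basis W from the hyperspectral data.
    nmf = NMF(n_components=n_basis, init="nndsvda", max_iter=500)
    W = nmf.fit_transform(Y_hs)                    # (bands, n_basis)

    # 2) For each high-res RGB pixel, solve non-negative least squares
    #    against the basis projected through the camera response.
    RW = R @ W                                     # (3, n_basis)
    H = np.stack([nnls(RW, Y_rgb[:, j])[0]
                  for j in range(Y_rgb.shape[1])], axis=1)

    # 3) Reconstruct high-resolution hyperspectral pixels.
    return W @ H
```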
We study in this paper the problem of learning classifiers from ambiguously labeled images. For instance, in a collection of news images, each image contains some samples of interest (e.g., human faces), and its associated caption has labels with the true ones included, while the sample-label association is unknown. The task is to learn classifiers from such images that generalize to new ones. An essential consideration here is how to make use of the information embedded in the relations between samples and labels, both within each image and across the image set. To this end, we propose a novel...
In this paper, we propose a framework for transforming images from a source image space to a target image space, based on learning coupled dictionaries from a training set of paired images. The framework can be used for applications such as image super-resolution and estimation of intrinsic image components (shading and albedo). It is a local parametric regression approach, using sparse feature representations over dictionaries learned across the two spaces. After dictionary learning, the sparse coefficient vectors of training patch pairs are partitioned into easily retrievable...
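A minimal sketch of the coupled-dictionary idea: a source patch is sparsely coded over the source dictionary, and the same coefficients reconstruct the corresponding target patch from the target dictionary. The dictionaries, the OMP solver, and the sparsity level below are generic stand-ins, not the paper's learned model or regression scheme.

```python
import numpy as np
from sklearn.linear_model import orthogonal_mp  # sparse coding via OMP

def transform_patch(x_src, D_src, D_tgt, n_nonzero=5):
    """Map a source-space patch to the target space through shared sparse codes.

    x_src: flattened source patch, shape (d_src,)
    D_src: source dictionary, shape (d_src, K)
    D_tgt: target dictionary, shape (d_tgt, K)
    The coefficient vector found on the source side is reused on the target side.
    """
    alpha = orthogonal_mp(D_src, x_src, n_nonzero_coefs=n_nonzero)
    return D_tgt @ alpha
```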
Part-based visual tracking is advantageous due to its robustness against partial occlusion. However, how to effectively exploit the confidence scores of individual parts to construct a robust tracker is still a challenging problem. In this paper, we address this problem by simultaneously matching parts in each of multiple frames, which is realized by a locality-constrained low-rank sparse learning method that establishes multi-frame part correspondences through the optimization of partial permutation matrices. The proposed part matching tracker (PMT) has...
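For intuition on part correspondence as a permutation, a minimal two-frame sketch using the Hungarian solver. This is a generic stand-in for the locality-constrained low-rank sparse optimization over partial permutation matrices described above; names and the cost choice are illustrative.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def match_parts(feat_a, feat_b):
    """Match part descriptors between two frames as a permutation.

    feat_a, feat_b: (P, d) arrays of part features in two frames.
    Returns perm such that part i in frame A corresponds to part perm[i] in frame B.
    """
    # Pairwise Euclidean distances between parts form the assignment cost.
    cost = np.linalg.norm(feat_a[:, None, :] - feat_b[None, :, :], axis=-1)
    rows, cols = linear_sum_assignment(cost)
    return cols[np.argsort(rows)]
```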
Most of the previous work on video action recognition uses complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are sophisticated to implement and difficult to extend to other sensor modalities. Recent studies find that there are no universally best hand-engineered features for all datasets, and that learning features directly from the data may be more advantageous. One such endeavor is Slow Feature Analysis (SFA), proposed by Wiskott and Sejnowski [33]. SFA can learn invariant and slowly varying...
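Since the abstract leans on Slow Feature Analysis, a minimal sketch of linear SFA: find projections of a temporal signal that vary as slowly as possible while keeping unit variance, via a generalized eigenproblem. This is textbook SFA, not the deep learning extension the paper develops; names and sizes are illustrative.

```python
import numpy as np
from scipy.linalg import eigh

def linear_sfa(X, n_components=5):
    """Linear Slow Feature Analysis on a temporal signal.

    X: (T, d) sequence of feature vectors over time.
    Returns a (d, n_components) projection matrix, slowest components first.
    """
    X = X - X.mean(axis=0)                 # center the signal
    Xdot = np.diff(X, axis=0)              # temporal derivative
    A = Xdot.T @ Xdot / (len(X) - 1)       # covariance of derivatives (slowness)
    B = X.T @ X / len(X)                   # covariance of the signal (unit variance)
    eigvals, eigvecs = eigh(A, B)          # generalized eigenproblem A w = lambda B w
    return eigvecs[:, :n_components]       # smallest eigenvalues = slowest features
```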