NFDI4DS | UHH-SEMS - Publication Details

A Lightweight Model for Perceptual Image Compression via Implicit Priors

OPENALEX - Publications

Wei Hao Yanhui Zhou Yiwen Jia Chenyang Ge Saeed Anwar and 1 more

Perceptual image compression has shown strong potential for producing visually appealing results at low bitrates, surpassing classical standards and pixel-wise distortion-oriented neural methods. However, existing methods typically improve performance by incorporating explicit semantic priors, such as segmentation maps and textual features, into the encoder or decoder, which increases model complexity by adding parameters and floating-point operations. This limits the model's practicality, as image compression often occurs on...

10.48550/arxiv.2502.13988 preprint EN arXiv (Cornell University) 2025-02-18

Self-supervised depth super-resolution with contrastive multiview pre-training

Xin Qiao Chenyang Ge Chaoqiang Zhao Fabio Tosi Matteo Poggi and 1 more

10.1016/j.neunet.2023.09.023 article EN Neural Networks 2023-09-22

Towards Extreme Image Rescaling with Generative Prior and Invertible Prior

Wei Hao Chenyang Ge Zhiyuan Li Xin Qiao Pengchao Deng

The goal of image rescaling is to embed the information from high-resolution images into low-resolution images and then reconstruct the high-resolution images in reverse. Existing methods either focus on small scaling factors or do not generalize well to natural images with diverse content in extreme settings, i.e., using large scaling factors (e.g., 16× and 32×). When performing extreme rescaling, previous methods often fail to produce plausible high-quality results due to insufficient cues in the low-resolution images. In this work, we propose an extreme image rescaling framework that exploits the rich generative prior integrated...

10.1109/tcsvt.2023.3349141 article EN IEEE Transactions on Circuits and Systems for Video Technology 2024-01-02

Real‐world image deblurring using data synthesis and feature complementary network

Wei Hao Chenyang Ge Xin Qiao Pengchao Deng

Many learning‐based approaches to image deblurring have received increasing attention in recent years. However, the models trained on existing synthetic datasets do not generalize well to real‐world blur, resulting in undesirable artifacts and residual blur. This work attempts to address this problem from two aspects: training data synthesis and network architecture. To narrow the domain gap between the synthetic and real domains, a realistic blur synthesis pipeline to generate high‐quality blurred data is proposed. Since non‐uniform...

10.1049/ipr2.13029 article EN cc-by-nc-nd IET Image Processing 2024-01-11

Attention-Aware Dual-Stream Network for Multimodal Face Anti-Spoofing

Pengchao Deng Chenyang Ge Xin Qiao Wei Hao Yuan Sun

Since the rapid development of face recognition systems using 3D cameras, the public has demanded great safety regulations for these devices. As a closely related topic, multimodal face anti-spoofing (FAS) has become an indispensable part of face recognition systems. However, existing multimodal FAS tools suffer from performance degradation under external low-lighting conditions and insufficient representation capabilities of fusion features. To address these issues, we present an attention-aware dual-stream FAS method using 3D cameras (i.e., IR+Depth)...

10.1109/tifs.2023.3293423 article EN IEEE Transactions on Information Forensics and Security 2023-01-01

A High Spatial Resolution Depth Sensing Method Based on Binocular Structured Light

Huimin Yao Chenyang Ge Jianru Xue Nanning Zheng

Depth information has been used in many fields because of its low cost and easy availability since the Microsoft Kinect was released. However, the Kinect and Kinect-like RGB-D sensors show limited performance in certain applications that place high demands on the accuracy and robustness of depth information. In this paper, we propose a depth sensing system that contains a laser projector similar to that of the Kinect, and two infrared cameras located on both sides of the projector, to obtain higher spatial resolution depth information. We apply the block-matching algorithm to estimate...

10.3390/s17040805 article EN cc-by Sensors 2017-04-08

Multimodal contrastive learning for face anti-spoofing

Pengchao Deng Chenyang Ge Wei Hao Yuan Sun Xin Qiao

10.1016/j.engappai.2023.107600 article EN Engineering Applications of Artificial Intelligence 2023-11-27

Depth Restoration in Under-Display Time-of-Flight Imaging

Xin Qiao Chenyang Ge Pengchao Deng Wei Hao Matteo Poggi and 1 more

Under-display imaging has recently received considerable attention in both academia and industry. As a variation of this technique, under-display ToF (UD-ToF) cameras enable depth sensing for full-screen devices. However, it also brings problems of image blurring, signal-to-noise ratio reduction, and ranging accuracy reduction. To address these issues, we propose a cascaded deep network to improve the quality of UD-ToF depth maps. The network comprises two subnets, with the first using a complex-valued network in the raw domain to perform denoising,...

10.1109/tpami.2022.3209905 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-01-01

Depth super-resolution from explicit and implicit high-frequency features

Xin Qiao Chenyang Ge Youmin Zhang Yanhui Zhou Fabio Tosi and 2 more

10.1016/j.cviu.2023.103841 article EN Computer Vision and Image Understanding 2023-09-23

The VLSI implementation of a high-resolution depth-sensing SoC based on active structured light

Huimin Yao Chenyang Ge Gang Hua Nanning Zheng

10.1007/s00138-015-0680-3 article EN Machine Vision and Applications 2015-04-09

Adequate Number of Lymph Nodes Sampled May Determine Appropriate Surgical Modality for Early-Stage NSCLC: A Population-Based Real-World Study

Lixian Ling Hongjuan Zheng Haiping Lin Chenyang Ge Dan Li and 4 more

10.1016/j.cllc.2022.12.011 article EN Clinical Lung Cancer 2022-12-26

Design Compact YOLO based Network for Small Target Detection on Infrared Image

Shuang Liu Zhicheng Liu Yuhai Li Wancheng Liu Chenyang Ge and 1 more

Most YOLO object detection neural networks focus on traditional RGB images, but previous studies rarely consider a special network with a compact architecture for infrared images. In this paper, we analyze the original YOLO architecture and propose a compact YOLO-based network, using different blocks from previous work, for small target detection on infrared images. We use layer, GhostConv convolution, Focus structure, Focal EIOU Loss and soft NMS modules to improve the network, which improves both accuracy and speed. The experimental results show that the model can be effectively...

10.1109/cac57257.2022.10054751 article EN 2022 China Automation Congress (CAC) 2022-11-25

An Efficient Motion Adaptive De-interlacing and Its VLSI Architecture Design

Hongbin Sun Nanning Zheng Chenyang Ge Dong Wang Pengju Ren

This paper presents an efficient motion adaptive de-interlacing technique that consists of two main steps, i.e., 4-field extended Gaussian filtering motion detection and adjustable window ELA de-interlacing. Four consecutive interlaced fields are used to detect motion accurately. With a Gaussian filter, the motion detection can eliminate the influence of noise. An adjustable window ELA is adopted to process motion pixels, which can reconstruct the image with high quality even in areas of horizontal edge and texture. Experimental results show that the proposed algorithm outperforms previous...

10.1109/isvlsi.2008.46 article EN IEEE Computer Society Annual Symposium on VLSI 2008-01-01

Valid depth data extraction and correction for time-of-flight camera

Xin Qiao Chenyang Ge Huimin Yao Pengchao Deng Yanhui Zhou

In this paper, an algorithm is presented to extract the valid depth data and correct the depth values of flying pixels by using the information of the confidence image. An adaptive segmentation of the measured confidence image is executed based on kernel density estimation and one-pass connected component labeling. Then a modified structure tensor is used to detect the invalid pixels contained in the image. Finally, these pixels are corrected with a bi-cubic interpolation method or selectively removed with a voting operation. In addition, erroneous pixels are excluded by the augmented confidence....

10.1117/12.2557533 article EN 2020-01-31

Direction estimation of uncorrelated and coherent narrowband signals with uniform linear array

Guangmin Wang Jingmin Xin Chenyang Ge Nanning Zheng Akira Sano

In this paper, an effective direction-of-arrival (DOA) estimation method is proposed with a uniform linear array (ULA) when uncorrelated and coherent signals coexist. The directions of arrival (DOAs) of the signals are estimated in two steps. The DOAs of the uncorrelated signals are first estimated using a conventional subspace method; then the information of the uncorrelated signals is eliminated by a matrix difference technique; and finally the coherent signals are decorrelated to be estimated. The theoretical analysis and simulation results show that the method is effective.

10.1109/isas.2011.5960928 article EN 2011-06-01

Multi-stream Face Anti-spoofing System Using 3D Information

Pengchao Deng Chenyang Ge Xin Qiao Wei Hao

Face anti-spoofing plays a crucial role in face recognition systems widely used in smart devices and security systems. In this paper, we propose a multi-stream fusion system based on a 3D camera, making full use of 3D information for face anti-spoofing. This system is composed of depth maps and Surface Normal Maps (SNM). Detailed discussions are given. Comparisons among different modalities and with other methods are provided through several experiments on the public WMCA dataset and our self-built Anti-3D dataset. Due...

10.1109/icce53296.2022.9730258 article EN 2022 IEEE International Conference on Consumer Electronics (ICCE) 2022-01-07

Seismic quality factor estimation using continuous wavelet transform

Yanhui Zhou Wei Zhao Yan Ge Jinghuai Gao Xiaokai Wang and 1 more

In this paper, seismic quality factor Q estimation from vertical seismic profile (VSP) data is discussed, using the continuous wavelet transform (CWT). We suppose that the source signature is a general constant-phase wavelet, which matches the real one better. Based on the CWT of the reference and target recordings, we derive a formula relating the frequency-independent Q to the ratio of the wavelet-domain peak amplitudes of these two recordings. The wavelet-domain peak amplitude denotes the peak of the modulus of the CWT of a recording at every fixed scale. The ratio is related to the dominant frequency and standard...

10.1109/igarss.2010.5649272 article EN 2010-07-01

A hierarchical and parallel SoC architecture for vision processor

Kuizhi Mei Bin Zhang Chenyang Ge

This paper presents a hierarchical and parallel SoC (System on Chip) architecture for a vision processor. The vision computing is divided into 3 task-level modules, which are decision, feature reorganization (or pattern generation), and feature extraction. In the proposed architecture, there are two separate buses to integrate the modules, and also a new interrupt of the RISC processor to implement synchronization between hardware modules and software. A human-face detecting and tracking application demo has been mapped and verified on FPGA. Architecture...

10.1587/elex.6.1380 article EN IEICE Electronics Express 2009-01-01

Traditional Transformation Theory Guided Model for Learned Image Compression

Zhiyuan Li Chenyang Ge Shun Li

Recently, many deep image compression methods have been proposed and achieved remarkable performance. However, these methods are dedicated to optimizing the compression performance and speed at medium and high bitrates, while research on ultra-low bitrates is limited. In this work, we propose an ultra-low-bitrate enhanced invertible encoding network guided by traditional transformation theory; experiments show that our codec outperforms existing methods in both compression and reconstruction performance. Specifically, we introduce the Block Discrete Cosine Transformation to model...

10.48550/arxiv.2402.15744 preprint EN arXiv (Cornell University) 2024-02-24

Traditional Transformation Theory Guided Model for Learned Image Compression

Zhiyuan Li Chenyang Ge Shun Li

10.1109/icce59016.2024.10444483 article EN 2024 IEEE International Conference on Consumer Electronics (ICCE) 2024-01-06

Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

Marcos V. Conde Zhijun Lei Wen J. Li Cosmin Stejerean Ioannis Katsavounidis and 70 more

This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF codec, instead of JPEG. All the proposed methods improve PSNR fidelity over Lanczos interpolation, and process images under 10ms. Out of the 160 participants, 25 teams...

10.48550/arxiv.2404.16484 preprint EN arXiv (Cornell University) 2024-04-25

Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior

Zhiyuan Li Yanhui Zhou Hao Wei Chenyang Ge Jingwen Jiang

Compressing images at extremely low bitrates (below 0.1 bits per pixel (bpp)) is a significant challenge due to substantial information loss. Existing extreme image compression methods generally suffer from heavy compression artifacts or low-fidelity reconstructions. To address this problem, we propose a novel extreme image compression framework that combines compressive VAEs and pre-trained text-to-image diffusion models in an end-to-end manner. Specifically, we introduce a latent feature-guided compression module based on compressive VAEs. This module compresses...

10.48550/arxiv.2404.18820 preprint EN arXiv (Cornell University) 2024-04-29

Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior

Zhiyuan Li Yanhui Zhou Wei Hao Chenyang Ge Jingwen Jiang

10.1109/tcsvt.2024.3455576 article EN IEEE Transactions on Circuits and Systems for Video Technology 2024-01-01

Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

Marcos V. Conde Zhijun Lei Wen J. Li Ioannis Katsavounidis Radu Timofte and 63 more

10.1109/cvprw63382.2024.00592 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2024-06-17