Chen Zhang

ORCID: 0000-0001-8556-0186
Research Areas
  • Image and Signal Denoising Methods
  • Advanced Image Processing Techniques
  • Generative Adversarial Networks and Image Synthesis
  • Advanced Image Fusion Techniques
  • Vehicle License Plate Recognition
  • Advanced Vision and Imaging
  • Computer Graphics and Visualization Techniques
  • Image Enhancement Techniques
  • Image Retrieval and Classification Techniques
  • Medical Image Segmentation Techniques
  • Infrared Target Detection Methodologies
  • Digital Image Processing Techniques
  • Industrial Vision Systems and Defect Detection
  • Video Analysis and Summarization
  • Image and Object Detection Techniques
  • Advanced Image and Video Retrieval Techniques
  • Advanced Optical Imaging Technologies
  • Multimodal Machine Learning Applications
  • Image and Video Stabilization
  • Biomedical Text Mining and Ontologies
  • Advanced Neural Network Applications
  • Ultrasound Imaging and Elastography
  • Subtitles and Audiovisual Media
  • Interactive and Immersive Displays
  • Optical Measurement and Interference Techniques

Huazhong University of Science and Technology
2024

Institute of Physics
2023

Tianjin University
2023

Microsoft (United States)
2023

Chongqing University of Technology
2023

Huawei Technologies (Sweden)
2022

OmniVision Technologies (United States)
2021

Lanzhou Jiaotong University
2020

Technische Universität Ilmenau
2017-2019

University of Dayton
2014-2018

Recently, there has been a growing interest in 3D CNN-based stereo matching methods due to their remarkable accuracy. However, the high complexity of 3D convolution makes it challenging to strike a balance between accuracy and speed. Notably, explicit 3D volumes contain considerable redundancy. In this study, we delve into a more compact 2D implicit network to eliminate redundancy and boost real-time performance. Simply replacing the 3D networks with 2D implicit ones causes issues that can lead to performance degradation, including loss...

10.1609/aaai.v38i4.28107 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team. In 2022, the challenges were composed of 6 vision-based tasks: (1) action spotting, focusing on retrieving action timestamps in long untrimmed videos, (2) replay grounding, focusing on retrieving the live moment of an action shown in a replay, (3) pitch localization, focusing on detecting line and goal part elements, (4) camera calibration, dedicated to retrieving the intrinsic and extrinsic camera parameters, (5) player re-identification, focusing on retrieving the same players across multiple views, and (6) object tracking, focusing on tracking...

10.1145/3552437.3558545 preprint EN 2022-09-30

To alleviate the vocabulary problem, this paper investigates the role of user term feedback in interactive text-based image retrieval. Term feedback refers to the feedback from a user on specific terms regarding their relevance to a target image. Previous studies have indicated the effectiveness of term feedback in interactive text retrieval [14]. However, term feedback has not been shown to be effective in our experiments on text-based image retrieval. Our results indicate that, although term feedback has a positive effect by allowing users to identify more relevant terms, it also has a strong negative effect by providing more opportunities for users to specify...

10.1145/1076034.1076046 article EN 2005-08-15

We propose corrupted reference image quality assessment (CRIQA), a novel foundation for reasoning about image quality assessment and denoising problems jointly. In order to assess the visual quality of processed images relative to an ideal reference image (not provided), we predict the full-reference image quality assessment (FRIQA) scores of denoised images without having direct access to the reference image, but with the help of the observed image instead. Our simulation studies verify that the CRIQA scores indeed agree with the corresponding FRIQA scores, and human subject studies confirm that the CRIQA scores are more consistent with perceived visual quality than NRIQA scores....

10.1109/tip.2018.2878326 article EN publisher-specific-oa IEEE Transactions on Image Processing 2018-10-26

Many important data mining problems can be modeled as learning a (bidirectional) multidimensional mapping between two data domains. Based on generative adversarial networks (GANs), particularly conditional ones, cross-domain joint distribution matching is an increasingly popular kind of method for addressing such problems. Though significant advances have been achieved, there are still main disadvantages of existing models, i.e., the requirement of a large amount of paired training samples and the notorious...

10.1145/3219819.3219957 article EN 2018-07-19

Variational autoencoders (VAEs) have witnessed great success in performing the compression of image datasets. This success, made possible by the bits-back coding framework, has produced competitive compression performance across many benchmarks. However, despite this, VAE architectures are currently limited by a combination of coding practicalities and compression ratios. That is, not only do state-of-the-art methods, such as normalizing flows, often demonstrate out-performance, but the initial bits required in coding makes single parallel...

10.1109/cvpr52688.2022.00048 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Text queries are natural and intuitive for users to describe their information needs. However, text-based image retrieval faces many challenges. Traditional text retrieval techniques on image descriptions have not been very successful. This is mainly due to the inconsistent textual descriptions and the discrepancies between user queries and terms in the descriptions. To investigate strategies to alleviate this vocabulary problem, this article examines the role of user term feedback in targeted image search that is based on text-based image retrieval. Term feedback refers to the feedback from a user on specific terms regarding...

10.1145/1198296.1198299 article EN ACM transactions on office information systems 2007-02-01

At present, as traffic engineering and automation have developed, vehicle license plate recognition technology also needs a corresponding improvement. Taking the identification of a car picture as a case, the principle of automatic license plate recognition is illustrated in this paper, and the processing is described in detail, which includes preprocessing, edge extraction, license plate location, character segmentation, and character recognition. The program implementing the recognition is edited in Matlab. The example result shows that the method is feasible and can be put into practice.

10.2991/iccsee.2013.715 article EN cc-by-nc Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013) 2013-01-01

This study presents a hierarchical palmprint feature extraction and recognition approach based on multi‐wavelet and complex network (CN) since they can effectively decrease redundant information and enhance key points of main lines and wrinkles. The palmprint is first pre‐filtered and decomposed once using multi‐wavelet. Three components (LL 1,2,3) corresponding to the pre‐filter, except for the diagonal component, are extracted as elementary features. Second, binary images (BLL 1,2,3) are obtained by the average window method with different...

10.1049/iet-ipr.2017.0520 article EN IET Image Processing 2018-01-22

Video dubbing aims to translate the original speech in a film or television program into the speech in a target language, which can be achieved with a cascaded system consisting of speech recognition, machine translation and speech synthesis. To ensure that the translated speech is well aligned with the corresponding video, the length/duration of the translated speech should be as close as possible to that of the original speech, which requires strict length control. Previous works usually control the number of words or characters generated by the machine translation model to be similar to the source sentence, without considering the isochronicity of speech, as the speech duration...

10.1609/aaai.v37i11.26613 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

At present, as traffic engineering and automation have developed, vehicle license plate recognition technology also needs a corresponding improvement. Taking the identification of a car picture as a case, the principle of automatic license plate recognition is illustrated in this paper, and the processing is described in detail, which includes pre-processing, edge extraction, license plate location, character segmentation, and character recognition. The program implementing the recognition is edited in Matlab. The example result shows that the method is feasible and can be put into practice.

10.4028/www.scientific.net/amr.760-762.1638 article EN Advanced Materials Research 2013-09-18

The distribution of real camera sensor data is well approximated by Poisson, and the estimation of the light intensity signal from the Poisson count data plays a prominent role in digital imaging. It is highly desirable for imaging devices to carry the ability to assess the performance of image restoration. Drawing on a new category of image quality assessment called corrupted reference image quality assessment (CR-QA), we develop a computational technique for predicting the quality score of the popular structural similarity index (SSIM) without having direct access to the ideal image. We...

10.1109/icip.2014.7025550 article EN 2014 IEEE International Conference on Image Processing (ICIP) 2014-10-01
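
For orientation, the quantity that the abstract above sets out to predict, the SSIM score, can be sketched as follows. This is a minimal single-window illustration of the widely used SSIM formula with its customary constants (0.01 and 0.03 times the dynamic range, squared). It is not the paper's prediction technique: it needs the reference image, which is exactly what the corrupted-reference setting lacks. The function name `global_ssim` and the flat-list image representation are assumptions made for this example; practical SSIM implementations average the index over local windows.

```python
def global_ssim(x, y, dynamic_range=255.0):
    """Single-window SSIM between two equal-length lists of pixel intensities.

    Hypothetical helper for illustration: the whole signal is treated as one
    window, whereas standard SSIM averages over local windows.
    """
    if len(x) != len(y) or not x:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Population variances and covariance over the single window.
    var_x = sum((a - mean_x) * (a - mean_x) for a in x) / n
    var_y = sum((b - mean_y) * (b - mean_y) for b in y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    # Customary stabilising constants.
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    numerator = (2 * mean_x * mean_y + c1) * (2 * cov_xy + c2)
    denominator = (mean_x * mean_x + mean_y * mean_y + c1) * (var_x + var_y + c2)
    return numerator / denominator


reference = [10.0, 20.0, 30.0, 40.0]
identical_score = global_ssim(reference, reference)
reversed_score = global_ssim(reference, list(reversed(reference)))
```

An image compared with itself scores 1 (up to rounding), and any structural mismatch lowers the score.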

Generative Adversarial Networks (GANs) have achieved great success in synthesizing high-quality images. However, how to steer the generation process of a well-trained GAN model and customize the output image is much less explored. It has been recently found that modulating the input latent code used in GANs can reasonably alter some variation factors in the output image, but such manipulation usually changes the entire image as a whole. In this work, we propose an effective approach, termed LoGAN, to support local editing...

10.48550/arxiv.2105.08222 preprint EN cc-by-nc-nd arXiv (Cornell University) 2021-01-01

In the low-photon imaging regime, noise in image sensors is dominated by shot noise, best modeled statistically as a Poisson distribution. In this work, we show that the Poisson likelihood function is very well matched with the Bayesian estimation of the "difference of log of contrast of pixel intensities." More specifically, our work is rooted in statistical compositional data analysis, whereby we reinterpret the Aitchison geometry as a multi-resolution analysis in the log-pixel domain. We demonstrate that the difference-log-contrast has wavelet-like...

10.1109/tip.2021.3087943 article EN IEEE Transactions on Image Processing 2021-01-01

Aiming at the problem that the traditional prior information‐based defogging algorithm fails in some special scenarios, an end‐to‐end convolutional network based on an attention mechanism is proposed. The network consists of two modules: parameter estimation and image restoration. First, multi‐scale convolution is used to extract feature information. Residual and skip connection methods are used to improve the utilisation rate of shallow feature information. Secondly, channel domain attention is used to add weight to the input from the previous layer and select useful information. Finally, atmospheric...

10.1049/el.2020.1128 article EN Electronics Letters 2020-05-14

Since the detection result is not good enough when detecting a real image using the line search mechanism corner detection algorithm, a feature selection criterion based on priority and adaptive non-maximal suppression (ANMS) is proposed in this paper to control the number and density of features in the image. Experimental results show that the proposed algorithm can detect more reasonable features and can be used in image mosaics as well.

10.1109/emeit.2011.6024069 article EN 2011-08-01
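
The ANMS idea named in the abstract above can be sketched as follows. This is a minimal illustration of the commonly described formulation of adaptive non-maximal suppression: each corner gets a suppression radius equal to its distance to the nearest sufficiently stronger corner, and the corners with the largest radii are kept. It is not the paper's priority-based criterion. The function name `anms`, the `(x, y, strength)` tuple layout, and the robustness factor `c_robust=0.9` are assumptions for this example.

```python
import math


def anms(points, num_keep, c_robust=0.9):
    """Keep `num_keep` corners that are both strong and spatially spread out.

    points: list of (x, y, strength) tuples (hypothetical layout).
    A corner i is suppressed by corner j when strength_i < c_robust * strength_j.
    """
    radii = []
    for i, (xi, yi, si) in enumerate(points):
        radius = math.inf  # stays infinite if no corner is sufficiently stronger
        for j, (xj, yj, sj) in enumerate(points):
            if j != i and si < c_robust * sj:
                radius = min(radius, math.hypot(xi - xj, yi - yj))
        radii.append(radius)
    # Largest suppression radius first.
    order = sorted(range(len(points)), key=lambda i: radii[i], reverse=True)
    return [points[i] for i in order[:num_keep]]


corners = [(0, 0, 10.0), (1, 0, 8.0), (10, 0, 5.0), (10, 1, 4.0)]
selected = anms(corners, num_keep=2)
```

In this toy input, picking the two strongest corners would return two neighbours near the origin, whereas the radius-based ranking returns one corner from each cluster, which is how the number and density of features are controlled.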

Surface reconstruction has traditionally relied on the Multi-View Stereo (MVS)-based pipeline, which often suffers from noisy and incomplete geometry. This is because, although MVS has been proven to be an effective way to recover the geometry of scenes, especially for locally detailed areas with rich textures, it struggles to deal with areas with low texture and large variations of illumination where the photometric consistency is unreliable. Recently, Neural Implicit Surface Reconstruction (NISR) combines surface rendering and volume rendering techniques...

10.48550/arxiv.2401.12751 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Invisible watermarking is essential for safeguarding digital content, enabling copyright protection and content authentication. However, existing watermarking methods fall short in robustness against regeneration attacks. In this paper, we propose a novel method called FreqMark that involves unconstrained optimization of the image latent frequency space obtained after VAE encoding. Specifically, FreqMark embeds the watermark by optimizing the latent frequency space of the images and then extracts the watermark through a pre-trained image encoder. This optimization allows a flexible...

10.48550/arxiv.2410.20824 preprint EN arXiv (Cornell University) 2024-10-28

The examination of Computer Graphics in universities in recent years is basically a computer-based examination to investigate drawing ability. Based on many years of teaching practice and according to the transformation trend of intelligent paper marking, image processing technology is adopted, key information is extracted, a similarity calculation program is compiled, and the CAD automatic marking function is implemented by contrasting students' plots with the standard answer. Through examples, the grading results are ideally consistent with manual marking. The...

10.2991/icsem.2013.20 article EN cc-by-nc Proceedings of the 2nd International Conference On Systems Engineering and Modeling 2013-01-01

In conventional binaural rendering, a pair of head-related impulse responses (HRIR), measured from the source direction to the left and right ears, is convolved with a source signal to create the impression of a virtual 3D sound source when played on headphones. It is well known that using HRIRs measured in a real room, which includes a natural reverberant decay, increases the externalization and realism of the simulation. However, the HRIR filter length in even a small room can be many thousands of taps, leading to computational complexity issues in real world implementations....

10.1121/1.4800867 article EN Proceedings of Meetings on Acoustics 2013-01-01
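
The conventional rendering step described in the abstract above, convolving one source signal with a left and a right HRIR, can be sketched as follows. This is a minimal direct-form illustration under stated assumptions: the function names `convolve` and `binaural_render` are hypothetical, signals are plain lists of samples, and the toy impulse responses are made up for the example rather than measured.

```python
def convolve(signal, kernel):
    """Direct-form linear convolution; cost grows with len(signal) * len(kernel)."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def binaural_render(source, hrir_left, hrir_right):
    """Return the (left, right) headphone signals for one virtual source."""
    return convolve(source, hrir_left), convolve(source, hrir_right)


# Toy data (assumed values, not measured HRIRs).
source = [1.0, 2.0]
left, right = binaural_render(source, hrir_left=[1.0, 0.5], hrir_right=[0.0, 1.0])
```

The nested loop makes the cost concern in the abstract concrete: with an HRIR of many thousands of taps, every input sample triggers that many multiply-adds per ear, which is why long room responses are expensive to apply directly.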

How to preserve the spectral information when enhancing the spatial details is a key issue of remote sensing image fusion. The component substitution (CS)-based fusion methods can effectively enhance the spatial details while suffering from spectral distortion, and the multiresolution analysis (MRA)-based fusion methods have advantages in preserving spectral information but are not satisfactory in terms of spatial details. This paper proposes a hybrid fusion method to integrate the CS- and MRA-based approaches. The intensity component is first obtained from an original multispectral (MS) image by hyperspherical color space...

10.1117/1.jrs.12.035017 article EN cc-by Journal of Applied Remote Sensing 2018-09-13
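
The component substitution side of the abstract above can be sketched in its generic form: compute an intensity component from the multispectral bands, then inject the difference between the panchromatic value and that intensity into every band. This is a minimal sketch and not the paper's hybrid method. It assumes the MS image has already been resampled to the panchromatic grid, uses the plain band mean as the intensity instead of the hyperspherical color space transform mentioned in the abstract, and uses unit injection gains by default. The names `cs_fuse_pixel` and `cs_fuse` are hypothetical.

```python
def cs_fuse_pixel(ms_bands, pan, gains=None):
    """Fuse one pixel: band_k + gain_k * (pan - intensity).

    ms_bands: list of multispectral band values for the pixel.
    pan: panchromatic value at the same location.
    Intensity is the band mean here (an assumption for illustration).
    """
    n = len(ms_bands)
    if gains is None:
        gains = [1.0] * n
    intensity = sum(ms_bands) / n
    detail = pan - intensity  # spatial detail missing from the MS bands
    return [band + g * detail for band, g in zip(ms_bands, gains)]


def cs_fuse(ms_image, pan_image, gains=None):
    """Apply the per-pixel fusion over images stored as nested lists."""
    return [
        [cs_fuse_pixel(ms_px, pan_px, gains) for ms_px, pan_px in zip(ms_row, pan_row)]
        for ms_row, pan_row in zip(ms_image, pan_image)
    ]


fused_pixel = cs_fuse_pixel([2.0, 4.0], 5.0)
fused_image = cs_fuse([[[2.0, 4.0]]], [[5.0]])
```

Because the same detail term is added to every band, any mismatch between the panchromatic image and the computed intensity shifts all bands together, which is one way to see where the spectral distortion of CS methods comes from.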

For pore edge detection in Activated Carbon Fibers (ACF) material images, it is difficult for traditional approaches to obtain complete edge information. The Snake algorithm is a reasonable approach for edge detection. An improved initial contour model is proposed in this paper. A rectangle is first located to surround the pore to be detected instead of drawing a series of points as the initial contour. Then, we map these points onto the surrounded pore according to a certain rule to constitute the initial contour. After the mapping strategy, the Snake algorithm is used to iterate. Experiments show that information...

10.4028/www.scientific.net/amr.217-218.1663 article EN Advanced Materials Research 2011-03-01

In this paper, we propose a method of data processing for swept volumetric display and its experimental result. The proposed method consists of four main parts: data acquiring, pre-processing, transmitting, and post-processing. Different acquisition techniques are adopted for various data types. Data pre-processing mainly includes normalization, coordinate transformation, reduction & uniform, and splitting. The post-processing part realizes three functions: receiving and broadcasting, saving, and transmitting to the driver circuit. Compared...

10.1109/isise.2012.25 article EN 2012-12-01

This paper proposes a new approach of automated toll gate passing for an automated driving vehicle. This approach enables the vehicle to select the optimal toll gate and automatically pass it by using object detection, 3D environment construction, virtual line generation, path planning and motion control. After designing the concept of the approach, some demonstrations are conducted to prove it. The data-based scenario shows that the proposed approach can not only perceive the environment well for this purpose but also plan appropriate trajectories when encountering a complex scene near toll plazas.

10.1109/ivs.2018.8500687 article EN 2018 IEEE Intelligent Vehicles Symposium (IV) 2018-06-01