NFDI4DS | UHH-SEMS - Publication Details

Yanxiang Gong

ORCID: 0000-0002-3481-4454

About

Contact & Profiles

A5030751920

Research Areas

Handwritten Text Recognition Techniques
Advanced Image and Video Retrieval Techniques
Advanced Neural Network Applications
Anomaly Detection Techniques and Applications
Generative Adversarial Networks and Image Synthesis
Image Retrieval and Classification Techniques
Digital Media Forensic Detection
Natural Language Processing Techniques
Image Processing and 3D Reconstruction
Video Analysis and Summarization
Human Pose and Action Recognition
Vehicle License Plate Recognition
Multimodal Machine Learning Applications
Adversarial Robustness in Machine Learning
Remote-Sensing Image Classification
Advanced Vision and Imaging
Model Reduction and Neural Networks
Machine Fault Diagnosis Techniques
Automated Road and Building Extraction
Domain Adaptation and Few-Shot Learning
Advanced Image Processing Techniques
Computer Graphics and Visualization Techniques
Video Surveillance and Tracking Methods
Image Processing Techniques and Applications

University of Electronic Science and Technology of China
2019-2024

Unified Chinese License Plate detection and recognition with high efficiency

OPENALEX - Publications

Yanxiang Gong, Linjie Deng, Shuai Tao, Xinchen Lu, Peicheng Wu, and 3 more

10.1016/j.jvcir.2022.103541 article EN Journal of Visual Communication and Image Representation 2022-05-17

Mask guided two-stream network for end-to-end few-shot action recognition

Zhiwei Xie, Yanxiang Gong, Jiangfei Ji, Zheng Ma, Mei Xie

10.1016/j.neucom.2024.127582 article EN Neurocomputing 2024-03-22

Detecting multi-oriented text with corner-based region proposals

Linjie Deng, Yanxiang Gong, Yi Lin, Jingwen Shuai, Xiaoguang Tu, and 3 more

10.1016/j.neucom.2019.01.013 article EN Neurocomputing 2019-01-11

STELA: A Real-Time Scene Text Detector With Learned Anchor

Linjie Deng, Yanxiang Gong, Xinchen Lu, Yi Lin, Zheng Ma, and 1 more

To achieve high coverage of target boxes, a normal strategy of conventional one-stage anchor-based detectors is to utilize multiple priors at each spatial position, especially in scene text detection tasks. In this work, we present a simple and intuitive method for multi-oriented text detection where each location of feature maps only associates with one reference box. The idea is inspired from the two-stage R-CNN framework that can estimate the location of objects with any shape by using learned proposals. The aim of our method is to integrate this mechanism into...

10.1109/access.2019.2948405 article EN cc-by IEEE Access 2019-01-01

Focus-Enhanced Scene Text Recognition with Deformable Convolutions

Linjie Deng, Yanxiang Gong, Xinchen Lu, Xin Yi, Zheng Ma, and 1 more

Recently, scene text recognition methods based on deep learning have sprung up in the computer vision area. The existing methods achieved great performances, but the recognition of irregular text is still challenging due to the various shapes and distorted patterns. Consider that at the time of reading words in the real world, normally we will not rectify it in our mind but adjust our focus and visual fields. Similarly, through utilizing deformable convolutional layers whose geometric structures are adjustable, we present an enhanced recognition network without the steps...

10.1109/iccc47050.2019.9064428 preprint EN 2019-12-01

Generating Text Sequence Images for Recognition

Yanxiang Gong, Linjie Deng, Zheng Ma, Mei Xie

10.1007/s11063-019-10166-x article EN Neural Processing Letters 2020-01-02

Unsupervised domain adaptation via coarse-to-fine feature alignment method using contrastive learning

Shiyu Tang, Peijun Tang, Yanxiang Gong, Zheng Ma, Mei Xie

Previous feature alignment methods in unsupervised domain adaptation (UDA) mostly only align global features without considering the mismatch between class-wise features. In this work, we propose a new coarse-to-fine feature alignment method using contrastive learning called CFContra. It draws class-wise features closer than coarse feature alignment or class-wise feature alignment only, and therefore improves the model's performance to a great extent. We build it upon one of the most effective methods of UDA called entropy minimization to further improve performance. In particular, to prevent excessive memory...

10.48550/arxiv.2103.12371 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Lightweight semantic image synthesis with mutable framework

Yanxiang Gong, Guozhen Duan, Zheng Ma, Mei Xie

Image synthesis is a critical task in various computer vision technologies, and lots of methods have tried to translate semantic images into realistic ones for controllable synthesis. With the increasing image resolution, the networks are becoming larger, and the related applications are restricted. To alleviate the problem, we propose a lightweight mutable network for semantic image synthesis. The network is based on generative adversarial networks. We introduce a feature pyramid architecture in the generator to reduce the hidden node numbers. We also design a mutable scheme where the network will...

10.1117/1.jei.32.3.033027 article EN Journal of Electronic Imaging 2023-06-14

Distribution constraining for combating mode collapse in generative adversarial networks

Yanxiang Gong, Minjiang Zhong, Ji Yang, Mei Xie, Xin Ma

Image synthesis is a critical technique in the image processing field. Recently, generative adversarial networks (GANs) have played a significant role in image synthesis tasks. However, the issue of mode collapse remains a major challenge in GANs, which limits their potential applications. We propose a method to address the mode collapse problem. Our approach focuses on minimizing the divergence between the distributions of real and generated features, thereby reducing the learning pressure on the discriminator. An advantage of our method is that it does not require prior...

10.1117/1.jei.32.4.043029 article EN Journal of Electronic Imaging 2023-08-16

Distribution Fitting for Combating Mode Collapse in Generative Adversarial Networks

Yanxiang Gong, Zhiwei Xie, Guozhen Duan, Zheng Ma, Mei Xie

Mode collapse is a significant unsolved issue of generative adversarial networks (GANs). In this work, we examine the causes of mode collapse from a novel perspective. Due to the nonuniform sampling in the training process, some subdistributions may be missed when sampling data. As a result, even when the generated distribution differs from the real one, the GAN objective can still achieve the minimum. To address the issue, we propose a global distribution fitting (GDF) method with a penalty term to confine the generated data distribution. When the generated distribution differs from the real one, GDF will make the objective harder to reach the minimal value,...

10.1109/tnnls.2023.3313600 article EN IEEE Transactions on Neural Networks and Learning Systems 2023-09-20

Unattached irregular scene text rectification with refined objective

Yanxiang Gong, Linjie Deng, Zhiqiang Zhang, Guozhen Duan, Zheng Ma, and 1 more

10.1016/j.neucom.2021.08.047 article EN Neurocomputing 2021-08-13

Enhancing Feature Fusion Using Attention for Small Object Detection

Jie Li, Yanxiang Gong, Zheng Ma, Mei Xie

At present, object detection performance can meet some routine tasks' requirements. However, the for small-sized objects is far from satisfactory. Therefore, we propose feature layer attention module and nonlinear positioning loss penalty based on size to improve small performance. Our work proposes module, which introduces an mechanism in enhance model's objects. Through fusion scheme proposed this paper, solve problem of insufficient features a certain extent reduce difficulty model...

10.1109/iccc56324.2022.10066003 article EN 2022-12-09

STELA: A Real-Time Scene Text Detector with Learned Anchor

Linjie Deng, Yanxiang Gong, Xinchen Lu, Yi Lin, Zheng Ma, and 1 more

To achieve high coverage of target boxes, a normal strategy of conventional one-stage anchor-based detectors is to utilize multiple priors at each spatial position, especially in scene text detection tasks. In this work, we present a simple and intuitive method for multi-oriented text detection where each location of feature maps only associates with one reference box. The idea is inspired from the two-stage R-CNN framework that can estimate the location of objects with any shape by using learned proposals. The aim of our method is to integrate this mechanism into...

10.48550/arxiv.1909.07549 preprint EN other-oa arXiv (Cornell University) 2019-01-01

What's the relationship between CNNs and communication systems?

Hao Ge, Xiaoguang Tu, Yanxiang Gong, Mei Xie, Zheng Ma

The interpretability of Convolutional Neural Networks (CNNs) is an important topic in the field of computer vision. In recent years, works in this field generally adopt a mature model to reveal the internal mechanism of CNNs, helping to understand CNNs thoroughly. In this paper, we argue the working mechanism of CNNs can be revealed through a totally different interpretation, by comparing the communication systems and CNNs. This paper successfully obtained the corresponding relationship between the modules of the two, and verified the rationality of the corresponding relationship with experiments....

10.48550/arxiv.2003.01413 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Semantic Segmentation of High Resolution Remote Sensing Images with Extra Context Attention Mechanism

Weifu Fu, Qing Peng, Yanxiang Gong, Mei Xie, Shicheng Wang, and 1 more

High Resolution Remote Sensing Images (HRRSIs) usually have a larger size compared with natural images. Because of the limitation of GPU memory, it is not possible to train semantic segmentation models on HRRSIs directly. Commonly used methodologies perform training and prediction on cropped sub-images. Thus they fail to model the potential dependencies between pixels beyond the sub-images. To solve this problem, we firstly propose an extra context attention to capture global information from receptive fields discriminative...

10.1109/icct50939.2020.9295814 article EN 2020-10-28

AccNet: occluded scene text enhancing network with accretion blocks

Yanxiang Gong, Zhiqiang Zhang, Guozhen Duan, Zheng Ma, Mei Xie

10.1007/s00138-022-01351-5 article EN Machine Vision and Applications 2022-11-05

Realistic Image-to-Image Translation with Enhanced Texture

Guozhen Duan, Yanxiang Gong, Huijie Zhao, Wen Ma, Dongxing Song, and 2 more

In the image-to-image translation field, most researchers tend to achieve the overall translation of images without paying too much attention to the texture details of the images. However, it is also of great importance to have enhanced and more realistic textures for synthesized images, which could bring better impressions. Therefore, in this work, we propose a method based on CycleGAN in which the texture of the output is highly improved. The presented generator involves dilated convolutions that are conducive to processing image details. Furthermore, an improved...

10.1109/assp54407.2021.00010 article EN 2021-11-01
