Bin Chen

ORCID: 0009-0007-9955-9347
Research Areas
  • Advanced Neural Network Applications
  • Advanced Image and Video Retrieval Techniques
  • Advanced Image Processing Techniques
  • Image Enhancement Techniques
  • Video Surveillance and Tracking Methods
  • Anomaly Detection Techniques and Applications
  • Advanced Image Fusion Techniques
  • Robotics and Sensor-Based Localization
  • Remote-Sensing Image Classification
  • Generative Adversarial Networks and Image Synthesis
  • Advanced Memory and Neural Computing
  • Outsourcing and Supply Chain Management
  • Industrial Vision Systems and Defect Detection
  • Domain Adaptation and Few-Shot Learning
  • Image and Signal Denoising Methods
  • Image Processing and 3D Reconstruction
  • Handwritten Text Recognition Techniques
  • Image and Object Detection Techniques
  • CCD and CMOS Imaging Sensors
  • Collaboration in agile enterprises
  • Multimodal Machine Learning Applications
  • Advanced Vision and Imaging

Institute of Computing Technology
2022-2025

Chinese Academy of Sciences
2023-2025

University of Chinese Academy of Sciences
2022-2023

Fuzhou University
2023

Jiaxing University
2023

Harbin Institute of Technology
2020

Limited data usually cause deep neural networks to perform poorly after training, and many generative models have been proposed to synthesize data to improve the performance of models. However, existing generative models ignore small defect details (e.g., features and locations), so most of them cannot augment Defect Location Sensitive Data (DLS data), in which the ratio of object size to image size is 20% and the defects are located only on the object. In this paper, we propose a new data augmentation model, named Defect Location Sensitive GAN (DLS-GAN), to address the DLS data augmentation problem....

10.1109/tase.2023.3309629 article EN IEEE Transactions on Automation Science and Engineering 2023-09-04

This report introduces two high-quality datasets, Flickr360 and ODV360, for omnidirectional image and video super-resolution, respectively, and reports the NTIRE 2023 challenge on 360° omnidirectional image and video super-resolution. Unlike ordinary 2D images/videos with a narrow field of view, omnidirectional images/videos can represent the whole scene from all directions in one shot. There exists a large gap between omnidirectional image/video and ordinary 2D image/video in both the degradation and restoration processes. The challenge is held to facilitate the development of omnidirectional image and video super-resolution by considering their special...

10.1109/cvprw59228.2023.00174 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023-06-01

10.1109/wacv61041.2025.00852 article EN 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025-02-26

FPN is a common component used in object detectors; it supplements multi-scale information by interpolating and summing the features of adjacent levels. However, due to the existence of nonlinear operations and convolutional layers with different output dimensions, the relationship between different levels is much more complex, and pixel-wise summation is not an efficient approach. In this paper, we first analyze the design defects from the pixel level and the feature map level. Then, we design a novel parameter-free feature pyramid network named Dual Refinement...

10.48550/arxiv.2012.01733 preprint EN other-oa arXiv (Cornell University) 2020-01-01
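
The interpolation-and-summation step that the abstract above attributes to a standard FPN can be sketched as follows. This is a minimal illustration of the baseline operation only, not of the Dual Refinement method; the function names `upsample_nearest_2x` and `fpn_merge` are hypothetical, nearest-neighbour interpolation is an assumed choice, and the lateral convolutions of a real FPN are omitted.

```python
def upsample_nearest_2x(fmap):
    """Nearest-neighbour 2x upsampling of a 2D feature map (list of rows)."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column twice
        out.append(wide)
        out.append(list(wide))  # repeat each row twice
    return out


def fpn_merge(coarse, fine):
    """Upsample the coarser level and add it pixel-wise to the finer level."""
    up = upsample_nearest_2x(coarse)
    if len(up) != len(fine) or len(up[0]) != len(fine[0]):
        raise ValueError("fine level must be exactly 2x the size of the coarse level")
    return [[u + f for u, f in zip(up_row, fine_row)]
            for up_row, fine_row in zip(up, fine)]


# Toy single-channel example (values are made up for illustration).
coarse = [[1.0, 2.0], [3.0, 4.0]]
fine = [[0.5] * 4 for _ in range(4)]
merged = fpn_merge(coarse, fine)
```

Every fine-level pixel receives the same coarse value as its three neighbours in the 2x2 block, which is the kind of pixel-wise coupling the abstract describes as inefficient.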

Object detection on panoramic/spherical images has developed rapidly in the past few years, where the IoU calculator is a fundamental part of various detector components, i.e., Label Assignment, Loss, and NMS. Due to the low efficiency and non-differentiability of spherical Unbiased IoU, spherical approximate IoU methods have been proposed recently. We find that the key to these approximate methods is to map spherical boxes to planar boxes. However, there exist two problems in these methods: (1) they do not eliminate the influence of panoramic image distortion; (2) they break...

10.24963/ijcai.2023/137 article EN 2023-08-01
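
For reference, the planar IoU that such approximate methods fall back on once boxes have been mapped to the plane can be sketched as below. This shows only the ordinary axis-aligned case; the spherical-to-planar mapping is not reproduced because the abstract is truncated before describing it. The function name `planar_iou` and the `(x1, y1, x2, y2)` box format are assumptions made for this illustration.

```python
def planar_iou(a, b):
    """IoU of two axis-aligned planar boxes given as (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle.
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    # Clamp at zero so disjoint boxes give an empty intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


overlap = planar_iou((0, 0, 2, 2), (1, 1, 3, 3))  # intersection 1, union 7
```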

Recently, many semi-supervised object detection (SSOD) methods have adopted the teacher-student framework and achieved state-of-the-art results. However, the teacher network is tightly coupled with the student network, since the teacher is an exponential moving average (EMA) of the student, which causes a performance bottleneck. To address the coupling problem, we propose a Cycle Self-Training (CST) framework for SSOD, which consists of two teachers, T1 and T2, and two students, S1 and S2. Based on these networks, a cycle self-training mechanism is built, i.e., S1$\rightarrow...

10.1145/3503161.3548040 article EN Proceedings of the 30th ACM International Conference on Multimedia 2022-10-10
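
The coupling mentioned in the abstract above comes from the teacher being an exponential moving average of the student. A minimal sketch of that baseline update follows; it is not the Cycle Self-Training mechanism itself. The function name `ema_update`, the dictionary-of-floats parameter format, and the decay values are assumptions for illustration.

```python
def ema_update(teacher, student, decay=0.999):
    """Return new teacher weights as an EMA of the student weights.

    Both arguments map parameter names to floats; the inputs are not modified.
    """
    return {name: decay * teacher[name] + (1.0 - decay) * student[name]
            for name in teacher}


# Toy example with one scalar "weight" and a large step size for visibility.
teacher = {"w": 1.0}
student = {"w": 0.0}
teacher = ema_update(teacher, student, decay=0.9)  # w moves from 1.0 toward 0.0
teacher = ema_update(teacher, student, decay=0.9)
```

Because the teacher is a pure function of the student's history, it cannot diverge far from the student, which is the bottleneck the abstract points to.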

The importance of the attention mechanism in CV is growing, as it allows a neural network to focus more on what it should pay attention to. Channel attention and spatial attention are the two basic strategies now in use. Using one of them alone can enhance performance to some level, while combining them is beneficial but adds computational burden. We propose a Channel Spatial Attention Fusion Module (CSAFM); besides the use of channel and spatial information, GroupNorm and a reorganization operation are applied to enhance the ability of feature extraction and the representation of feature maps, which...

10.21203/rs.3.rs-2804607/v1 preprint EN cc-by Research Square (Research Square) 2023-04-18

Displaying high-quality images on edge devices, such as augmented reality devices, is essential for enhancing the user experience. However, these devices often face power consumption and computing resource limitations, making it challenging to apply many deep learning-based image compression algorithms in this field. Implicit Neural Representation (INR) for image compression is an emerging technology that offers two key benefits compared to cutting-edge autoencoder models: low computational complexity and parameter-free decoding....

10.48550/arxiv.2401.12587 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Strategic alliance provides a new way for logistics enterprises to increase competitiveness and adapt to the competitive environment on a win-win basis. This paper defines strategic alliance in logistics enterprises and analyses the causes of its formation and its manifestations. A reasonable profit distribution is the key to ensuring the success of the alliance, and the key to profit distribution is to determine the distribution ratio. According to the principle that all enterprises in an alliance should have equal responsibilities, rights, interests, and risks, this paper constructs a profit distribution model and proves the feasibility and superiority of the model. A case is also given to show how the model is applied...

10.1061/41127(382)436 article EN 2010-07-22

In order to reduce the time-consuming and expensive process of manually annotating data and to achieve lightweight deployment, this paper proposes an object detection method for weakly supervised learning with a discrimination mechanism. We introduce a classification branch and a location branch based on the Darknet-53 backbone network of the YOLO model, utilize Global Average Pooling (GAP) and Softmax to complete the classification of selected areas, and adopt the class activation map for location. In addition, we use model compression and pruning operations,...

10.1145/3425577.3425581 article EN 2020-08-23
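
The GAP-plus-Softmax classification step named in the abstract above can be sketched as follows. This is a simplified illustration under the assumption that each feature map corresponds directly to one class; the paper's actual branch design, including any layers between pooling and Softmax, is not given in the truncated abstract. The function names are hypothetical.

```python
import math


def global_average_pool(feature_maps):
    """Reduce each 2D feature map (list of rows) to its mean value."""
    return [sum(sum(row) for row in fmap) / (len(fmap) * len(fmap[0]))
            for fmap in feature_maps]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Toy example: three 2x2 maps, one per class (values are made up).
maps = [
    [[0.0, 0.0], [0.0, 0.0]],
    [[2.0, 2.0], [2.0, 2.0]],
    [[1.0, 0.0], [0.0, 1.0]],
]
pooled = global_average_pool(maps)
probs = softmax(pooled)
predicted = probs.index(max(probs))  # index of the most likely class
```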

Recent advances in Transformers have brought new trust to computer vision tasks. However, on small datasets, Transformers are hard to train and perform worse than convolutional neural networks. We make vision transformers as data-efficient as convolutional neural networks by introducing multi-focal attention bias. Inspired by the attention distance in a well-trained ViT, we constrain the self-attention of ViT to have a multi-scale localized receptive field. The size of the receptive field is adaptable during training so that the optimal configuration can be learned. We provide empirical...

10.48550/arxiv.2203.02358 preprint EN other-oa arXiv (Cornell University) 2022-01-01