NFDI4DS | UHH-SEMS - Publication Details

Binglu Wang

ORCID: 0000-0002-9266-4685

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5043220498

Research Areas

Multimodal Machine Learning Applications
Anomaly Detection Techniques and Applications
Advanced Image Processing Techniques
Advanced Image Fusion Techniques
Advanced Image and Video Retrieval Techniques
Human Pose and Action Recognition
Gaze Tracking and Assistive Technology
Image Enhancement Techniques
Advanced Neural Network Applications
Domain Adaptation and Few-Shot Learning
Video Surveillance and Tracking Methods
Visual Attention and Saliency Detection
Image and Signal Denoising Methods
Target Tracking and Data Fusion in Sensor Networks
Advanced Computing and Algorithms
Topic Modeling
scientometrics and bibliometrics research
Remote-Sensing Image Classification
Advanced Vision and Imaging
Infrared Target Detection Methodologies
Hand Gesture Recognition Systems
Image Retrieval and Classification Techniques
Gaussian Processes and Bayesian Inference
COVID-19 diagnosis using AI
Tactile and Sensory Interactions

Central South University
2023-2025

Northwestern Polytechnical University
2020-2025

Beijing Institute of Technology
2023-2024

Xi'an University of Architecture and Technology
2022-2024

Wuhan Polytechnic University
2024

Third Xiangya Hospital
2024

Ocean University of China
2023

China Railway Group (China)
2022

Peking Union Medical College Hospital
2021

Chinese Academy of Medical Sciences & Peking Union Medical College
2021

Differential Feature Awareness Network Within Antagonistic Learning for Infrared-Visible Object Detection

OPENALEX - Publications

Ruiheng Zhang Lu Li Qi Zhang J.Y. Zhang Lixin Xu and 2 more

The combination of infrared and visible videos aims to gather more comprehensive feature information from multiple sources reach superior results on various practical tasks, such as detection segmentation, over that a single modality. However, most existing dual-modality object algorithms ignore the modal differences fail consider correlation between extraction fusion, which leads incomplete inadequate fusion features. Hence, there raises an issue how preserve each unique fully utilize...

10.1109/tcsvt.2023.3289142 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-06-26

YOLO-DCTI: Small Object Detection in Remote Sensing Base on Contextual Transformer Enhancement

OPENALEX - Publications

Lingtong Min Ziman Fan Qinyi Lv Mohamed Reda Linghao Shen and 1 more

Object detection for remote sensing is a fundamental task in image processing of sensing; as one the core components, small or tiny object plays an important role. Despite considerable advancements achieved with integration CNN and transformer networks, there remains untapped potential enhancing extraction utilization information associated objects. Particularly within structures, this arises from disregard complex intertwined interplay between spatial context channel during global modeling...

10.3390/rs15163970 article EN cc-by Remote Sensing 2023-08-10

U²PNet: An Unsupervised Underwater Image-Restoration Network Using Polarization

OPENALEX - Publications

Linghao Shen Haisheng Xia Xun Zhang Yongqiang Zhao Ning Li and 3 more

This article presents U 2PNet, a novel unsupervised underwater image restoration network using polarization for improving signal-to-noise ratio and quality in imaging environments. Traditional methods require specific cues or pairs of datasets, which limit their practical applications. Our proposed method requires only one mosaicked polarized the scene does not datasets pretraining cues. We design two subnetworks (T-net B textsubscript ∞ -net) to accurately estimate transmission map...

10.1109/tcyb.2024.3365693 article EN IEEE Transactions on Cybernetics 2024-02-29

Two-Stage Spatial-Frequency Joint Learning for Large-Factor Remote Sensing Image Super-Resolution

OPENALEX - Publications

Jiarui Wang Yuting Lu Shunzhou Wang Binglu Wang Xiaoxu Wang and 1 more

Super-resolution neural networks have recently achieved great progress in restoring high-quality remote sensing images at low zoom-in magnitude. However, these often struggle with challenges like shape distortion and blurring effects due to the severe absence of structure texture details large-factor image super-resolution. Addressing challenges, we propose a novel Two-Stage Spatial-Frequency Joint Learning Network (TSFNet). TSFNet innovatively merges insights from both spatial frequency...

10.1109/tgrs.2024.3357173 article EN IEEE Transactions on Geoscience and Remote Sensing 2024-01-01

Hyperspectral and Multispectral Image Fusion via Graph Laplacian-Guided Coupled Tensor Decomposition

OPENALEX - Publications

Yuanyang Bu Yongqiang Zhao Jize Xue Jonathan Cheung-Wai Chan Seong G. Kong and 3 more

We propose a novel graph Laplacian-guided coupled tensor decomposition (gLGCTD) model for fusion of hyperspectral image (HSI) and multispectral (MSI) spatial spectral resolution enhancements. The Tucker is employed to capture the global interdependencies across different modes fully exploit intrinsic spatial-spectral information. To preserve local characteristics, complementary submanifold structures embedded in high-resolution (HR)-HSI are encoded by Laplacian regularizations. information...

10.1109/tgrs.2020.2992788 article EN IEEE Transactions on Geoscience and Remote Sensing 2020-05-18

Mosaic Convolution-Attention Network for Demosaicing Multispectral Filter Array Images

OPENALEX - Publications

Kai Feng Yongqiang Zhao Jonathan Cheung-Wai Chan Seong G. Kong Xun Zhang and 1 more

This paper presents a mosaic convolution-attention network (MCAN) for demosaicing spectral images captured using multispectral filter array (MSFA) imaging sensors. MSFA-based systems acquire information of scene in single snap-shot operation. A complete image is reconstructed by an image. To avoid aliasing and artifacts demosaicing, we utilize joint spatial-spectral correlation raw The proposed MCAN includes convolution module (MCM) attention (MAM). MCM extracts features via learning...

10.1109/tci.2021.3102052 article EN IEEE Transactions on Computational Imaging 2021-01-01

Multiple Instance Graph Learning for Weakly Supervised Remote Sensing Object Detection

OPENALEX - Publications

Binglu Wang Yongqiang Zhao Xuelong Li

Weakly supervised object detection (WSOD) has recently attracted much attention in the field of remote sensing, where only image-level labels that distinguish existence an images are required. However, existing methods frequently treat most discriminative area as optimal solution and, meanwhile, ignore fact more than one instance may exist a certain class sensing (RSIs). To address issue, we propose unique multiple graph (MIG) learning framework for WSOD RSIs. The motivation this work is...

10.1109/tgrs.2021.3123231 article EN IEEE Transactions on Geoscience and Remote Sensing 2021-10-26

GaTector: A Unified Framework for Gaze Object Prediction

OPENALEX - Publications

Binglu Wang Tao Hu Baoshan Li Xiaojuan Chen Zhijie Zhang

Gaze object prediction is a newly proposed task that aims to discover the objects being stared at by humans. It of great application significance but still lacks unified solution framework. An intuitive incorporate an detection branch into existing gaze method. However, previous methods usually use two different networks extract features from scene image and head image, which would lead heavy network architecture prevent each joint optimization. In this paper, we build novel framework named...

10.1109/cvpr52688.2022.01898 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Can large language models provide useful feedback on research papers? A large-scale empirical analysis

OPENALEX - Publications

Weixin Liang Yuhui Zhang Hancheng Cao Binglu Wang Daisy Yi Ding and 7 more

Expert feedback lays the foundation of rigorous research. However, rapid growth scholarly production and intricate knowledge specialization challenge conventional scientific mechanisms. High-quality peer reviews are increasingly difficult to obtain. Researchers who more junior or from under-resourced settings have especially hard times getting timely feedback. With breakthrough large language models (LLM) such as GPT-4, there is growing interest in using LLMs generate on research...

10.48550/arxiv.2310.01783 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Temporal Action Localization in the Deep Learning Era: A Survey

OPENALEX - Publications

Binglu Wang Yongqiang Zhao Le Yang Teng Long Xuelong Li

The temporal action localization research aims to discover instances from untrimmed videos, representing a fundamental step in the field of intelligent video understanding. With advent deep learning, backbone networks have been instrumental providing representative spatiotemporal features, while end-to-end learning paradigm has enabled development high-quality models through data-driven training. Both supervised and weakly approaches contributed rapid progress localization, resulting...

10.1109/tpami.2023.3330794 article EN cc-by-nc-nd IEEE Transactions on Pattern Analysis and Machine Intelligence 2023-11-06

Core: Cooperative Reconstruction for Multi-Agent Perception

OPENALEX - Publications

Binglu Wang Lei Zhang Zhaozhong Wang Yongqiang Zhao Tianfei Zhou

This paper presents Core, a conceptually simple, effective and communication-efficient model for multi-agent cooperative perception. It addresses the task from novel perspective of reconstruction, based on two key insights: 1) cooperating agents together provide more holistic observation environment, 2) can serve as valuable supervision to explicitly guide learning how reconstruct ideal collaboration. Core instantiates idea with three major components: compressor each agent create compact...

10.1109/iccv51070.2023.00800 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Radiologist-inspired Symmetric Local-Global Multi-Supervised Learning for early diagnosis of pneumoconiosis

OPENALEX - Publications

Jiarui Wang Meiyue Song Deng-Ping Fan Xiaoxu Wang Shaoting Zhang and 4 more

10.1016/j.eswa.2025.127173 article EN Expert Systems with Applications 2025-03-01

Hybrid Attention-Based U-Shaped Network for Remote Sensing Image Super-Resolution

OPENALEX - Publications

Jiarui Wang Binglu Wang Xiaoxu Wang Yongqiang Zhao Teng Long

Recently, remote sensing image super-resolution (RSISR) has drawn considerable attention and made great breakthroughs based on convolutional neural networks (CNNs). Due to the scale richness of texture structural information frequently recurring inside same images (RSIs) but varying greatly with different RSIs, state-of-the-art CNN-based methods have begun explore multiscale global features in RSIs by using mechanisms. However, they are still insufficient significant content clues RSIs. In...

10.1109/tgrs.2023.3283769 article EN cc-by IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

PneumoLLM: Harnessing the power of large language model for pneumoconiosis diagnosis

OPENALEX - Publications

Meiyue Song Jiarui Wang Zhihua Yu Jiaxin Wang Le Yang and 11 more

10.1016/j.media.2024.103248 article EN Medical Image Analysis 2024-06-20

Multimodal Large Models Are Effective Action Anticipators

OPENALEX - Publications

Binglu Wang Yao Tian Shunzhou Wang Le Yang

The task of long-term action anticipation demands solutions that can effectively model temporal dynamics over extended periods while deeply understanding the inherent semantics actions. Traditional approaches, which primarily rely on recurrent units or Transformer layers to capture dependencies, often fall short in addressing these challenges. Large Language Models (LLMs), with their robust sequential modeling capabilities and extensive commonsense knowledge, present new opportunities for...

10.48550/arxiv.2501.00795 preprint EN arXiv (Cornell University) 2025-01-01

CM-YOLO: Context Modulated Representation Learning for Ship Detection

OPENALEX - Publications

Lingtong Min Feiyang Dou Yani Zhang Dian Shao Li Li and 1 more

10.1109/tgrs.2025.3538848 article EN IEEE Transactions on Geoscience and Remote Sensing 2025-01-01

Considering author sequence in all-author co-citation analysis

OPENALEX - Publications

Yi Bu Binglu Wang Zaida Chinchilla‐Rodríguez Cassidy R. Sugimoto Yong Huang and 1 more

10.1016/j.ipm.2020.102300 article EN Information Processing & Management 2020-06-18

SODA: Weakly Supervised Temporal Action Localization Based on Astute Background Response and Self-Distillation Learning

OPENALEX - Publications

Tao Zhao Junwei Han Le Yang Binglu Wang Dingwen Zhang

10.1007/s11263-021-01473-9 article EN International Journal of Computer Vision 2021-05-31

Exploring Sub-Action Granularity for Weakly Supervised Temporal Action Localization

OPENALEX - Publications

Binglu Wang Xun Zhang Yongqiang Zhao

Modeling cross-video relationship is an important issue for the weakly supervised temporal action localization task. To this end, traditional methods operate at level and rely on complicated strategies to prepare triplet samples, which only mines relationships among three videos from two categories. In work, we observe that instances different categories could exhibit similar motion patterns, i.e. subaction, propose sub-action granularity elaborately explore relationships. However, given...

10.1109/tcsvt.2021.3089323 article EN IEEE Transactions on Circuits and Systems for Video Technology 2021-06-14

Field measurement and numerical investigation of artificial ground freezing for the construction of a subway cross passage under groundwater flow

OPENALEX - Publications

Xin Liu Yupeng Shen Zhicheng Zhang Zhijian Liu Binglu Wang and 2 more

10.1016/j.trgeo.2022.100869 article EN Transportation Geotechnics 2022-09-28

POLO: Learning Explicit Cross-Modality Fusion for Temporal Action Localization

OPENALEX - Publications

Binglu Wang Le Yang Yongqiang Zhao

Temporal action localization aims at discovering instances in untrimmed videos, where RGB and flow are two widely used feature modalities. Specifically, chiefly reveals appearance mainly depicts motion. Given features, previous methods employ the early fusion or late paradigm to mine complementarity between them. By concatenating raw implicitly achieved by network, but it partly discards particularity of each modality. The independently maintains branches explore modality, only fuses...

10.1109/lsp.2021.3061289 article EN IEEE Signal Processing Letters 2021-01-01

Prototype-Based Intent Perception

OPENALEX - Publications

Binglu Wang Kang Yang Yongqiang Zhao Teng Long Xuelong Li

Intent perception is a novel task that aims to understand the intention of images, regular classification methods usually perform unsatisfactorily on intent due semantic ambiguity problem,i.e. intra-class variety problem in which images same class may contain objects different categories and inter-class confusion classes similar categories. To address this problem, paper introduces prototype learning into proposes unified framework named PIP-Net reduce influence ambiguity. Specifically, for...

10.1109/tmm.2023.3234817 article EN IEEE Transactions on Multimedia 2023-01-01

Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation

OPENALEX - Publications

Ning Li Binglu Wang François Goudail Yongqiang Zhao Quan Pan

Denoising and demosaicking long-wave infrared (LWIR) division-of-focal-plane (DoFP) polarization images are crucial for various vision applications. However, existing methods rely on the sequential application of individual denoising processes, which may result in accumulation errors produced by each process. To address this issue, we propose a joint method LWIR DoFP based three-stage progressive deep convolutional neural network. ensure generalization ability network, it is essential to...

10.1109/tip.2023.3327590 article EN IEEE Transactions on Image Processing 2023-01-01

BEVRefiner: Improving 3D Object Detection in Bird’s-Eye-View via Dual Refinement

OPENALEX - Publications

Binglu Wang Haowen Zheng Lei Zhang Nian Liu Rao Muhammad Anwer and 3 more

Many multi-view camera-based 3D object detection models transform the image features into Bird's-Eye-View (BEV) via Lift-Splat-Shoot (LSS) mechanism, which "lifts" 2D camera-view to voxel space based on predicted depth distribution and then "splats" a BEV plane for subsequent detection. However, feature in such one-stage view transformation scheme heavily relies quality of features, further determines final performance. In this paper, we propose BEVRefiner model performs dual refinement both...

10.1109/tits.2024.3394550 article EN IEEE Transactions on Intelligent Transportation Systems 2024-05-10

Coming Soon ...