Bo Zhang

ORCID: 0000-0001-8052-782X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Domain Adaptation and Few-Shot Learning
  • Advanced Neural Network Applications
  • Multimodal Machine Learning Applications
  • Remote-Sensing Image Classification
  • Advanced Image and Video Retrieval Techniques
  • Adversarial Robustness in Machine Learning
  • Generative Adversarial Networks and Image Synthesis
  • Video Surveillance and Tracking Methods
  • Geophysical Methods and Applications
  • Anomaly Detection Techniques and Applications
  • Machine Learning and Data Classification
  • Human Pose and Action Recognition
  • Data Management and Algorithms
  • Digital Media Forensic Detection
  • COVID-19 diagnosis using AI
  • Machine Learning and ELM
  • Advanced SAR Imaging Techniques
  • Remote Sensing and LiDAR Applications
  • Neural Networks and Applications
  • Synthetic Aperture Radar (SAR) Applications and Techniques
  • Machine Learning and Algorithms
  • Topological and Geometric Data Analysis
  • Semantic Web and Ontologies
  • Robotics and Sensor-Based Localization
  • Face and Expression Recognition

Beijing Academy of Artificial Intelligence
2024-2025

Shanghai Artificial Intelligence Laboratory
2023-2025

Fudan University
2007-2024

Didi Chuxing (China)
2023

Shandong University
2023

Zhejiang Sci-Tech University
2023

Civil Aviation University of China
2023

Zhongshan Hospital
2021-2022

Microsoft Research Asia (China)
2020-2021

Donghua University
2021

Self-training is a competitive approach in domain adaptive segmentation, which trains the network with pseudo labels on target domain. However inevitably, are noisy and features dispersed due to discrepancy between source domains. In this paper, we rely representative prototypes, feature centroids of classes, address two issues for unsupervised adaptation. particular, take one step further exploit distances from prototypes that provide richer information than mere prototypes. Specifically,...

10.1109/cvpr46437.2021.01223 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Cholangiocarcinoma was a highly malignant liver cancer with poor prognosis, and immune infiltration status considered an important factor in response to immunotherapy. In this investigation, we tried locate related genes of cholangiocarcinoma through combination bulk-sequencing single-cell sequencing technology. Single sample gene set enrichment analysis used annotate datasets TCGA CHOL, GSE32225, GSE26566. Differentially expressed between high- low-infiltrated groups dataset were yielded...

10.3389/fimmu.2021.647209 article EN cc-by Frontiers in Immunology 2021-03-24

Although domain adaptation has been extensively studied in natural image-based segmentation tasks, the research on cross-domain for very-high-resolution (VHR) remote sensing images (RSIs) still remains underexplored. The VHR RSI-based mainly faces two critical challenges: 1) large area land covers with many diverse object categories bring severe local patch-level data distribution deviations, thus yielding different difficulties patches and 2) sensor types or dynamically changing modes cause...

10.1109/tgrs.2021.3117851 article EN IEEE Transactions on Geoscience and Remote Sensing 2021-10-15

Global channel pruning (GCP) aims to remove a subset of channels (filters) across different layers from deep model without hurting the performance. Previous works focus on either single task or simply adapting it multitask scenario, and still face following problems when handling pruning: 1) Due mismatch, well-pruned backbone for classification focuses preserving filters that can extract category-sensitive information, causing may be useful other tasks pruned during stage; 2) For...

10.1109/tpami.2023.3260903 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2023-03-28

Few-shot fine-grained learning aims to classify a query image into one of set support categories with differences. Although different objects' local differences via Deep Neural Networks has achieved success, how exploit the query-support cross-image object semantic relations in Transformer-based architecture remains under-explored few-shot scenario. In this work, we propose double-helix model, namely HelixFormer, achieve relation mining bidirectional and symmetrical manner. The HelixFormer...

10.1145/3503161.3547961 article EN Proceedings of the 30th ACM International Conference on Multimedia 2022-10-10

Simplicity is the ultimate sophistication. Differentiable Architecture Search (DARTS) has now become one of mainstream paradigms neural architecture search. However, it largely suffers from well-known performance collapse issue due to aggregation skip connections. It thought have overly benefited residual structure which accelerates information flow. To weaken this impact, we propose inject unbiased random noise impede We name novel approach NoisyDARTS. In effect, a network optimizer should...

10.48550/arxiv.2005.03566 preprint EN other-oa arXiv (Cornell University) 2020-01-01

The goal of few-shot fine-grained image classification is to recognize rarely seen objects in the query set, given only a few samples this class support set. Previous works focus on learning discriminative features from limited number training for distinguishing various classes, but ignore one important fact that spatial alignment semantic between with arbitrary changes and image, also critical computing similarity each support-query pair. In work, we propose an object-aware long-short-range...

10.1145/3474085.3475532 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

<sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> Deepfakes raised serious concerns on the authenticity of visual contents. Prior works revealed possibility to disrupt deepfakes by adding adversarial perturbations source data, but we argue that threat has not been eliminated yet. This paper presents MagDR, a mask-guided detection and reconstruction pipeline for defending from attacks. MagDR starts with module defines few criteria judge...

10.1109/cvpr46437.2021.00890 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Semi-supervised few-shot learning aims to improve the model generalization ability by means of both limited labeled data and widely-available unlabeled data. Previous works attempt relations between extra data, performing a label propagation or pseudo-labeling process using an episodic training strategy. However, feature distribution represented pseudo-labeled itself is coarse-grained, meaning that there might be large gap real query To this end, we propose sample-centric generation (SFG)...

10.1109/tip.2022.3154938 article EN IEEE Transactions on Image Processing 2022-01-01

Autophagy in tumor was also found to influence immune microenvironment. The relation between autophagy and cancer intrinsic PD1 PD-L1 expression not clear.With data from TCGA GTEx databases, mRNA levels of autophagy-related genes were compared samples normal tissues, which correlated with survival status. Expression associated clinical traits datasets GSE14520 ICGC LIRI. Single sample gene set enrichment analysis (ssGSEA) used calculate scores samples, using signatures MSigDB database....

10.1016/j.bbrc.2022.03.075 article EN cc-by-nc-nd Biochemical and Biophysical Research Communications 2022-03-17

Abstract Deep neural networks (DNNs) have achieved great success in many object detection tasks. However, such DNNS-based large models are generally computationally expensive and memory intensive. It is difficult to deploy them devices with low resources or scenarios high real-time requirements, which greatly limits their application promotion. In recent years, researchers focused on compressing without significantly degrading performance, made progress. Therefore, this paper presents a...

10.1007/s11042-023-17192-x article EN cc-by Multimedia Tools and Applications 2023-11-02

Although remote sensing (RS) data with multiple modalities can be used to significantly improve the accuracy of semantic segmentation in RS data, how effectively extract multimodal information through feature fusion remains a challenging task. Specifically, existing methods for still face two major challenges: 1) Due diverse imaging mechanisms boundaries same foreground may vary across different modalities, leading inclusion unwanted background semantics fused features; 2) from exhibit...

10.1109/tgrs.2025.3526247 article EN IEEE Transactions on Geoscience and Remote Sensing 2025-01-01

Deep learning has emerged as a transformative approach for solving complex pattern recognition and object detection challenges. This paper focuses on the application of novel framework based RT-DETR model analyzing intricate image data, particularly in areas such diabetic retinopathy detection. Diabetic retinopathy, leading cause vision loss globally, requires accurate efficient analysis to identify early-stage lesions. The proposed model, built Transformer-based architecture, excels at...

10.48550/arxiv.2501.16469 preprint EN arXiv (Cornell University) 2025-01-27

Multi-task dense prediction aims at handling multiple pixel-wise tasks within a unified network simultaneously for visual scene understanding. However, cross-task feature interactions of current methods are still suffering from incomplete levels representations, less discriminative semantics in participants, and inefficient pair-wise task interaction processes. To tackle these under-explored issues, we propose novel BridgeNet framework, which extracts comprehensive intermediate Bridge...

10.1109/tpami.2025.3535875 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2025-01-01

10.1109/jstars.2025.3545831 article EN cc-by IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2025-01-01

<?Pub Dtl=""?> Compared with general videos, movies and TV shows attract a significantly larger portion of people across time contain very rich interesting narrative patterns shots scenes. In this paper, we aim to recover the inherent structure scenes in such video narratives. The obtained could be useful for subsequent analysis tasks as tracking objects cuts, action retrieval, well enriching user browsing editing interfaces. Recent research on problem has mainly focused combining multiple...

10.1109/tmm.2013.2238522 article EN publisher-specific-oa IEEE Transactions on Multimedia 2013-01-09

10.1016/j.neucom.2016.08.059 article EN Neurocomputing 2016-09-02

10.1007/s11263-016-0970-x article EN International Journal of Computer Vision 2016-11-28

Unsupervised domain adaptive object detection aims to adapt a well-trained detector from its original source with rich labeled data new target unlabeled data. Previous works focus on improving the adaptability of region-based detectors, <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">e.g.</i> , Faster-RCNN, through matching cross-domain instance-level features that are explicitly extracted region proposal network (RPN). However, this is...

10.1109/tcsvt.2021.3069034 article EN IEEE Transactions on Circuits and Systems for Video Technology 2021-03-26

As a fundamental task of 3D perception, point cloud recognition has shown significant progress in recent years. However, existing methods still face challenges when dealing with geometry differences, resulting performance degradation distribution gap exists between the training and testing data, also known as domain generalization. In this work, we focus on problem propose general framework, named Push-and-Pull, aimed at effectively improving generalization ability models unseen target...

10.1109/tcsvt.2024.3371089 article EN IEEE Transactions on Circuits and Systems for Video Technology 2024-02-28

Few-shot object detection aims to localize and recognize potential objects of interest only by using a few annotated data, it is beneficial for remote sensing images (RSIs) based applications such as urban monitoring. Previous RSIs-based few-shot works often try convert the support from class-agnostic features class-specific vectors, then perform feature attention operations on query image be tested. However, methods still face two critical challenges: 1) They ignore spatial similarity...

10.1109/jstars.2022.3203126 article EN cc-by IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2022-01-01
Coming Soon ...