NFDI4DS | UHH-SEMS - Publication Details

Automatic Aug-Aware Contrastive Proposal Encoding for Few-Shot Object Detection of Remote Sensing Images

OPENALEX - Publications

Siteng Ma Biao Hou Zitong Wu Zhihao Li Xianpeng Guo and 2 more

In the annotation of remote sensing images (RSIs), effectiveness common object detection methods trained on only a few samples decreases instantly, which has prompted increasing research few-shot problem in sensing. RSIs often exhibit suboptimal performance scenarios due to intricate nature scene information interference and high degree cosine similarity, both present significant challenges their effectiveness. this paper, two-stage framework based fine-tuning is selected deal with problems...

10.1109/tgrs.2023.3294943 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

Contrastive Learning Based on Multiscale Hard Features for Remote-Sensing Image Scene Classification

OPENALEX - Publications

Zhihao Li Biao Hou Xianpeng Guo Siteng Ma Yanyu Cui and 2 more

The overwhelming majority of models for remote sensing image (RSI) scene classification generally require the weights pre-trained on natural images initialization before formal training. However, differences in imaging mechanisms lead to huge discrepancies between and RSIs, strong visual representation learned from massive limits performance when inferencing RSIs. To address this issue, well-established self-supervised contrastive learning paradigm field is introduced RSI field. We propose a...

10.1109/tgrs.2023.3291878 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

Unsupervised Prototype-Wise Contrastive Learning for Domain Adaptive Semantic Segmentation in Remote Sensing Image

OPENALEX - Publications

Siteng Ma Biao Hou Xianpeng Guo Zitong Wu Zhihao Li and 2 more

Labeling data in the field of remote sensing is time-consuming and labor-intensive, making domain adaptation between different domains an urgently needed solution. To address gap diverse datasets domain, numerous methods tailored for high-resolution imagery have emerged. Some existing focus on reducing at either feature level or pixel level, often overlooking their underlying connection. tackle this issue, we introduce a prototype-wise contrastive alignment paradigm (PCFA) aimed bridging...

10.1109/tgrs.2023.3334294 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

Visual explanations with detailed spatial information for remote sensing image classification via channel saliency

OPENALEX - Publications

Xianpeng Guo Biao Hou Chen Yang Siteng Ma Bo Ren and 2 more

For the past few years, barrier of explainability accompanying by deep neural networks (DNNs) has been increasingly studied. The methods based on class activation map (CAM) which interpret model decision mapping output back to input space, have achieved a notable momentum among research. However, CAM-based cannot stably produce effective explanation results remote sensing images (RSIs), owing coarse location generated high-level features, whereas, RSIs contain abundant detailed spatial...

10.1016/j.jag.2023.103244 article EN cc-by-nc-nd International Journal of Applied Earth Observation and Geoinformation 2023-03-03

Model-data-driven adversarial active learning for brain tumor segmentation

OPENALEX - Publications

Siteng Ma Prateek Mathur Zheng Ju Aonghus Lawlor Ruihai Dong

Active learning (AL) attempts to select informative samples in a dataset minimize the number of required labels while maximizing performance model. Current AL segmentation tasks is limited expansion popular classification-based methods including entropy, MC-dropout, etc. Meanwhile, most applications medical field are simply migrations that fail consider nature images, such as high class imbalance, domain difference, and data scarcity. In this study, we address these challenges propose novel...

10.1016/j.compbiomed.2024.108585 article EN cc-by Computers in Biology and Medicine 2024-05-12

Adaptive Nonlinear Latent Transformation for Conditional Face Editing

OPENALEX - Publications

Zhizhong Huang Siteng Ma Junping Zhang Hongming Shan

Recent works for face editing usually manipulate the latent space of StyleGAN via linear semantic directions. However, they suffer from entanglement facial attributes, need to tune optimal strength, and are limited binary attributes with strong supervision signals. This paper proposes a novel adaptive nonlinear transformation disentangled conditional editing, termed AdaTrans. Specifically, our AdaTrans divides manipulation process into several finer steps; i.e., direction size at each step...

10.1109/iccv51070.2023.01922 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Multicrop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification

OPENALEX - Publications

Siteng Ma Biao Hou Xianpeng Guo Zhihao Li Zitong Wu and 2 more

The gap between self-supervised visual representation learning and supervised is gradually closing. Self-supervised does not rely on a large amount of labeled data reduces the loss human information. Compared with natural images, remote sensing images require rich samples annotation by experts. Moreover, many algorithms have poor interpretability unconvincing results. Therefore, this paper proposes method based prototype assignment designing pretext task so that network maps features to...

10.1109/tgrs.2022.3216831 article EN IEEE Transactions on Geoscience and Remote Sensing 2022-01-01

DO-FAM: Disentangled Non-Linear Latent Navigation For Facial Attribute Manipulation

OPENALEX - Publications

Yifan Yuan Siteng Ma Hongming Shan Junping Zhang

Facial attribute manipulation (FAM) aims to edit the semantic attributes of facial images according user's requirements. Unfortunately, majority existing FAM methods struggle in meeting at least one two requirements: high reconstruction quality and irrelevance preservation. To alleviate these limitations, we propose a novel Disentangled nOn-linear latent navigation framework for FAM, termed DO-FAM. promote quality, leverage hypernetworks fine-tune pre-trained StyleGAN2 generator. decouple...

10.1109/icassp49357.2023.10095959 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Weakly supervised object localization via knowledge distillation based on foreground–background contrast

OPENALEX - Publications

Siteng Ma Biao Hou Zhihao Li Zitong Wu Xianpeng Guo and 2 more

10.1016/j.neucom.2023.127213 article EN Neurocomputing 2023-12-31

Breaking the Barrier: Selective Uncertainty-based Active Learning for Medical Image Segmentation

OPENALEX - Publications

Siteng Ma Haochang Wu Aonghus Lawlor Ruihai Dong

Active learning (AL) has found wide applications in medical image segmentation, aiming to alleviate the annotation workload and enhance performance. Conventional uncertainty-based AL methods, such as entropy Bayesian, often rely on an aggregate of all pixel-level metrics. However, imbalanced settings, these methods tend neglect significance target regions, eg., lesions, tumors. Moreover, selection introduces redundancy. These factors lead unsatisfactory performance, many cases, even...

10.48550/arxiv.2401.16298 preprint EN arXiv (Cornell University) 2024-01-29

Masked Angle-Aware Autoencoder for Remote Sensing Images

OPENALEX - Publications

Zhihao Li Biao Hou Siteng Ma Zitong Wu Xianpeng Guo and 2 more

To overcome the inherent domain gap between remote sensing (RS) images and natural images, some self-supervised representation learning methods have made promising progress. However, they overlooked diverse angles present in RS objects. This paper proposes Masked Angle-Aware Autoencoder (MA3E) to perceive learn during pre-training. We design a \textit{scaling center crop} operation create rotated crop with random orientation on each original image, introducing explicit angle variation. MA3E...

10.48550/arxiv.2408.01946 preprint EN arXiv (Cornell University) 2024-08-04

VR-FAM: Variance-Reduced Encoder with Nonlinear Transformation for Facial Attribute Manipulation

OPENALEX - Publications

Yifan Yuan Siteng Ma Junping Zhang

Facial attribute manipulation (FAM) aims to infer desired facial images by modifying specific attributes while keeping others unchanged. Existing works suffer from the entanglement of attributes, leading unexpected artifacts and loss identity information after editing. To alleviate these issues, we propose a novel FAM framework based on StyleGAN, termed VR-FAM, which can meet requirements FAM—editing ability, distortion, fidelity. First, variance-reduced encoder make latent space close one...

10.1109/icassp43922.2022.9746046 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

Adaptive Nonlinear Latent Transformation for Conditional Face Editing

OPENALEX - Publications

Zhizhong Huang Siteng Ma Junping Zhang Hongming Shan

Recent works for face editing usually manipulate the latent space of StyleGAN via linear semantic directions. However, they suffer from entanglement facial attributes, need to tune optimal strength, and are limited binary attributes with strong supervision signals. This paper proposes a novel adaptive nonlinear transformation disentangled conditional editing, termed AdaTrans. Specifically, our AdaTrans divides manipulation process into several finer steps; i.e., direction size at each step...

10.48550/arxiv.2307.07790 preprint EN other-oa arXiv (Cornell University) 2023-01-01