Siteng Ma

ORCID: 0000-0001-9678-0213
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Image and Video Retrieval Techniques
  • Remote-Sensing Image Classification
  • Generative Adversarial Networks and Image Synthesis
  • Face recognition and analysis
  • Advanced Neural Network Applications
  • Domain Adaptation and Few-Shot Learning
  • Facial Nerve Paralysis Treatment and Research
  • Cell Image Analysis Techniques
  • Explainable Artificial Intelligence (XAI)
  • Machine Learning and Algorithms
  • Remote Sensing and LiDAR Applications
  • Multimodal Machine Learning Applications
  • Robotics and Sensor-Based Localization
  • Machine Learning and Data Classification
  • Image Retrieval and Classification Techniques
  • Video Surveillance and Tracking Methods
  • COVID-19 diagnosis using AI
  • AI in cancer detection
  • Radiomics and Machine Learning in Medical Imaging

University College Dublin
2024

Xidian University
2022-2023

Fudan University
2022-2023

In the annotation of remote sensing images (RSIs), effectiveness common object detection methods trained on only a few samples decreases instantly, which has prompted increasing research few-shot problem in sensing. RSIs often exhibit suboptimal performance scenarios due to intricate nature scene information interference and high degree cosine similarity, both present significant challenges their effectiveness. this paper, two-stage framework based fine-tuning is selected deal with problems...

10.1109/tgrs.2023.3294943 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

The overwhelming majority of models for remote sensing image (RSI) scene classification generally require the weights pre-trained on natural images initialization before formal training. However, differences in imaging mechanisms lead to huge discrepancies between and RSIs, strong visual representation learned from massive limits performance when inferencing RSIs. To address this issue, well-established self-supervised contrastive learning paradigm field is introduced RSI field. We propose a...

10.1109/tgrs.2023.3291878 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

Labeling data in the field of remote sensing is time-consuming and labor-intensive, making domain adaptation between different domains an urgently needed solution. To address gap diverse datasets domain, numerous methods tailored for high-resolution imagery have emerged. Some existing focus on reducing at either feature level or pixel level, often overlooking their underlying connection. tackle this issue, we introduce a prototype-wise contrastive alignment paradigm (PCFA) aimed bridging...

10.1109/tgrs.2023.3334294 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

For the past few years, barrier of explainability accompanying by deep neural networks (DNNs) has been increasingly studied. The methods based on class activation map (CAM) which interpret model decision mapping output back to input space, have achieved a notable momentum among research. However, CAM-based cannot stably produce effective explanation results remote sensing images (RSIs), owing coarse location generated high-level features, whereas, RSIs contain abundant detailed spatial...

10.1016/j.jag.2023.103244 article EN cc-by-nc-nd International Journal of Applied Earth Observation and Geoinformation 2023-03-03

Active learning (AL) attempts to select informative samples in a dataset minimize the number of required labels while maximizing performance model. Current AL segmentation tasks is limited expansion popular classification-based methods including entropy, MC-dropout, etc. Meanwhile, most applications medical field are simply migrations that fail consider nature images, such as high class imbalance, domain difference, and data scarcity. In this study, we address these challenges propose novel...

10.1016/j.compbiomed.2024.108585 article EN cc-by Computers in Biology and Medicine 2024-05-12

Recent works for face editing usually manipulate the latent space of StyleGAN via linear semantic directions. However, they suffer from entanglement facial attributes, need to tune optimal strength, and are limited binary attributes with strong supervision signals. This paper proposes a novel adaptive nonlinear transformation disentangled conditional editing, termed AdaTrans. Specifically, our AdaTrans divides manipulation process into several finer steps; i.e., direction size at each step...

10.1109/iccv51070.2023.01922 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

The gap between self-supervised visual representation learning and supervised is gradually closing. Self-supervised does not rely on a large amount of labeled data reduces the loss human information. Compared with natural images, remote sensing images require rich samples annotation by experts. Moreover, many algorithms have poor interpretability unconvincing results. Therefore, this paper proposes method based prototype assignment designing pretext task so that network maps features to...

10.1109/tgrs.2022.3216831 article EN IEEE Transactions on Geoscience and Remote Sensing 2022-01-01

Facial attribute manipulation (FAM) aims to edit the semantic attributes of facial images according user's requirements. Unfortunately, majority existing FAM methods struggle in meeting at least one two requirements: high reconstruction quality and irrelevance preservation. To alleviate these limitations, we propose a novel Disentangled nOn-linear latent navigation framework for FAM, termed DO-FAM. promote quality, leverage hypernetworks fine-tune pre-trained StyleGAN2 generator. decouple...

10.1109/icassp49357.2023.10095959 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Active learning (AL) has found wide applications in medical image segmentation, aiming to alleviate the annotation workload and enhance performance. Conventional uncertainty-based AL methods, such as entropy Bayesian, often rely on an aggregate of all pixel-level metrics. However, imbalanced settings, these methods tend neglect significance target regions, eg., lesions, tumors. Moreover, selection introduces redundancy. These factors lead unsatisfactory performance, many cases, even...

10.48550/arxiv.2401.16298 preprint EN arXiv (Cornell University) 2024-01-29

To overcome the inherent domain gap between remote sensing (RS) images and natural images, some self-supervised representation learning methods have made promising progress. However, they overlooked diverse angles present in RS objects. This paper proposes Masked Angle-Aware Autoencoder (MA3E) to perceive learn during pre-training. We design a \textit{scaling center crop} operation create rotated crop with random orientation on each original image, introducing explicit angle variation. MA3E...

10.48550/arxiv.2408.01946 preprint EN arXiv (Cornell University) 2024-08-04

Facial attribute manipulation (FAM) aims to infer desired facial images by modifying specific attributes while keeping others unchanged. Existing works suffer from the entanglement of attributes, leading unexpected artifacts and loss identity information after editing. To alleviate these issues, we propose a novel FAM framework based on StyleGAN, termed VR-FAM, which can meet requirements FAM—editing ability, distortion, fidelity. First, variance-reduced encoder make latent space close one...

10.1109/icassp43922.2022.9746046 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

Recent works for face editing usually manipulate the latent space of StyleGAN via linear semantic directions. However, they suffer from entanglement facial attributes, need to tune optimal strength, and are limited binary attributes with strong supervision signals. This paper proposes a novel adaptive nonlinear transformation disentangled conditional editing, termed AdaTrans. Specifically, our AdaTrans divides manipulation process into several finer steps; i.e., direction size at each step...

10.48550/arxiv.2307.07790 preprint EN other-oa arXiv (Cornell University) 2023-01-01
Coming Soon ...