Weitao Feng

ORCID: 0000-0003-2517-9206
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Video Surveillance and Tracking Methods
  • Infrared Target Detection Methodologies
  • Autonomous Vehicle Technology and Safety
  • Visual Attention and Saliency Detection
  • Advanced Steganography and Watermarking Techniques
  • Anomaly Detection Techniques and Applications
  • Human Pose and Action Recognition
  • Digital Media Forensic Detection
  • Image Enhancement Techniques
  • Semantic Web and Ontologies
  • Fire Detection and Safety Systems
  • Internet Traffic Analysis and Secure E-voting
  • Service-Oriented Architecture and Web Services
  • Advanced Vision and Imaging
  • Chaos-based Image/Signal Encryption
  • Impact of Light on Environment and Health
  • Human-Animal Interaction Studies
  • Business Process Modeling and Analysis
  • Domain Adaptation and Few-Shot Learning
  • Advanced Image and Video Retrieval Techniques

The University of Sydney
2022-2024

Beihang University
2018

In this paper, we propose a unified Multi-Object Tracking (MOT) framework learning to make full use of long term and short cues for handling complex cases in MOT scenes. Besides, better association, switcher-aware classification (SAC), which takes the potential identity-switch causer (switcher) into consideration. Specifically, proposed includes Single Object (SOT) sub-net capture cues, re-identification (ReID) extract classifier matching decisions using extracted features from main target...

10.48550/arxiv.1901.06129 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Unsupervised learning has been popular in various computer vision tasks, including visual object tracking. However, prior unsupervised tracking approaches rely heavily on spatial supervision from templatesearch pairs and are still unable to track objects with strong variation over a long time span. As unlimited self-supervision signals can be obtained by video along cycle time, we investigate evolving Siamese tracker videos forward-backward. We present novel framework, which learn temporal...

10.1109/cvpr52688.2022.00793 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Process-supervised reward models serve as a fine-grained function that provides detailed step-wise feedback to model responses, facilitating effective selection of reasoning trajectories for complex tasks. Despite its advantages, evaluation on PRMs remains less explored, especially in the multimodal domain. To address this gap, paper first benchmarks current vision large language (VLLMs) two types models: output (ORMs) and process (PRMs) multiple vision-language benchmarks, which reveal...

10.48550/arxiv.2503.20271 preprint EN arXiv (Cornell University) 2025-03-26

Large scale surveillance video analysis is one of the most important components in future artificial intelligent city. It a very challenging but practical system, consists multiple functionalities such as object detection, tracking, identification and behavior analysis. In this paper, we try to address three tasks hosted NVIDIA AI City Challenge contest. First, system that transforming image coordinate world has been proposed, which useful estimate vehicle speed on road. Second, anomalies...

10.1109/cvprw.2018.00017 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2018-06-01

There are many decisions which usually made heuristically both in single object tracking (SOT) and multiple (MOT). Existing methods focus on tackling decision-making problems special tasks without a unified framework. In this paper, we propose decision controller (DC) is generally applicable to SOT MOT tasks. The learns an optimal policy with deep reinforcement learning algorithm that maximizes long term performance supervision. To prove the generalization ability of DC, apply it challenging...

10.1109/access.2019.2900476 article EN cc-by-nc-nd IEEE Access 2019-01-01

10.1007/s11263-023-01943-2 article EN International Journal of Computer Vision 2023-11-20

Joint detection and tracking, which solves two fundamental vision challenges in a unified manner, is challenging topic computer vision. In this area, the proper use of spatial-temporal information videos can help reduce local defects improve quality feature representations. Although modeling low-level (usually pixel-wise) has been studied, instance-level correlations (i.e., relations between semantic regions instances have occurred) not fully exploited. comparison, correlation more flexible...

10.1109/tmm.2023.3279670 article EN IEEE Transactions on Multimedia 2024-01-01

AIGC (AI-Generated Content) has achieved tremendous success in many applications such as text-to-image tasks, where the model can generate high-quality images with diverse prompts, namely, different descriptions natural languages. More surprisingly, emerging personalization techniques even succeed describing unseen concepts only a few personal references, and there have been some commercial platforms for sharing valuable personalized concept. However, an advanced technique also introduces...

10.48550/arxiv.2309.05940 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Diffusion models have achieved remarkable success in generating high-quality images. Recently, the open-source represented by Stable (SD) are thriving and accessible for customization, giving rise to a vibrant community of creators enthusiasts. However, widespread availability customized SD has led copyright concerns, like unauthorized model distribution unconsented commercial use. To address it, recent works aim let output watermarked content post-hoc forensics. Unfortunately, none them can...

10.48550/arxiv.2405.11135 preprint EN arXiv (Cornell University) 2024-05-17

Multi-Object Tracking (MOT) is a popular topic in computer vision. However, identity issue, i.e., an object wrongly associated with another of different identity, still remains to be challenging problem. To address it, switchers, confusing targets thatmay cause issues, should focused. Based on this motivation,this paper proposes novel switcher-aware framework for multi-object tracking, which consists Spatial Conflict Graph model (SCG) and Switcher-Aware Association (SAA). The SCG eliminates...

10.48550/arxiv.2009.10338 preprint EN other-oa arXiv (Cornell University) 2020-01-01
Coming Soon ...