NFDI4DS | UHH-SEMS - Publication Details

Few-Shot Object Detection With Attention-RPN and Multi-Relation Detector

OPENALEX - Publications

Qi Fan Wei Zhuo Chi–Keung Tang Yu‐Wing Tai

Conventional methods for object detection typically require a substantial amount of training data and preparing such high-quality is very labor-intensive. In this paper, we propose novel few-shot network that aims at detecting objects unseen categories with only few annotated examples. Central to our method are Attention-RPN, Multi-Relation Detector Contrastive Training strategy, which exploit the similarity between shot support set query detect while suppressing false in background. To...

10.1109/cvpr42600.2020.00407 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

ST++: Make Self-trainingWork Better for Semi-supervised Semantic Segmentation

OPENALEX - Publications

Lihe Yang Wei Zhuo Lei Qi Yinghuan Shi Yang Gao

Self-training via pseudo labeling is a conventional, simple, and popular pipeline to leverage unlabeled data. In this work, we first construct strong baseline of self-training (namely ST) for semi-supervised semantic segmentation injecting data augmentations (SDA) on images alleviate overfitting noisy labels as well decouple similar predictions between the teacher student. With simple mechanism, our ST outperforms all existing methods without any bells whistles, e.g., iterative retraining....

10.1109/cvpr52688.2022.00423 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Indoor scene structure analysis for single image depth estimation

OPENALEX - Publications

Wei Zhuo Mathieu Salzmann Xuming He Miaomiao Liu

We tackle the problem of single image depth estimation, which, without additional knowledge, suffers from many ambiguities. Unlike previous approaches that only reason locally, we propose to exploit global structure scene estimate its depth. To this end, introduce a hierarchical representation scene, which models local jointly with mid-level and structures. formulate estimation as inference in graphical model whose edges let us encode interactions within across different layers our...

10.1109/cvpr.2015.7298660 article EN 2015-06-01

Indoor Scene Parsing with Instance Segmentation, Semantic Labeling and Support Relationship Inference

OPENALEX - Publications

Wei Zhuo Mathieu Salzmann Xuming He Miaomiao Liu

Over the years, indoor scene parsing has attracted a growing interest in computer vision community. Existing methods have typically focused on diverse subtasks of this challenging problem. In particular, while some them aim at segmenting image into regions, such as object or surface instances, others inferring semantic labels given their support relationships. These different tasks are treated separate ones. However, they bear strong connections: good regions should respect labels, can only...

10.1109/cvpr.2017.664 article EN 2017-07-01

DeepPipes: Learning 3D pipelines reconstruction from point clouds

OPENALEX - Publications

Lili Cheng Wei Zhuo Mingchao Sun Shiqing Xin Andrei Sharf and 3 more

10.1016/j.gmod.2020.101079 article EN Graphical Models 2020-07-03

As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection

OPENALEX - Publications

Shuilian Yao Yu Liu Qi Jia Sihong Chen Wei Zhuo

Compared to fully supervised object detection, training with sparse annotations typically leads a decline in performance due insufficient feature diversity. Existing sparsely annotated detection (SAOD) methods often rely on pseudo-labeling strategies, but these pseudo-labels tend introduce noise under extreme sparsity. To simultaneously avoid the impact of pseudo-label and enhance diversity, we propose novel Adaptive Feature Generation (AdaptFG) model that generates features based class...

10.1609/aaai.v39i9.33020 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Semi-Supervised Representation Learning for Segmentation on Medical Volumes and Sequences

OPENALEX - Publications

Zejian Chen Wei Zhuo Tianfu Wang Jun Cheng Wufeng Xue and 1 more

Benefiting from the massive labeled samples, deep learning-based segmentation methods have achieved great success for two dimensional natural images. However, it is still a challenging task to segment high medical volumes and sequences, due considerable efforts clinical expertise make large scale annotations. Self/semi-supervised learning been shown improve performance by exploiting unlabeled data. they are lack of mining local semantic discrimination exploitation volume/sequence structures....

10.1109/tmi.2023.3319973 article EN IEEE Transactions on Medical Imaging 2023-09-27

FSODv2: A Deep Calibrated Few-Shot Object Detection Network

OPENALEX - Publications

Qi Fan Wei Zhuo Chi–Keung Tang Yu‐Wing Tai

10.1007/s11263-024-02049-z article EN International Journal of Computer Vision 2024-04-04

MalwareVis

OPENALEX - Publications

Wei Zhuo Yacin Nadjin

This paper presents MalwareVis, a utility that provides security researchers method to browse, filter, view and compare malware network traces as entities.

10.1145/2379690.2379696 article EN 2012-10-04

Fully Self-Supervised Learning for Semantic Segmentation

OPENALEX - Publications

Yuan Wang Wei Zhuo Yucong Li Zhi Wang Qi Ju and 1 more

In this work, we present a fully self-supervised framework for semantic segmentation(FS^4). A bootstrapped strategy segmentation, which saves efforts the huge amount of annotation, is crucial building customized models from end-to-end open-world domains. This application eagerly needed in realistic scenarios. Even though recent segmentation methods have gained great progress, these works however heavily depend on fully-supervised pretrained model and make it impossible pipeline. To solve...

10.48550/arxiv.2202.11981 preprint EN other-oa arXiv (Cornell University) 2022-01-01

QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation

OPENALEX - Publications

Songhe Deng Wei Zhuo Jinheng Xie Linlin Shen

Class Activation Map (CAM) has emerged as a popular tool for weakly supervised semantic segmentation (WSSS), allowing the localization of object regions in an image using only image-level labels. However, existing CAM methods suffer from under-activation target and false-activation background due to fact that lack detailed supervision can hinder model's ability understand whole. In this paper, we propose novel Question-Answer Cross-Language-Image Matching framework WSSS (QA-CLIMS),...

10.1145/3581783.3612148 article EN 2023-10-26

AutoPrivacy: Automatic privacy protection and tagging suggestion for mobile social photo

OPENALEX - Publications

Wei Zhuo Yongdong Wu Yanjiang Yang Zheng Yan Qingqi Pei and 2 more

10.1016/j.cose.2017.12.002 article EN Computers & Security 2018-01-05

Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation

OPENALEX - Publications

Songhe Deng Wei Zhuo Jinheng Xie Linlin Shen

Class Activation Map (CAM) has emerged as a popular tool for weakly supervised semantic segmentation (WSSS), allowing the localization of object regions in an image using only image-level labels. However, existing CAM methods suffer from under-activation target and false-activation background due to fact that lack detailed supervision can hinder model's ability understand whole. In this paper, we propose novel Question-Answer Cross-Language-Image Matching framework WSSS (QA-CLIMS),...

10.48550/arxiv.2401.09883 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Enhancing Unsupervised Semantic Segmentation Through Context-Aware Clustering

OPENALEX - Publications

Wei Zhuo Yuan Wang Junliang Chen Songhe Deng Zhi Wang and 2 more

Despite the great progress of semantic segmentation with supervised learning, annotating large amounts pixel-wise labels is, however, very expensive and time-consuming. To this end, Unsupervised Semantic Segmentation(USS) has been proposed to learn segmentation, without any form annotations. This approach involves dense prediction semantics which is however challenging due unreliable nature local representations. solve problem, we propose a newly context-aware unsupervised framework, aims...

10.1109/tmm.2024.3405648 article EN IEEE Transactions on Multimedia 2024-01-01

Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector

OPENALEX - Publications

Qi Fan Wei Zhuo Chi–Keung Tang Yu‐Wing Tai

Conventional methods for object detection typically require a substantial amount of training data and preparing such high-quality is very labor-intensive. In this paper, we propose novel few-shot network that aims at detecting objects unseen categories with only few annotated examples. Central to our method are Attention-RPN, Multi-Relation Detector Contrastive Training strategy, which exploit the similarity between shot support set query detect while suppressing false in background. To...

10.48550/arxiv.1908.01998 preprint EN other-oa arXiv (Cornell University) 2019-01-01

3D Box Proposals From a Single Monocular Image of an Indoor Scene

OPENALEX - Publications

Wei Zhuo Mathieu Salzmann Xuming He Miaomiao Liu

Modern object detection methods typically rely on bounding box proposals as input. While initially popularized in the 2D case, this idea has received increasing attention for 3D boxes. Nevertheless, existing proposal techniques all assume having access to depth input, which is unfortunately not always available practice. In paper, we therefore introduce an approach generating from a single monocular RGB image. To end, develop integrated, fully differentiable framework that inherently...

10.1609/aaai.v32i1.12314 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-27

GPU Acceleration of Integer Wavelet Transform for TIFF Image

OPENALEX - Publications

Wei Zhuo Zhi Sun Yajuan Xie Shengsheng Yu

In publishing and printing of network version field, there are enormous number TIFF format (CMYK) images which requires too huge space for storing enough bandwidth transmitting. Therefore, common need to manipulate amount data brought about the issue fast lossless compression. 2D integer wavelet transform can be used compression static image, such as, 5/3 lifting is JEPG2000. Today, Multi-core (dual, four or eight cores) CPU technology help accelerate speed. However, current multi-more limit...

10.1109/paap.2010.42 article EN 2010-12-01

Orthogonal Hyper-category Guided Multi-interest Elicitation for Micro-video Matching

OPENALEX - Publications

Beibei Li Beihong Jin Yisong Yu Yiyuan Zheng Jiageng Song and 2 more

10.1109/icme57554.2024.10687880 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2024-07-15

Orthogonal Hyper-category Guided Multi-interest Elicitation for Micro-video Matching

OPENALEX - Publications

Beibei Li Beihong Jin Yisong Yu Yiyuan Zheng Jiageng Song and 2 more

Watching micro-videos is becoming a part of public daily life. Usually, user watching behaviors are thought to be rooted in their multiple different interests. In the paper, we propose model named OPAL for micro-video matching, which elicits user's heterogeneous interests by disentangling soft and hard interest embeddings from interactions. Moreover, employs two-stage training strategy, pre-train generate historical interactions under guidance orthogonal hyper-categories fine-tune reinforce...

10.48550/arxiv.2407.14741 preprint EN arXiv (Cornell University) 2024-07-19

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

OPENALEX - Publications

Lihe Yang Wei Zhuo Lei Qi Yinghuan Shi Yang Gao

Self-training via pseudo labeling is a conventional, simple, and popular pipeline to leverage unlabeled data. In this work, we first construct strong baseline of self-training (namely ST) for semi-supervised semantic segmentation injecting data augmentations (SDA) on images alleviate overfitting noisy labels as well decouple similar predictions between the teacher student. With simple mechanism, our ST outperforms all existing methods without any bells whistles, e.g., iterative re-training....

10.48550/arxiv.2106.05095 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Image Lossless Compression and Secure Transmission System Based on Integer Wavelet Transform

OPENALEX - Publications

Wei Zhuo Zhong Shu Yajuan Xie

Network is widely used in today. However, limit bandwidth and no security are the current network situation. Transmission of image data Internet may be affected because above shortcomings. On one hand, transmission important must efficient secure, such as, medical images, remote sensing image, so on. other large images (TIFF, BMP, DCMOM), applied reality, which cause difficult, or unavailable Internet. Considering scheme, using FTP, QICQ, email general tools, however, they can't satisfy both...

10.1109/mmit.2010.139 article EN 2010-01-01

Detection of potato early blight based on hyperspectral imaging

OPENALEX - Publications

Feng Zhang hong mei li Xinting Li Wei Zhuo Xu Feng Yu and 2 more

Two non-destructive detection methods for potato blight based on hyperspectral imaging were used: convolutional neural network (CNN) and support vector machine (SVM) to classify leaves. By comparing the classification results, advantages disadvantages of different are analyzed. In experiment, normal leaves early selected as research objects. Hyperspectral images samples obtained by system, then principal component extracted analysis method. It was found that significantly different, finally...

10.1117/12.2575049 article EN 2020-10-09