NFDI4DS | UHH-SEMS - Publication Details

Jian Sun

ORCID: 0000-0001-6270-2698

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5101425421

Research Areas

Advanced Neural Network Applications
Advanced Image and Video Retrieval Techniques
Domain Adaptation and Few-Shot Learning
Video Surveillance and Tracking Methods
Human Pose and Action Recognition
Multimodal Machine Learning Applications
Image Enhancement Techniques
Advanced Vision and Imaging
Anomaly Detection Techniques and Applications
Computer Graphics and Visualization Techniques
Image Retrieval and Classification Techniques
Infrastructure Maintenance and Monitoring
Automated Road and Building Extraction
Advanced Image Processing Techniques
Hand Gesture Recognition Systems
Adversarial Robustness in Machine Learning
Remote Sensing and LiDAR Applications
Robotics and Sensor-Based Localization
Advanced Data Storage Technologies
Industrial Vision Systems and Defect Detection
Advanced Image Fusion Techniques
Visual Attention and Saliency Detection
Generative Adversarial Networks and Image Synthesis
Face recognition and analysis
Neural Networks and Applications

University of Macau
2024

National University of Defense Technology
2024

Shandong University of Science and Technology
2024

Xi'an Jiaotong University
2022-2023

Megvii (China)
2017-2022

University of Tennessee Health Science Center
2022

German Center for Neurodegenerative Diseases
2022

Vi Technology (United States)
2019-2022

Northwestern Polytechnical University
1998-2022

Fudan University
2021

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

OPENALEX - Publications

Shaoqing Ren Kaiming He Ross Girshick Jian Sun

State-of-the-art object detection networks depend on region proposal algorithms to hypothesize locations. Advances like SPPnet [1] and Fast R-CNN [2] have reduced the running time of these networks, exposing computation as a bottleneck. In this work, we introduce Region Proposal Network(RPN) that shares full-image convolutional features with network, thus enabling nearly cost-free proposals. An RPN is fully network simultaneously predicts bounds objectness scores at each position. The...

10.1109/tpami.2016.2577031 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2016-06-06

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

OPENALEX - Publications

Kaiming He Xiangyu Zhang Shaoqing Ren Jian Sun

Rectified activation units (rectifiers) are essential for state-of-the-art neural networks. In this work, we study rectifier networks image classification from two aspects. First, propose a Parametric Linear Unit (PReLU) that generalizes the traditional rectified unit. PReLU improves model fitting with nearly zero extra computational cost and little overfitting risk. Second, derive robust initialization method particularly considers nonlinearities. This enables us to train extremely deep...

10.1109/iccv.2015.123 article EN 2015-12-01

Single Image Haze Removal Using Dark Channel Prior

OPENALEX - Publications

Kaiming He Jian Sun Xiaoou Tang

In this paper, we propose a simple but effective image prior-dark channel prior to remove haze from single input image. The dark is kind of statistics outdoor haze-free images. It based on key observation-most local patches in images contain some pixels whose intensity very low at least one color channel. Using with the imaging model, can directly estimate thickness and recover high-quality Results variety hazy demonstrate power proposed prior. Moreover, depth map also be obtained as...

10.1109/tpami.2010.168 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2010-09-15

Deep Residual Learning for Image Recognition

OPENALEX - Publications

Kaiming He Xiangyu Zhang Shaoqing Ren Jian Sun

Deeper neural networks are more difficult to train. We present a residual learning framework ease the training of that substantially deeper than those used previously. explicitly reformulate layers as functions with reference layer inputs, instead unreferenced functions. provide comprehensive empirical evidence showing these easier optimize, and can gain accuracy from considerably increased depth. On ImageNet dataset we evaluate nets depth up 152 layers---8x VGG but still having lower...

10.48550/arxiv.1512.03385 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Single image haze removal using dark channel prior

OPENALEX - Publications

Kaiming He Jian Sun Xiaoou Tang

In this paper, we propose a simple but effective image prior - dark channel to remove haze from single input image. The is kind of statistics the haze-free outdoor images. It based on key observation most local patches in images contain some pixels which have very low intensities at least one color channel. Using with imaging model, can directly estimate thickness and recover high quality Results variety demonstrate power proposed prior. Moreover, depth map also be obtained as by-product removal.

10.1109/cvpr.2009.5206515 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2009-06-01

Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network

OPENALEX - Publications

Chao Peng Xiangyu Zhang Gang Yu Guiming Luo Jian Sun

One of recent trends [31, 32, 14] in network architecture design is stacking small filters (e.g., 1×1 or 3×3) the entire because stacked more efficient than a large kernel, given same computational complexity. However, field semantic segmentation, where we need to perform dense per-pixel prediction, find that kernel (and effective receptive field) plays an important role when have classification and localization tasks simultaneously. Following our principle, propose Global Convolutional...

10.1109/cvpr.2017.189 preprint EN 2017-07-01

Cascaded Pyramid Network for Multi-person Pose Estimation

OPENALEX - Publications

Yilun Chen Zhicheng Wang Yuxiang Peng Zhiqiang Zhang Gang Yu and 1 more

The topic of multi-person pose estimation has been largely improved recently, especially with the development convolutional neural network. However, there still exist a lot challenging cases, such as occluded keypoints, invisible keypoints and complex background, which cannot be well addressed. In this paper, we present novel network structure called Cascaded Pyramid Network (CPN) targets to relieve problem from these "hard" keypoints. More specifically, our algorithm includes two stages:...

10.1109/cvpr.2018.00742 preprint EN 2018-06-01

Convolutional neural networks at constrained time cost

OPENALEX - Publications

Kaiming He Jian Sun

Though recent advanced convolutional neural networks (CNNs) have been improving the image recognition accuracy, models are getting more complex and time-consuming. For real-world applications in industrial commercial scenarios, engineers developers often faced with requirement of constrained time budget. In this paper, we investigate accuracy CNNs under cost. Under constraint, designs network architectures should exhibit as trade-offs among factors like depth, numbers filters, filter sizes,...

10.1109/cvpr.2015.7299173 preprint EN 2015-06-01

Instance-Aware Semantic Segmentation via Multi-task Network Cascades

OPENALEX - Publications

Jifeng Dai Kaiming He Jian Sun

Semantic segmentation research has recently witnessed rapid progress, but many leading methods are unable to identify object instances. In this paper, we present Multitask Network Cascades for instance-aware semantic segmentation. Our model consists of three networks, respectively differentiating instances, estimating masks, and categorizing objects. These networks form a cascaded structure, designed share their convolutional features. We develop an algorithm the nontrivial end-to-end...

10.1109/cvpr.2016.343 article EN 2016-06-01

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

OPENALEX - Publications

Kaiming He Xiangyu Zhang Shaoqing Ren Jian Sun

10.48550/arxiv.1502.01852 preprint EN other-oa arXiv (Cornell University) 2015-01-01

ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation

OPENALEX - Publications

Di Lin Jifeng Dai Jiaya Jia Kaiming He Jian Sun

Large-scale data is of crucial importance for learning semantic segmentation models, but annotating per-pixel masks a tedious and inefficient procedure. We note that the topic interactive image segmentation, scribbles are very widely used in academic research commercial software, recognized as one most userfriendly ways interacting. In this paper, we propose to use annotate images, develop an algorithm train convolutional networks supervised by scribbles. Our based on graphical model jointly...

10.1109/cvpr.2016.344 preprint EN 2016-06-01

BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation

OPENALEX - Publications

Jifeng Dai Kaiming He Jian Sun

Recent leading approaches to semantic segmentation rely on deep convolutional networks trained with human-annotated, pixel-level masks. Such pixel-accurate supervision demands expensive labeling effort and limits the performance of that usually benefit from more training data. In this paper, we propose a method achieves competitive accuracy but only requires easily obtained bounding box annotations. The basic idea is iterate between automatically generating region proposals networks. These...

10.1109/iccv.2015.191 preprint EN 2015-12-01

Poisson matting

OPENALEX - Publications

Jian Sun Jiaya Jia Chi-Keung Tang Heung‐Yeung Shum

In this paper, we formulate the problem of natural image matting as one solving Poisson equations with matte gradient field. Our approach, which call , has following advantages. First, is directly reconstructed from a continuous field by using boundary information user-supplied trimap. Second, interactively manipulating number filtering tools, user can further improve results locally until he or she satisfied. The modified local result seamlessly integrated into final result. Experiments on...

10.1145/1015706.1015721 article EN ACM Transactions on Graphics 2004-08-01

Convolutional feature masking for joint object and stuff segmentation

OPENALEX - Publications

Jifeng Dai Kaiming He Jian Sun

The topic of semantic segmentation has witnessed considerable progress due to the powerful features learned by convolutional neural networks (CNNs) [13]. current leading approaches for exploit shape information extracting CNN from masked image regions. This strategy introduces artificial boundaries on images and may impact quality extracted features. Besides, operations raw domain require compute thousands a single image, which is time-consuming. In this paper, we propose via masking...

10.1109/cvpr.2015.7299025 preprint EN 2015-06-01

Cascaded hand pose regression

OPENALEX - Publications

Xiao Sun Yichen Wei Shuang Liang Xiaoou Tang Jian Sun

We extends the previous 2D cascaded object pose regression work [9] in two aspects so that it works better for 3D articulated objects. Our first contribution is pose-indexed features generalize parameterized and achieve invariance to transformations. second a principled hierarchical adapted structure. It therefore more accurate faster. Comprehensive experiments verify state-of-the-art accuracy efficiency of proposed approach on challenging hand estimation problem, public dataset our new dataset.

10.1109/cvpr.2015.7298683 article EN 2015-06-01

MegDet: A Large Mini-Batch Object Detector

OPENALEX - Publications

Chao Peng Tete Xiao Zeming Li Yuning Jiang Xiangyu Zhang and 3 more

The development of object detection in the era deep learning, from R-CNN [11], Fast/Faster [10, 31] to recent Mask [14] and RetinaNet [24], mainly come novel network, new framework, or loss design. However, mini-batch size, a key factor for training neural networks, has not been well studied detection. In this paper, we propose Large Mini-Batch Object Detector (MegDet) enable with large size up 256, so that can effectively utilize at most 128 GPUs significantly shorten time. Technically,...

10.1109/cvpr.2018.00647 article EN 2018-06-01

A global sampling method for alpha matting

OPENALEX - Publications

Kaiming He Christoph Rhemann Carsten Rother Xiaoou Tang Jian Sun

Alpha matting refers to the problem of softly extracting foreground from an image. Given a trimap (specifying known foreground/background and unknown pixels), straightforward way compute alpha value is sample some background colors for each pixel. Existing sampling-based methods often collect samples near pixels only. They fail if good cannot be found nearby. In this paper, we propose global sampling method that uses all available in Our set avoids missing samples. A simple but effective...

10.1109/cvpr.2011.5995495 article EN 2011-06-01

Efficient and accurate approximations of nonlinear convolutional networks

OPENALEX - Publications

Xiangyu Zhang Jianhua Zou Ming Xiang Kaiming He Jian Sun

This paper aims to accelerate the test-time computation of deep convolutional neural networks (CNNs). Unlike existing methods that are designed for approximating linear filters or responses, our method takes nonlinear units into account. We minimize reconstruction error subject a low-rank constraint which helps reduce complexity filters. develop an effective solution this constrained optimization problem. An algorithm is also presented reducing accumulated when multiple layers approximated....

10.1109/cvpr.2015.7298809 preprint EN 2015-06-01

Light-Head R-CNN: In Defense of Two-Stage Object Detector

OPENALEX - Publications

Zeming Li Chao Peng Gang Yu Xiangyu Zhang Yangdong Deng and 1 more

In this paper, we first investigate why typical two-stage methods are not as fast single-stage, detectors like YOLO and SSD. We find that Faster R-CNN R-FCN perform an intensive computation after or before RoI warping. involves two fully connected layers for recognition, while produces a large score maps. Thus, the speed of these networks is slow due to heavy-head design in architecture. Even if significantly reduce base model, cost cannot be largely decreased accordingly. propose new...

10.48550/arxiv.1711.07264 preprint EN other-oa arXiv (Cornell University) 2017-01-01

DetNet: A Backbone network for Object Detection

OPENALEX - Publications

Zeming Li Chao Peng Gang Yu Xiangyu Zhang Yangdong Deng and 1 more

Recent CNN based object detectors, no matter one-stage methods like YOLO, SSD, and RetinaNe or two-stage detectors Faster R-CNN, R-FCN FPN are usually trying to directly finetune from ImageNet pre-trained models designed for image classification. There has been little work discussing on the backbone feature extractor specifically detection. More importantly, there several differences between tasks of classification 1. RetinaNet involve extra stages against task handle objects with various...

10.48550/arxiv.1804.06215 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Identity Mappings in Deep Residual Networks

OPENALEX - Publications

Kaiming He Xiangyu Zhang Shaoqing Ren Jian Sun

Deep residual networks have emerged as a family of extremely deep architectures showing compelling accuracy and nice convergence behaviors. In this paper, we analyze the propagation formulations behind building blocks, which suggest that forward backward signals can be directly propagated from one block to any other block, when using identity mappings skip connections after-addition activation. A series ablation experiments support importance these mappings. This motivates us propose new...

10.48550/arxiv.1603.05027 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Coming Soon ...