NFDI4DS | UHH-SEMS - Publication Details

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

OPENALEX - Publications

Wenhai Wang Jifeng Dai Zhe Chen Zhenhang Huang Zhiqi Li and 7 more

Compared to the great progress of large-scale vision transformers (ViTs) in recent years, models based on convolutional neural networks (CNNs) are still an early state. This work presents a new CNN-based foundation model, termed InternImage, which can obtain gain from increasing parameters and training data like ViTs. Different CNNs that focus large dense kernels, InternImage takes deformable convolution as core operator, so our model not only has effective receptive field required for...

10.1109/cvpr52729.2023.01385 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

OPENALEX - Publications

Wei‐Yun Wang Min Shi Qingyun Li Wenhai Wang Zhenhang Huang and 9 more

We present the All-Seeing (AS) project: a large-scale data and model for recognizing understanding everything in open world. Using scalable engine that incorporates human feedback efficient models loop, we create new dataset (AS-1B) with over 1 billion regions annotated semantic tags, question-answering pairs, detailed captions. It covers wide range of 3.5 million common rare concepts real world, has 132.2 tokens describe their attributes. Leveraging this dataset, develop (ASM), unified...

10.48550/arxiv.2308.01907 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Fast Single-Shot Ship Instance Segmentation Based on Polar Template Mask in Remote Sensing Images

OPENALEX - Publications

Zhenhang Huang Shihao Sun Ruirui Li

Object detection and instance segmentation in remote sensing images is a fundamental challenging task, due to the complexity of scenes targets. The latest methods tried take into account both efficiency accuracy segmentation. In order improve them, this paper, we propose single-shot convolutional neural network structure, which conceptually simple straightforward, meanwhile makes up for problem low networks. Our method, termed with SSS-Net, detects targets based on location object's center...

10.1109/igarss39084.2020.9324705 article EN IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium 2020-09-26

Orientated Silhouette Matching for Single-Shot Ship Instance Segmentation

OPENALEX - Publications

Zhenhang Huang Ruirui Li

Object detection and semantic segmentation have achieved remarkable performance propelled by deep convolutional neural networks. However, neither of them can well parse deal with swarms rotating ships in remote sensing images. In this article, we pay more attention to the instance-level task, which recognizes objects effectively straightly. We propose a new network architecture, called orientated silhouette matching network, employing multiscale features masks enable single-shot...

10.1109/jstars.2021.3132005 article EN cc-by IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2021-12-02

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

OPENALEX - Publications

Wenhai Wang Jifeng Dai Zhe Chen Zhenhang Huang Zhiqi Li and 7 more

Compared to the great progress of large-scale vision transformers (ViTs) in recent years, models based on convolutional neural networks (CNNs) are still an early state. This work presents a new CNN-based foundation model, termed InternImage, which can obtain gain from increasing parameters and training data like ViTs. Different CNNs that focus large dense kernels, InternImage takes deformable convolution as core operator, so our model not only has effective receptive field required for...

10.48550/arxiv.2211.05778 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Semantic segmentation of urban remote sensing image based on optimized tree structure convolutional neural network

OPENALEX - Publications

Wei Hu Bochuan Gao Zhenhang Huang Ruirui Li

10.11834/jig.190324 article EN Journal of Image and Graphics 2020-01-01

iShape: A First Step Towards Irregular Shape Instance Segmentation

OPENALEX - Publications

Lei Yang Yan Wei Yisheng He Wei Sun Zhenhang Huang and 2 more

In this paper, we introduce a brand new dataset to promote the study of instance segmentation for objects with irregular shapes. Our key observation is that though irregularly shaped widely exist in daily life and industrial scenarios, they received little attention field due lack corresponding datasets. To fill gap, propose iShape, an shape segmentation. iShape contains six sub-datasets one real five synthetics, each represents scene typical shape. Unlike most existing datasets regular...

10.48550/arxiv.2109.15068 preprint EN public-domain arXiv (Cornell University) 2021-01-01

ForgeryNet -- Face Forgery Analysis Challenge 2021: Methods and Results

OPENALEX - Publications

Yinan He Lu Sheng Jing Shao Ziwei Liu Zhaofan Zou and 17 more

The rapid progress of photorealistic synthesis techniques has reached a critical point where the boundary between real and manipulated images starts to blur. Recently, mega-scale deep face forgery dataset, ForgeryNet which comprised 2.9 million 221,247 videos been released. It is by far largest publicly available in terms data-scale, manipulations (7 image-level approaches, 8 video-level approaches), perturbations (36 independent more mixed perturbations), annotations (6.3 classification...

10.48550/arxiv.2112.08325 preprint EN cc-by-nc-sa arXiv (Cornell University) 2021-01-01

Vector Quantization for Large Scale CBIR

OPENALEX - Publications

Fan Gao Hao Zeng Zhenhang Huang

In order to meet the requirements of image retrieval and real-time index storage massive datasets, a Content-based Image Retrieval (CBIR) based on vector quantization is used in this paper. By feature extraction depth residual network, improved Product Quantization (PQ) coding algorithm generate dynamic database under FAISS Framework, which can realize nonlinear large-scale CBIR with high robustness. Empirical results show that PQ has achieved good performance compression rate, recall rate...

10.1109/icicas48597.2019.00035 article EN 2021 International Conference on Intelligent Computing, Automation and Systems (ICICAS) 2019-12-01

Fast Single-shot Ship Instance Segmentation Based on Polar Template Mask in Remote Sensing Images

OPENALEX - Publications

Zhenhang Huang Shihao Sun Ruirui Li

Object detection and instance segmentation in remote sensing images is a fundamental challenging task, due to the complexity of scenes targets. The latest methods tried take into account both efficiency accuracy segmentation. In order improve them, this paper, we propose single-shot convolutional neural network structure, which conceptually simple straightforward, meanwhile makes up for problem low networks. Our method, termed with SSS-Net, detects targets based on location object's center...

10.48550/arxiv.2008.12447 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Application research of MOF and its Ni-doping materials in supercapacitor

OPENALEX - Publications

Zhenhang Huang

With the development of technology and automation modern industry, finding a new kind energy storing device gradually occupies significant position. The supercapacitor takes up scientists’ eyes, because its advantages high watt density, long servicing life non-pollution. However, there is also some barriers blocking way forward development. In this project, we use MOF-5 as electrode material, combining with other metal atoms to enhance specific capacitance. Through researching, found that...

10.54097/hset.v13i.1352 article EN cc-by-nc Highlights in Science Engineering and Technology 2022-08-21