Zhenhang Huang

ORCID: 0000-0002-1925-2924
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Neural Network Applications
  • Advanced Image and Video Retrieval Techniques
  • Robotics and Sensor-Based Localization
  • Remote Sensing and LiDAR Applications
  • Advanced Memory and Neural Computing
  • Advancements in Battery Materials
  • Image Retrieval and Classification Techniques
  • Remote Sensing and Land Use
  • Multimodal Machine Learning Applications
  • Industrial Vision Systems and Defect Detection
  • Face recognition and analysis
  • AI in cancer detection
  • Domain Adaptation and Few-Shot Learning
  • Advanced Battery Technologies Research
  • QR Code Applications and Technologies
  • Supercapacitor Materials and Fabrication
  • CCD and CMOS Imaging Sensors
  • Facial Nerve Paralysis Treatment and Research
  • Visual Attention and Saliency Detection
  • Medical Image Segmentation Techniques

Shanghai Artificial Intelligence Laboratory
2023

ShangHai JiAi Genetics & IVF Institute
2023

Dalian University of Technology
2022

Beijing University of Chemical Technology
2020-2021

Chongqing University of Posts and Telecommunications
2019

Compared to the great progress of large-scale vision transformers (ViTs) in recent years, models based on convolutional neural networks (CNNs) are still an early state. This work presents a new CNN-based foundation model, termed InternImage, which can obtain gain from increasing parameters and training data like ViTs. Different CNNs that focus large dense kernels, InternImage takes deformable convolution as core operator, so our model not only has effective receptive field required for...

10.1109/cvpr52729.2023.01385 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

We present the All-Seeing (AS) project: a large-scale data and model for recognizing understanding everything in open world. Using scalable engine that incorporates human feedback efficient models loop, we create new dataset (AS-1B) with over 1 billion regions annotated semantic tags, question-answering pairs, detailed captions. It covers wide range of 3.5 million common rare concepts real world, has 132.2 tokens describe their attributes. Leveraging this dataset, develop (ASM), unified...

10.48550/arxiv.2308.01907 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Object detection and instance segmentation in remote sensing images is a fundamental challenging task, due to the complexity of scenes targets. The latest methods tried take into account both efficiency accuracy segmentation. In order improve them, this paper, we propose single-shot convolutional neural network structure, which conceptually simple straightforward, meanwhile makes up for problem low networks. Our method, termed with SSS-Net, detects targets based on location object's center...

10.1109/igarss39084.2020.9324705 article EN IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium 2020-09-26

Object detection and semantic segmentation have achieved remarkable performance propelled by deep convolutional neural networks. However, neither of them can well parse deal with swarms rotating ships in remote sensing images. In this article, we pay more attention to the instance-level task, which recognizes objects effectively straightly. We propose a new network architecture, called orientated silhouette matching network, employing multiscale features masks enable single-shot...

10.1109/jstars.2021.3132005 article EN cc-by IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2021-12-02

Compared to the great progress of large-scale vision transformers (ViTs) in recent years, models based on convolutional neural networks (CNNs) are still an early state. This work presents a new CNN-based foundation model, termed InternImage, which can obtain gain from increasing parameters and training data like ViTs. Different CNNs that focus large dense kernels, InternImage takes deformable convolution as core operator, so our model not only has effective receptive field required for...

10.48550/arxiv.2211.05778 preprint EN cc-by arXiv (Cornell University) 2022-01-01

In this paper, we introduce a brand new dataset to promote the study of instance segmentation for objects with irregular shapes. Our key observation is that though irregularly shaped widely exist in daily life and industrial scenarios, they received little attention field due lack corresponding datasets. To fill gap, propose iShape, an shape segmentation. iShape contains six sub-datasets one real five synthetics, each represents scene typical shape. Unlike most existing datasets regular...

10.48550/arxiv.2109.15068 preprint EN public-domain arXiv (Cornell University) 2021-01-01

The rapid progress of photorealistic synthesis techniques has reached a critical point where the boundary between real and manipulated images starts to blur. Recently, mega-scale deep face forgery dataset, ForgeryNet which comprised 2.9 million 221,247 videos been released. It is by far largest publicly available in terms data-scale, manipulations (7 image-level approaches, 8 video-level approaches), perturbations (36 independent more mixed perturbations), annotations (6.3 classification...

10.48550/arxiv.2112.08325 preprint EN cc-by-nc-sa arXiv (Cornell University) 2021-01-01

In order to meet the requirements of image retrieval and real-time index storage massive datasets, a Content-based Image Retrieval (CBIR) based on vector quantization is used in this paper. By feature extraction depth residual network, improved Product Quantization (PQ) coding algorithm generate dynamic database under FAISS Framework, which can realize nonlinear large-scale CBIR with high robustness. Empirical results show that PQ has achieved good performance compression rate, recall rate...

10.1109/icicas48597.2019.00035 article EN 2021 International Conference on Intelligent Computing, Automation and Systems (ICICAS) 2019-12-01

Object detection and instance segmentation in remote sensing images is a fundamental challenging task, due to the complexity of scenes targets. The latest methods tried take into account both efficiency accuracy segmentation. In order improve them, this paper, we propose single-shot convolutional neural network structure, which conceptually simple straightforward, meanwhile makes up for problem low networks. Our method, termed with SSS-Net, detects targets based on location object's center...

10.48550/arxiv.2008.12447 preprint EN other-oa arXiv (Cornell University) 2020-01-01

With the development of technology and automation modern industry, finding a new kind energy storing device gradually occupies significant position. The supercapacitor takes up scientists’ eyes, because its advantages high watt density, long servicing life non-pollution. However, there is also some barriers blocking way forward development. In this project, we use MOF-5 as electrode material, combining with other metal atoms to enhance specific capacitance. Through researching, found that...

10.54097/hset.v13i.1352 article EN cc-by-nc Highlights in Science Engineering and Technology 2022-08-21
Coming Soon ...