NFDI4DS | UHH-SEMS - Publication Details

Jiewei Cao

ORCID: 0000-0003-0681-6134

About

Contact & Profiles

A5055565056

Research Areas

Advanced Image and Video Retrieval Techniques
Multimodal Machine Learning Applications
Image Retrieval and Classification Techniques
Human Pose and Action Recognition
Video Analysis and Summarization
Advanced Neural Network Applications
Robotics and Sensor-Based Localization
Video Surveillance and Tracking Methods
Gamma-ray bursts and supernovae
Space Satellite Systems and Control
Particle Detector Development and Performance
Image and Object Detection Techniques
X-ray Spectroscopy and Fluorescence Analysis
Astro and Planetary Science
3D Surveying and Cultural Heritage
Spacecraft Design and Technology
Advanced Vision and Imaging
Topic Modeling
Gait Recognition and Analysis
Satellite Communication Systems
Atomic and Molecular Physics
Text and Document Classification Technologies
Face and Expression Recognition
Natural Language Processing Techniques
Cancer-related molecular mechanisms research

Institute of High Energy Physics
2022-2024

Chinese Academy of Sciences
2024

National Institute for Astrophysics
2020-2022

University of Udine
2022

Hebei Medical University
2022

Second Hospital of Hebei Medical University
2022

The University of Queensland
2014-2021

The University of Adelaide
2019-2020

Australian Centre for Robotic Vision
2019

Tongji University
2019

Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification

OPENALEX - Publications

Xinyu Zhang Jiewei Cao Chunhua Shen Mingyu You

Person re-identification (Re-ID) has achieved great improvement with deep learning and a large amount of labelled training data. However, it remains a challenging task to adapt a model trained on labelled source-domain data to a target domain where only unlabelled data is available. In this work, we develop a self-training method with a progressive augmentation framework (PAST) to promote the model performance progressively on the target dataset. Specifically, our PAST framework consists of two stages, namely, the conservative stage and the promoting stage. The conservative stage captures the local...

10.1109/iccv.2019.00831 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks

Peng Wang Qi Wu Jiewei Cao Chunhua Shen Lianli Gao and 1 more

The task in referring expression comprehension is to localize the object instance in an image described by a referring expression phrased in natural language. As a language-to-vision matching task, the key to this problem is to learn a discriminative object feature that can adapt to the expression used. To avoid ambiguity, the expression normally tends to describe not only the properties of the referent itself, but also its relationships to its neighbourhood. To capture and exploit this important information we propose a graph-based, language-guided attention mechanism. Being composed of node attention component...

10.1109/cvpr.2019.00206 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Satellite Pose Estimation with Deep Landmark Regression and Nonlinear Pose Refinement

Bo Chen Jiewei Cao Álvaro Parra Tat-Jun Chin

We propose an approach to estimate the 6DOF pose of a satellite, relative to a canonical pose, from a single image. Such a problem is crucial in many space proximity operations, such as docking, debris removal, and inter-spacecraft communications. Our approach combines machine learning and geometric optimisation, by predicting the coordinates of a set of landmarks in the input image, associating the landmarks to their corresponding 3D points on an a priori reconstructed 3D model, then solving for the object pose using non-linear optimisation. Our approach is not only novel for this...

10.1109/iccvw.2019.00343 preprint EN 2019-10-01

End-to-End Learnable Geometric Vision by Backpropagating PnP Optimization

Bo Chen Álvaro Parra Jiewei Cao Nan Li Tat-Jun Chin

Deep networks excel in learning patterns from large amounts of data. On the other hand, many geometric vision tasks are specified as optimization problems. To seamlessly combine deep learning and geometric vision, it is vital to perform learning and geometric optimization end-to-end. Towards this aim, we present BPnP, a novel network module that backpropagates gradients through a Perspective-n-Points (PnP) solver to guide parameter updates of a neural network. Based on implicit differentiation, we show that the gradients of a "self-contained" PnP solver can be derived accurately...

10.1109/cvpr42600.2020.00812 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Part-Guided Attention Learning for Vehicle Instance Retrieval

Xinyu Zhang Rufeng Zhang Jiewei Cao Dong Gong Mingyu You and 1 more

Vehicle instance retrieval (IR) often requires one to recognize the fine-grained visual differences between vehicles. Besides the holistic appearance of vehicles, which is easily affected by viewpoint variation and distortion, vehicle parts also provide crucial cues to differentiate near-identical vehicles. Motivated by these observations, we introduce a Part-Guided Attention Network (PGAN) to pinpoint the prominent part regions and effectively combine the global and local information for discriminative feature learning....

10.1109/tits.2020.3030301 article EN IEEE Transactions on Intelligent Transportation Systems 2020-10-29

The HERMES-technologic and scientific pathfinder

F. Fiore L. Burderi Michèle Lavagna Roberto Bertacin Y. Evangelista and 94 more

HERMES-TP/SP (High Energy Rapid Modular Ensemble of Satellites Technologic and Scientific Pathfinder) is a constellation of six 3U nano-satellites hosting simple but innovative X-ray detectors, characterized by a large energy band and excellent temporal resolution, and thus optimized for the monitoring of Cosmic High Energy transients such as Gamma Ray Bursts and the electromagnetic counterparts of Gravitational Wave Events, and for the determination of their positions. The projects are funded by the Italian Ministry of University and Research and the Italian Space...

10.1117/12.2560680 preprint EN Space Telescopes and Instrumentation 2020: Ultraviolet to Gamma Ray 2020-12-12

Hierarchical Latent Concept Discovery for Video Event Detection

Chao Li Zi Huang Yang Yang Jiewei Cao Xiaoshuai Sun and 1 more

Semantic information is important for video event detection. How to automatically discover, model, and utilize semantic information to facilitate video event detection has been a challenging problem. In this paper, we propose a novel hierarchical video event detection model, which deliberately unifies the processes of underlying semantics discovery and event modeling from video data. Specifically, different from most of the approaches based on manually pre-defined concepts, we devise an effective model to automatically uncover video semantics by hierarchically capturing latent static-visual concepts in frame-level...

10.1109/tip.2017.2670782 article EN IEEE Transactions on Image Processing 2017-02-17

Scalable Video Event Retrieval by Visual State Binary Embedding

Litao Yu Zi Huang Jiewei Cao Heng Tao Shen

With the exponential increase of media data on the web, fast media retrieval is becoming a significant research topic in multimedia content analysis. Among the variety of techniques, learning binary embedding (hashing) functions is one of the most popular approaches that can achieve scalable information retrieval in large databases, and it is mainly used in near-duplicate multimedia search. However, till now most hashing methods are specifically designed for near-duplicate retrieval at the visual level rather than the semantic level. In this paper, we propose a visual state binary embedding (VSBE) model to...

10.1109/tmm.2016.2557059 article EN IEEE Transactions on Multimedia 2016-04-20

The enhanced x-ray timing and polarimetry mission: eXTP: an update on its scientific cases, mission profile and development status

Shuang‐Nan Zhang A. Santangelo Y. P. Xu Marco Feroci M. Hernanz and 95 more

The enhanced x-ray timing and polarimetry mission (eXTP) is a flagship observatory for x-ray timing, spectroscopy and polarimetry developed by an international consortium. Thanks to its very large collecting area, good spectral resolution and unprecedented polarimetry capabilities, eXTP will explore the properties of matter and the propagation of light in the most extreme conditions found in the universe. eXTP will, in addition, be a powerful x-ray observatory. The mission will continuously monitor the x-ray sky, and will enable multi-wavelength and multi-messenger studies. The mission is currently in phase B, which...

10.1117/12.2629340 article EN Space Telescopes and Instrumentation 2022: Ultraviolet to Gamma Ray 2022-09-02

Spatial-aware Multimodal Location Estimation for Social Images

Jiewei Cao Zi Huang Yang Yang

Nowadays the locations of social images play an important role in geographic knowledge discovery. However, most social images still lack location information, and location estimation for social images has recently become an active research topic. With the rapid growth of social images, new challenges have been posed: 1) data quality is an issue because social images are often associated with noisy and error-prone user-generated content, such as junk comments and misspelled words; and 2) data sparsity exists in social images despite the large volume, since most of them are unevenly distributed...

10.1145/2733373.2806249 article EN 2015-10-13

Where to Focus: Query Adaptive Matching for Instance Retrieval Using Convolutional Feature Maps

Jiewei Cao Lingqiao Liu Peng Wang Zi Huang Chunhua Shen and 1 more

Instance retrieval requires one to search for images that contain a particular object within a large corpus. Recent studies show that using image features generated by pooling convolutional layer feature maps (CFMs) of a pretrained convolutional neural network (CNN) leads to promising performance for this task. However, due to the global pooling strategy adopted in those works, the generated image feature is less robust to image clutter and tends to be contaminated by irrelevant image patterns. In this article, we alleviate this drawback by proposing a novel reranking algorithm using CFMs to refine...

10.48550/arxiv.1606.06811 preprint EN other-oa arXiv (Cornell University) 2016-01-01

The large area detector onboard the eXTP mission

M. Feroci G. Ambrosi Filippo Ambrosino M. Antonelli A. Argan and 95 more

The Large Area Detector (LAD) is the high-throughput, spectral-timing instrument onboard the eXTP mission, a flagship mission of the Chinese Academy of Sciences and the China National Space Administration, with a large European participation coordinated by Italy and Spain. The eXTP mission is currently performing its phase B study, with a target launch at the end-2027. The eXTP scientific payload includes four instruments (SFA, PFA, LAD and WFM) offering unprecedented simultaneous wide-band X-ray timing and polarimetry sensitivity. The LAD instrument is based on the design originally...

10.1117/12.2628814 article EN Space Telescopes and Instrumentation 2022: Ultraviolet to Gamma Ray 2022-08-31

Quartet-net Learning for Visual Instance Retrieval

Jiewei Cao Zi Huang Peng Wang Chao Li Xiaoshuai Sun and 1 more

Recently, neuron activations extracted from a pre-trained convolutional neural network (CNN) show promising performance in various visual tasks. However, due to the domain and task bias, using the features generated from the model pre-trained for image classification as image representations for instance retrieval is problematic. In this paper, we propose quartet-net learning to improve the discriminative power of CNN features for instance retrieval. The general idea is to map the features into a space where the image similarity can be better evaluated. Our network differs from the traditional...

10.1145/2964284.2967262 article EN Proceedings of the 24th ACM International Conference on Multimedia 2016-09-29

Jointly Modeling Static Visual Appearance and Temporal Pattern for Unsupervised Video Hashing

Chao Li Yang Yang Jiewei Cao Zi Huang

Recently, hashing has been evidenced as an efficient and effective method to facilitate large-scale video retrieval. Most of the existing hashing methods are based on visual features, which are expected to capture the appearance of videos. The intrinsic temporal pattern embedded in videos has also shown its discriminative power for similarity search, and is explored and utilised in some recent studies. However, how to leverage the strengths of both aspects remains unknown.

10.1145/3132847.3133030 article EN 2017-11-06

V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices

Damien Teney Peng Wang Jiewei Cao Lingqiao Liu Chunhua Shen and 1 more

Advances in machine learning have generated increasing enthusiasm for tasks that require high-level reasoning on top of perceptual capabilities, particularly over visual data. Such tasks include, for example, image captioning, visual question answering, and visual navigation. Their evaluation is however hindered by task-specific confounding factors and dataset biases. In parallel, the existing benchmarks for abstract reasoning are limited to synthetic stimuli (e.g. images of simple shapes) and do not capture the challenges of real-world data. We...

10.1609/aaai.v34i07.6885 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Dual attention granularity network for vehicle re-identification

Jianhua Zhang Jingbo Chen Jiewei Cao Ruyu Liu Linjie Bian and 1 more

10.1007/s00521-021-06559-6 article EN Neural Computing and Applications 2021-10-05

The Large Area Detector for the eXTP mission

M. Feroci G. Ambrosi M. Antonelli A. Argan Viktor Babinec and 95 more

10.1117/12.3019868 article EN Space Telescopes and Instrumentation 2024: Ultraviolet to Gamma Ray 2024-10-01

Leveraging Weak Semantic Relevance for Complex Video Event Classification

Chao Li Jiewei Cao Zi Huang Lei Zhu Heng Tao Shen

Existing video event classification approaches suffer from limited human-labeled semantic annotations. Weak semantic annotations can be harvested from Web-knowledge without involving any human interaction. However, such weak annotations are noisy, and thus cannot be effectively utilized without distinguishing their reliability. In this paper, we propose a novel approach to automatically maximize the utility of weak semantic annotations (formalized as the semantic relevance of video shots to the target event) to facilitate video event classification. A novel attention model is designed to determine the attention scores of video shots,...

10.1109/iccv.2017.394 article EN 2017-10-01

Part-Guided Attention Learning for Vehicle Instance Retrieval

Xinyu Zhang Rufeng Zhang Jiewei Cao Dong Gong Mingyu You and 1 more

Vehicle instance retrieval often requires one to recognize the fine-grained visual differences between vehicles. Besides the holistic appearance of vehicles, which is easily affected by viewpoint variation and distortion, vehicle parts also provide crucial cues to differentiate near-identical vehicles. Motivated by these observations, we introduce a Part-Guided Attention Network (PGAN) to pinpoint the prominent part regions and effectively combine the global and part information for discriminative feature learning. PGAN first detects...

10.48550/arxiv.1909.06023 preprint EN cc-by-nc-sa arXiv (Cornell University) 2019-01-01

MIR31HG Expression Predicts Poor Prognosis and Promotes Colorectal Cancer Progression

Jianlong Wang Bin Liu Jiewei Cao Lianmei Zhao Guiying Wang

Long noncoding RNAs (lncRNAs) are correlated with cancer pathogenesis and prognosis. Many studies have shown that aberrant expression of MIR31HG is implicated in the progression of cancer and patient prognosis. However, the biological function and predictive value of MIR31HG in colorectal cancer are unclear. The correlation between MIR31HG expression and the clinicopathological characteristics of colorectal cancer patients was analyzed by collating information from The Cancer Genome Atlas (TCGA) database. Kaplan-Meier analysis, and univariable and multivariable Cox regression analysis were performed to...

10.2147/cmar.s351928 article EN cc-by-nc Cancer Management and Research 2022-06-01

Memorizing Comprehensively to Learn Adaptively: Unsupervised Cross-Domain Person Re-ID with Multi-level Memory

Xinyu Zhang Dong Gong Jiewei Cao Chunhua Shen

Unsupervised cross-domain person re-identification (Re-ID) aims to adapt the information from the labelled source domain to an unlabelled target domain. Due to the lack of supervision in the target domain, it is crucial to identify the underlying similarity-and-dissimilarity relationships among the unlabelled samples in the target domain. In order to use the whole data relationships efficiently in mini-batch training, we apply a series of memory modules to maintain an up-to-date representation of the entire dataset. Unlike the simple exemplar memory in previous works, we propose a novel multi-level memory network (MMN)...

10.48550/arxiv.2001.04123 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Local Deep Descriptors in Bag-of-Words for Image Retrieval

Jiewei Cao Zi Huang Heng Tao Shen

The Bag-of-Words (BoW) models using the SIFT descriptors have achieved great success in content-based image retrieval over the past decade. Recent studies show that the neuron activations of convolutional neural networks (CNN) can be viewed as local descriptors, which can be aggregated into effective global descriptors for image retrieval. However, little work has been done on using these local deep descriptors in BoW models, especially in the case of large visual vocabularies.

10.1145/3126686.3127018 article EN 2017-10-23
