Qi Zhang

ORCID: 0000-0003-0709-3273
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Image and Video Retrieval Techniques
  • Advanced Neural Network Applications
  • earthquake and tectonic studies
  • Image Retrieval and Classification Techniques
  • Advanced Image Processing Techniques
  • Advanced Vision and Imaging
  • Image Processing Techniques and Applications
  • Video Surveillance and Tracking Methods
  • Seismology and Earthquake Studies
  • Earthquake Detection and Analysis
  • Face recognition and analysis
  • Neural Networks and Applications
  • Medical Image Segmentation Techniques
  • Image and Signal Denoising Methods
  • Image Processing and 3D Reconstruction
  • Video Coding and Compression Technologies
  • Image and Video Quality Assessment
  • Face and Expression Recognition
  • Advanced Data Compression Techniques
  • Robotics and Sensor-Based Localization
  • Cryospheric studies and observations
  • Advanced Measurement and Detection Methods
  • Image Enhancement Techniques
  • Remote Sensing and Land Use
  • Digital Media Forensic Detection

Beijing Normal University
2024-2025

Shanghai Artificial Intelligence Laboratory
2025

Xi'an Jiaotong University
2016-2025

Peking University
2011-2025

Beijing Academy of Artificial Intelligence
2025

Meta (United States)
2024

Fujian Agriculture and Forestry University
2024

Anhui University
2024

University of Chinese Academy of Sciences
2016-2024

Optica
2024

This article offers a comprehensive AI-centric review of deep learning in exploring landslides with remote-sensing techniques, breaking new ground beyond traditional methodologies. We categorize tasks into five key frameworks—classification, detection, segmentation, sequence, and the hybrid framework—and analyze their specific applications landslide-related tasks. Following presented frameworks, we state-or-art studies provide clear insights powerful capability models for landslide mapping,...

10.3390/rs16081344 article EN cc-by Remote Sensing 2024-04-11

Abstract Seismology is witnessing rapid growth in both the volume and variety of earthquake observational data, but current tools for effectively integrating these heterogeneous data remain limited. Here, we propose SafeNet, a scalable deep learning framework designed to address challenges through use multimodal fusion neural networks. SafeNet integrates 282-dimensional seismic indicators from catalogs, capturing long-, medium-, short-term patterns, associates activity with geological...

10.1038/s41598-025-93877-7 article EN cc-by Scientific Reports 2025-03-21

Partial face recognition (PFR) in unconstrained environment is a very important task, especially video surveillance, mobile devices, etc. However, few studies have tackled how to recognize an arbitrary patch of image. This study combines Fully Convolutional Network (FCN) with Sparse Representation Classification (SRC) propose novel partial approach, called Dynamic Feature Matching (DFM), address images regardless size. Based on DFM, we sliding loss optimize FCN by reducing the...

10.1109/cvpr.2018.00737 article EN 2018-06-01

Visual storytelling aims at generating a story from an image stream. Most existing methods tend to represent images directly with the extracted high-level features, which is not intuitive and difficult interpret. We argue that translating each into graph-based semantic representation, i.e., scene graph, explicitly encodes objects relationships detected within image, would benefit representing describing images. To this end, we propose novel architecture for visual by modeling two-level on...

10.1609/aaai.v34i05.6455 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Landslides are a major geohazard that endangers human lives and properties. Recently, efforts have been made to use Synthetic Aperture Radar Interferometry (InSAR) for landslide monitoring. However, it is still difficult effectively automatically identify slow-moving landslides distributed over large area due phase unwrapping errors, decorrelation, troposphere turbulence computational requirements. In this study, we develop new approach combining phase-gradient stacking deep-learning network...

10.3389/fenvs.2022.963322 article EN cc-by Frontiers in Environmental Science 2022-08-31

Abstract Visual simultaneous localization and mapping (vSLAM) is inherently constrained by the static world assumption, which renders success in presence of dynamic objects rather challenging. In this paper, we propose a real-time semantic vSLAM system designed for both indoor outdoor environments. By employing object detection, identify 80 categories utilize motion consistency checks to pinpoint outliers each image. Distinct methods are presented examining states humans other objects. For...

10.1088/1361-6501/acd1a4 article EN cc-by Measurement Science and Technology 2023-05-02

Abstract. Drought indices are crucial for assessing and managing water scarcity agricultural risks; however, the lack of a unified data foundation in existing datasets leads to inconsistencies that challenge comparability drought indices. This study is dedicated creating CHM_Drought, an innovative comprehensive long-term meteorological dataset with spatial resolution 0.1° collected from 1961 2022 mainland China. It features six pivotal indices: standardized precipitation index (SPI),...

10.5194/essd-2024-270 preprint EN cc-by 2024-07-23

With the ever-increasing growth of Internet, numerous copies documents become serious problem for search engine, opinion mining and many other web applications. Since partial-duplicates only contain a small piece text taken from sources most existing near-duplicate detection approaches focus on document level, partial duplicates can not be dealt with well. In this paper, we propose novel algorithm to realize partial-duplicate task. Besides similarities between documents, our proposed...

10.1145/1835449.1835562 article EN 2010-07-19

Person re-identification (ReID), which aims at matching individuals across non-overlapping cameras, has attracted much attention in the field of computer vision due to its research significance and potential applications. Triplet loss-based CNN models have been very successful for person ReID, optimize feature embedding space such that distances between samples with same identity are shorter than those different identities. Researchers found hard triplets' mining is crucial success triplet...

10.1109/access.2018.2884743 article EN cc-by-nc-nd IEEE Access 2018-01-01

Near infrared (NIR) partial face images acquired in iris recognition less constrained environments contain plentiful identity information which have not been fully exploited. In this paper, a NIR (PFR) algorithm is designed according to the characteristics of systems. preprocessing stage, eye corners are utilized for alignment because regions usually visible. feature representation highly compact and discriminatory features extracted based on Multiscale Double Supervision Convolutional...

10.1109/btas.2016.7791187 article EN 2016-09-01

Document retrieval techniques form the foundation for development of large-scale information systems. The prevailing methodology is to construct a bi-encoder and compute semantic similarity. However, such scalar similarity difficult reflect enough impedes our comprehension results. In addition, this computational process mainly emphasizes global semantics ignores fine-grained relationship between query complex text in document. paper, we propose new method called $\textbf{Ge}$neration...

10.48550/arxiv.2501.02772 preprint EN arXiv (Cornell University) 2025-01-06

Abstract Objective: Deformable registration aims to achieve nonlinear alignment of image space by estimating a dense displacement field. It is commonly used as preprocessing step in clinical and analysis applications, such surgical planning, diagnostic assistance, navigation. We aim overcome these challenges: Deep learning-based methods often struggle with complex displacements lack effective interaction between global local feature information. They also neglect the spatial position...

10.1088/1361-6560/adaacd article EN Physics in Medicine and Biology 2025-01-15

Abstract. Precipitation is a critical driver of the water cycle, profoundly influencing resources, agricultural productivity, and natural disasters. However, existing gridded precipitation datasets exhibit markable deficiencies in capturing spatiotemporal physical correlations precipitation, which limits their accuracy, particularly regions with sparse meteorological stations. Therefore, this study proposes completely new generation scheme to address these issues. The long-term daily...

10.5194/essd-2025-20 preprint EN cc-by 2025-02-12

Atmospheric moisture plays a crucial role in connecting global water and energy exchanges within the cycle. Using recycling model, this study examines spatiotemporal characteristics of precipitation evaporation ratios (PRR ERR) across 200 river basins worldwide from 1980 to 2021, with data fused three reanalysis datasets. The results reveal that regions near equator exhibit higher PRR values, signifying strong self-sufficiency, whereas arid, high-latitude, inland show lower indicating...

10.5194/egusphere-egu25-21892 preprint EN 2025-03-15

Decoding natural visual scenes from brain activity has flourished, with extensive research in single-subject tasks and, however, less cross-subject tasks. Reconstructing high-quality images is a challenging problem due to profound individual differences between subjects and the scarcity of data annotation. In this work, we proposed MindTuner for decoding, which achieves rich semantic reconstructions using only 1 hour fMRI training benefiting phenomena fingerprint human system novel...

10.1609/aaai.v39i13.33560 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

10.1109/tgrs.2025.3568932 article EN IEEE Transactions on Geoscience and Remote Sensing 2025-01-01

The vehicle detection and tracking are important tasks in intelligent transportation system. traditional methods of often cause the coarse-grained results due to suffering from complex environments. YOLO is a pragmatic approach multi-target with simple effective algorithm. This paper use detect moving vehicles modified Kalman filter algorithm dynamically track detected vehicles, achieving overall competitive performance day or night. experimental show method robust occluding congested roads...

10.1109/ddcls.2019.8908873 article EN 2022 IEEE 11th Data Driven Control and Learning Systems Conference (DDCLS) 2019-05-01

The emergence of the Internet Things (IoT) has led to a remarkable increase in volume data generated at network edge. In order support real-time smart IoT applications, massive amounts from edge devices need be processed using methods such as deep neural networks (DNNs) with low latency. To improve application performance and minimize resource cost, enterprises have begun adopt Edge computing, computation paradigm that advocates processing input locally However, nodes are often...

10.1145/3404397.3404473 article EN 2020-08-09

Facial expression recognition (FER) task in the wild is challenging due to some uncertainties, such as ambiguity of facial expressions, subjective annotations, and low-quality images. A novel model for FER in-the-wild datasets proposed this study solve these uncertainties. The overview method follows. First, images are grouped into high low uncertainties by pre-trained network. graph convolutional network (GCN) framework then used with uncertainty obtain geometry cues, including relationship...

10.1016/j.jksuci.2023.101605 article EN cc-by-nc-nd Journal of King Saud University - Computer and Information Sciences 2023-06-09
Coming Soon ...