NFDI4DS | UHH-SEMS - Publication Details

Shiliang Zhang

ORCID: 0000-0001-9053-9314

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5055433405

Research Areas

Video Surveillance and Tracking Methods
Advanced Image and Video Retrieval Techniques
Human Pose and Action Recognition
Advanced Neural Network Applications
Multimodal Machine Learning Applications
Image Retrieval and Classification Techniques
Gait Recognition and Analysis
Speech and Audio Processing
Music and Audio Processing
Face recognition and analysis
Speech Recognition and Synthesis
Neuroscience and Neuropharmacology Research
Domain Adaptation and Few-Shot Learning
Video Analysis and Summarization
Neurotransmitter Receptor Influence on Behavior
Visual Attention and Saliency Detection
Robotics and Sensor-Based Localization
Anomaly Detection Techniques and Applications
Photoreceptor and optogenetics research
Neural dynamics and brain function
Advanced Memory and Neural Computing
Receptor Mechanisms and Signaling
Machine Learning and ELM
Face and Expression Recognition
Emotion and Mood Recognition

Alibaba Group (China)
2022-2025

Peking University
2016-2025

National Institute on Drug Abuse
2014-2024

Ningde Normal University
2012-2024

Alibaba Group (United States)
2019-2024

Peng Cheng Laboratory
2022-2023

Tianjin University
2023

National Institutes of Health
2015-2023

Northwestern Polytechnical University
2023

King University
2019-2021

Person Transfer GAN to Bridge Domain Gap for Person Re-identification

OPENALEX - Publications

Longhui Wei Shiliang Zhang Wen Gao Qi Tian

Although the performance of person Re-Identification (ReID) has been significantly boosted, many challenging issues in real scenarios have not fully investigated, e.g., complex scenes and lighting variations, viewpoint pose changes, large number identities a camera network. To facilitate research towards conquering those issues, this paper contributes new dataset called MSMT171 with important features, 1) raw videos are taken by an 15-camera network deployed both indoor outdoor scenes, 2)...

10.1109/cvpr.2018.00016 preprint EN 2018-06-01

Pose-Driven Deep Convolutional Model for Person Re-identification

OPENALEX - Publications

Chi Su Jianing Li Shiliang Zhang Junliang Xing Wen Gao and 1 more

Feature extraction and matching are two crucial components in person Re-Identification (ReID). The large pose deformations the complex view variations exhibited by captured images significantly increase difficulty of learning features from images. To overcome these difficulties, this work we propose a Pose-driven Deep Convolutional (PDC) model to learn improved feature models end end. Our deep architecture explicitly leverages human part cues alleviate robust representations both global...

10.1109/iccv.2017.427 article EN 2017-10-01

Senolytic therapy alleviates Aβ-associated oligodendrocyte progenitor cell senescence and cognitive deficits in an Alzheimer’s disease model

OPENALEX - Publications

Peisu Zhang Yuki Kishimoto Ioannis Grammatikakis Kamalvishnu Gottimukkala Roy G. Cutler and 6 more

10.1038/s41593-019-0372-9 article EN Nature Neuroscience 2019-04-01

GLAD

OPENALEX - Publications

Longhui Wei Shiliang Zhang Hantao Yao Wen Gao Qi Tian

The huge variance of human pose and the misalignment detected images significantly increase difficulty person Re-Identification (Re-ID). Moreover, efficient Re-ID systems are required to cope with massive visual data being produced by video surveillance systems. Targeting solve these problems, this work proposes a Global-Local-Alignment Descriptor (GLAD) an indexing retrieval framework, respectively. GLAD explicitly leverages local global cues in body generate discriminative robust...

10.1145/3123266.3123279 preprint EN Proceedings of the 30th ACM International Conference on Multimedia 2017-10-19

Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching

OPENALEX - Publications

Shiqing Zhang Shiliang Zhang Tiejun Huang Wen Gao

Speech emotion recognition is challenging because of the affective gap between subjective emotions and low-level features. Integrating multilevel feature learning model training, deep convolutional neural networks (DCNN) has exhibited remarkable success in bridging semantic visual tasks like image classification, object detection. This paper explores how to utilize a DCNN bridge speech signals. To this end, we first extract three channels log Mel-spectrograms (static, delta, delta delta)...

10.1109/tmm.2017.2766843 article EN IEEE Transactions on Multimedia 2017-10-26

Single rodent mesohabenular axons release glutamate and GABA

OPENALEX - Publications

David H. Root Carlos A. Mejías-Aponte Shiliang Zhang Huiling Wang Alexander F. Hoffman and 2 more

10.1038/nn.3823 article EN Nature Neuroscience 2014-09-21

Unsupervised Person Re-Identification via Multi-Label Classification

OPENALEX - Publications

Dongkai Wang Shiliang Zhang

The challenge of unsupervised person re-identification (ReID) lies in learning discriminative features without true labels. This paper formulates ReID as a multi-label classification task to progressively seek Our method starts by assigning each image with single-class label, then evolves leveraging the updated model for label prediction. prediction comprises similarity computation and cycle consistency ensure quality predicted To boost training efficiency classification, we further propose...

10.1109/cvpr42600.2020.01099 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

DR2-Net: Deep Residual Reconstruction Network for image compressive sensing

OPENALEX - Publications

Hantao Yao Feng Dai Shiliang Zhang Yongdong Zhang Qi Tian and 1 more

10.1016/j.neucom.2019.05.006 article EN Neurocomputing 2019-05-29

Learning Affective Features With a Hybrid Deep Model for Audio–Visual Emotion Recognition

OPENALEX - Publications

Shiqing Zhang Shiliang Zhang Tiejun Huang Wen Gao Qi Tian

Emotion recognition is challenging due to the emotional gap between emotions and audio-visual features. Motivated by powerful feature learning ability of deep neural networks, this paper proposes bridge using a hybrid model, which first produces segment features with Convolutional Neural Networks (CNNs) 3D-CNN, then fuses in Deep Belief (DBNs). The proposed method trained two stages. First, CNN 3D-CNN models pre-trained on corresponding large-scale image video classification tasks are...

10.1109/tcsvt.2017.2719043 article EN IEEE Transactions on Circuits and Systems for Video Technology 2017-06-23

Bi-Directional Cascade Network for Perceptual Edge Detection

OPENALEX - Publications

Jianzhong He Shiliang Zhang Ming Yang Yanhu Shan Tiejun Huang

Exploiting multi-scale representations is critical to improve edge detection for objects at different scales. To extract edges dramatically scales, we propose a Bi-Directional Cascade Network (BDCN) structure, where an individual layer supervised by labeled its specific scale, rather than directly applying the same supervision all CNN outputs. Furthermore, enrich learned BDCN, introduce Scale Enhancement Module (SEM) which utilizes dilated convolution generate features, instead of using...

10.1109/cvpr.2019.00395 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Deep Representation Learning With Part Loss for Person Re-Identification

OPENALEX - Publications

Hantao Yao Shiliang Zhang Richang Hong Yongdong Zhang Changsheng Xu and 1 more

Learning discriminative representations for unseen person images is critical Re-Identification (ReID). Most of current approaches learn deep in classification tasks, which essentially minimize the empirical risk on training set. As shown our experiments, such commonly focus several body parts to set, rather than entire human body. Inspired by structural minimization principle SVM, we revise traditional representation learning procedure both and risk. The evaluated proposed part loss,...

10.1109/tip.2019.2891888 article EN IEEE Transactions on Image Processing 2019-01-10

Dopaminergic and glutamatergic microdomains in a subset of rodent mesoaccumbens axons

OPENALEX - Publications

Shiliang Zhang Jia Qi Xueping Li Huiling Wang Jonathan P. Britt and 4 more

10.1038/nn.3945 article EN Nature Neuroscience 2015-02-09

Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition

OPENALEX - Publications

Jianshu Zhang Jun Du Shiliang Zhang Dan Liu Yulong Hu and 3 more

10.1016/j.patcog.2017.06.017 article EN Pattern Recognition 2017-06-10

RAM: A Region-Aware Deep Model for Vehicle Re-Identification

OPENALEX - Publications

Xiaobin Liu Shiliang Zhang Qingming Huang Wen Gao

Previous works on vehicle Re-ID mainly focus extracting global features and learning distance metrics. Because some vehicles commonly share same model maker, it is hard to distinguish them based their appearances. Compared with the appearance, local regions such as decorations inspection stickers attached windshield, may be more distinctive for Re-ID. To embed detailed visual cues in those regions, we propose a Region-Aware deep Model (RAM). Specifically, addition features, RAM also extracts...

10.1109/icme.2018.8486589 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2018-07-01

Global-Local Temporal Representations for Video Person Re-Identification

OPENALEX - Publications

Jianing Li Shiliang Zhang Jingdong Wang Wen Gao Qi Tian

This paper proposes the Global-Local Temporal Representation (GLTR) to exploit multi-scale temporal cues in video sequences for person Re-Identification (ReID). GLTR is constructed by first modeling short-term among adjacent frames, then capturing long-term relations inconsecutive frames. Specifically, are modeled parallel dilated convolutions with different dilation rates represent motion and appearance of pedestrian. The captured a self-attention model alleviate occlusions noises...

10.1109/iccv.2019.00406 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Multi-Task Learning with Low Rank Attribute Embedding for Person Re-Identification

OPENALEX - Publications

Chi Su Fan Yang Shiliang Zhang Qi Tian Larry S. Davis and 1 more

We propose a novel Multi-Task Learning with Low Rank Attribute Embedding (MTL-LORAE) framework for person re-identification. Re-identifications from multiple cameras are regarded as related tasks to exploit shared information improve re-identification accuracy. Both low level features and semantic/data-driven attributes utilized. Since generally correlated, we introduce rank attribute embedding into the MTL formulation embed original binary continuous space, where incorrect incomplete...

10.1109/iccv.2015.426 article EN 2015-12-01

Intra-Inter Camera Similarity for Unsupervised Person Re-Identification

OPENALEX - Publications

Shiyu Xuan Shiliang Zhang

Most of unsupervised person Re-Identification (Re-ID) works produce pseudo-labels by measuring the feature similarity without considering distribution discrepancy among cameras, leading to degraded accuracy in label computation across cameras. This paper targets address this challenge studying a novel intra-inter camera for pseudo-label generation. We decompose sample into two stage, i.e., intra-camera and inter-camera computations, respectively. The directly leverages CNN features within...

10.1109/cvpr46437.2021.01175 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Dorsal Raphe Dual Serotonin-Glutamate Neurons Drive Reward by Establishing Excitatory Synapses on VTA Mesoaccumbens Dopamine Neurons

OPENALEX - Publications

Hui-Ling Wang Shiliang Zhang Jia Qi Huikun Wang Roger Cachope and 7 more

Dorsal raphe (DR) serotonin neurons provide a major input to the ventral tegmental area (VTA). Here, we show that DR transporter (SERT) establish both asymmetric and symmetric synapses on VTA dopamine neurons, but most of these are asymmetric. Moreover, DR-SERT terminals making coexpress vesicular glutamate 3 (VGluT3; for accumulation its synaptic release), suggesting excitatory nature synapses. photoactivation fibers promotes conditioned place preference, elicits currents mesoaccumbens...

10.1016/j.celrep.2019.01.014 article EN cc-by-nc-nd Cell Reports 2019-01-01

Multi-Scale 3D Convolution Network for Video Based Person Re-Identification

OPENALEX - Publications

Jianing Li Shiliang Zhang Tiejun Huang

This paper proposes a two-stream convolution network to extract spatial and temporal cues for video based person ReIdentification (ReID). A stream in this is constructed by inserting several Multi-scale 3D (M3D) layers into 2D CNN network. The resulting M3D introduces fraction of parameters the CNN, but gains ability multi-scale feature learning. With compact architecture, also more efficient easier optimize than existing networks. further involves Residual Attention Layers (RAL) refine...

10.1609/aaai.v33i01.33018618 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

AAformer: Auto-Aligned Transformer for Person Re-Identification

OPENALEX - Publications

Kuan Zhu Haiyun Guo Shiliang Zhang Yaowei Wang Jing Liu and 2 more

In person re-identification (re-ID), extracting part-level features from images has been verified to be crucial offer fine-grained information. Most of the existing CNN-based methods only locate human parts coarsely, or rely on pretrained parsing models and fail in locating identifiable nonhuman (e.g., knapsack). this article, we introduce an alignment scheme transformer architecture for first time propose auto-aligned (AAformer) automatically both ones at patch level. We "Part tokens...

10.1109/tnnls.2023.3301856 article EN IEEE Transactions on Neural Networks and Learning Systems 2023-08-25

Descriptive visual words and visual phrases for image applications

OPENALEX - Publications

Shiliang Zhang Qi Tian Gang Hua Qingming Huang Shipeng Li

The Bag-of-visual Words (BoW) image representation has been applied for various problems in the fields of multimedia and computer vision. basic idea is to represent images as visual documents composed repeatable distinctive elements, which are comparable words texts. However, massive experiments show that commonly used not expressive text words, desirable because it hinders their effectiveness applications. In this paper, Descriptive Visual (DVWs) Phrases (DVPs) proposed correspondences...

10.1145/1631272.1631285 article EN Proceedings of the 30th ACM International Conference on Multimedia 2009-10-19

A glutamatergic reward input from the dorsal raphe to ventral tegmental area dopamine neurons

OPENALEX - Publications

Jia Qi Shiliang Zhang Huiling Wang Huikun Wang José de Jesús Aceves Buendía and 4 more

10.1038/ncomms6390 article EN Nature Communications 2014-11-12

The Anterior Insular Cortex→Central Amygdala Glutamatergic Pathway Is Critical to Relapse after Contingency Management

OPENALEX - Publications

Marco Vènniro Daniele Caprioli Michelle Zhang Leslie R. Whitaker Shiliang Zhang and 8 more

Despite decades of research on neurobiological mechanisms psychostimulant addiction, the only effective treatment for many addicts is contingency management, a behavioral that uses alternative non-drug reward to maintain abstinence. However, when management discontinued, most relapse drug use. The brain underlying after cessation are largely unknown, and, until recently, an animal model this human condition did not exist. Here we used novel rat model, in which availability mutually exclusive...

10.1016/j.neuron.2017.09.024 article EN publisher-specific-oa Neuron 2017-10-01

GLAD: Global–Local-Alignment Descriptor for Scalable Person Re-Identification

OPENALEX - Publications

Longhui Wei Shiliang Zhang Hantao Yao Wen Gao Qi Tian

The huge variance of human pose and the misalign-ment detected images significantly increase difficulty pedestrian image matching in person Re-Identification (Re-ID). Moreover, massive visual data being produced by surveillance video cameras requires highly efficient Re-ID systems. Targeting to solve first problem, this work proposes a robust discriminative descriptor, namely, Global-Local-Alignment Descriptor (GLAD). For second treats as retrieval an indexing framework. GLAD explicitly...

10.1109/tmm.2018.2870522 article EN IEEE Transactions on Multimedia 2018-09-14

Coming Soon ...