NFDI4DS | UHH-SEMS - Publication Details

Shiwei Zhang

ORCID: 0000-0003-2870-3974

About

Contact & Profiles

A5113431536

Research Areas

Human Pose and Action Recognition
Anomaly Detection Techniques and Applications
Video Analysis and Summarization
Multimodal Machine Learning Applications
Generative Adversarial Networks and Image Synthesis
E-commerce and Technology Innovations
Video Surveillance and Tracking Methods
Multimedia Communication and Technology
Human Motion and Animation
Control and Dynamics of Mobile Robots
Digital Humanities and Scholarship
Machine Learning and Algorithms
Image and Video Quality Assessment
Water Systems and Optimization
Advanced Data and IoT Technologies
Advancements in Photolithography Techniques
VLSI and Analog Circuit Testing
Advanced Image and Video Retrieval Techniques
Adhesion, Friction, and Surface Interactions
Cellular Automata and Applications
Domain Adaptation and Few-Shot Learning
Gait Recognition and Analysis
Advanced Neural Network Applications
Connexins and lens biology
Advanced Steganography and Watermarking Techniques

South China University of Technology
2024-2025

Zhuhai Institute of Advanced Technology
2025

Xi'an Jiaotong University
2024

Alibaba Group (United States)
2022-2024

Siemens (China)
2024

Hebei University
2024

Hohai University
2021

Huazhong University of Science and Technology
2007-2019

Wuhan University
2007

Hubei Zhongshan Hospital
2007

Hybrid Relation Guided Set Matching for Few-shot Action Recognition

OPENALEX - Publications

Xiang Wang Shiwei Zhang Zhiwu Qing Mingqian Tang Zhengrong Zuo and 3 more

Current few-shot action recognition methods reach impressive performance by learning discriminative features for each video via episodic training and designing various temporal alignment strategies. Nevertheless, they are limited in that (a) learning individual features without considering the entire task may lose the most relevant information in the current episode, and (b) these alignment strategies may fail on misaligned instances. To overcome these two limitations, we propose a novel Hybrid Relation guided Set Matching (HyRSM) approach...

10.1109/cvpr52688.2022.01932 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition

OPENALEX - Publications

Xiang Wang Shiwei Zhang Zhiwu Qing Changxin Gao Yingya Zhang and 2 more

Current state-of-the-art approaches for few-shot action recognition achieve promising performance by conducting frame-level matching on learned visual features. However, they generally suffer from two limitations: i) the matching procedure between local frames tends to be inaccurate due to the lack of guidance to force long-range temporal perception; ii) explicit motion learning is usually ignored, leading to partial information loss. To address these issues, we develop a Motion-augmented Long-short Contrastive...

10.1109/cvpr52729.2023.01727 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection

OPENALEX - Publications

Lin Song Shiwei Zhang Gang Yu Hongbin Sun

Current state-of-the-art approaches for spatio-temporal action detection have achieved impressive results but remain unsatisfactory for temporal extent detection. The main reason is that some ambiguous states resemble the real actions and may be treated as target actions even by a well-trained network. In this paper, we define these ambiguous samples as "transitional states", and propose a Transition-Aware Context Network (TACNet) to distinguish transitional states. The proposed TACNet includes two...

10.1109/cvpr.2019.01226 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion

OPENALEX - Publications

Yujie Wei Shiwei Zhang Zhiwu Qing Hangjie Yuan Zhi‐Heng Liu and 4 more

10.1109/cvpr52733.2024.00625 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Fabrication and Performance of Aluminum-Based Composite Wicks Using a Two-Step Laser-Sintering Process

OPENALEX - Publications

Tang Yong Yuxin Wei Tong Sun Jingjing Bai Fangqiong Luo and 4 more

The evolution of 5G technology necessitates effective thermal management strategies for compact, high-power devices. The potential of aluminum-based vapor chambers (VCs) as thermal management solutions is recognized, yet their heat transfer performance is limited by the capillary constraints of the wick structures. This study proposes a laser-sintered composite wick to address this limitation. Experimental evaluations were conducted on microgroove wicks (MW) and groove–spiral woven mesh composite wicks (GSCW), utilizing ethanol and acetone as working fluids....

10.3390/mi16040370 article EN cc-by Micromachines 2025-03-25

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing

OPENALEX - Publications

Lingling Cai Kang Zhao Hangjie Yuan Yingya Zhang Shiwei Zhang and 1 more

Text-to-video diffusion models have made remarkable advancements. Driven by their ability to generate temporally coherent videos, research on zero-shot video editing using these fundamental models has expanded rapidly. To enhance editing quality, structural controls are frequently employed in video editing. Among these techniques, cross-attention mask control stands out for its effectiveness and efficiency. However, when cross-attention masks are naively applied to video editing, they can introduce artifacts such as blurring and flickering. Our...

10.1609/aaai.v39i2.32185 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation

OPENALEX - Publications

Zhiwu Qing Shiwei Zhang Jiayu Wang Xiang Wang Yujie Wei and 3 more

10.1109/cvpr52733.2024.00634 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Discriminative Part Selection For Human Action Recognition

OPENALEX - Publications

Shiwei Zhang Changxin Gao Jing Zhang Feifei Chen Nong Sang

Semantic parts have shown a powerful discriminative capacity for action recognition. However, many existing methods select parts according to predefined heuristic rules, which may cause the correlation among parts to be lost, or do not appropriately consider the cluttered candidate part space, which may result in weak generalizability of the resulting action labels. Therefore, better consideration and refinement of the candidate part space will lead to a more discriminative action representation. This paper achieves improved performance by elegantly addressing these two factors....

10.1109/tmm.2017.2758524 article EN IEEE Transactions on Multimedia 2017-01-01

Group Sparse-Based Mid-Level Representation for Action Recognition

OPENALEX - Publications

Shiwei Zhang Changxin Gao Feifei Chen Sihui Luo Nong Sang

Mid-level parts are shown to be effective for human action recognition in videos. Typically, these semantic parts are first mined with some heuristic rules, then videos are represented via the volumetric max-pooling (VMP) method. However, these methods have two issues: 1) the VMP strategy divides videos by static grids. In this case, a part may occur in different localizations in different videos. That means the VMP loses space-time invariance. To solve this problem, we propose to apply a saliency-driven scheme to represent the video. We extract video cues from the saliency map,...

10.1109/tsmc.2016.2625840 article EN IEEE Transactions on Systems Man and Cybernetics Systems 2016-11-29

Aconitine alters connexin43 phosphorylation status and [Ca2+] oscillation patterns in cultured ventricular myocytes of neonatal rats

OPENALEX - Publications

Shiwei Zhang Yan Liu Guangzhao Huang Liang Liu

10.1016/j.tiv.2007.06.013 article EN Toxicology in Vitro 2007-07-08

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

OPENALEX - Publications

Xiang Wang Shiwei Zhang Hangjie Yuan Zhiwu Qing Biao Gong and 4 more

10.1109/cvpr52733.2024.00628 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Self-Supervised Learning for Semi-Supervised Temporal Action Proposal

OPENALEX - Publications

Xiang Wang Shiwei Zhang Zhiwu Qing Yuanjie Shao Changxin Gao and 1 more

Self-supervised learning presents a remarkable performance to utilize unlabeled data for various video tasks. In this paper, we focus on applying the power of self-supervised methods to improve semi-supervised action proposal generation. Particularly, we design an effective Self-supervised Semi-supervised Temporal Action Proposal (SSTAP) framework. The SSTAP contains two crucial branches, i.e., a temporal-aware branch and a relation-aware branch. The temporal-aware branch improves the model by introducing temporal perturbations, i.e., temporal feature shift...

10.48550/arxiv.2104.03214 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Context-aware Proposal Network for Temporal Action Detection

OPENALEX - Publications

Xiang Wang Hua‐Xin Zhang Shiwei Zhang Changxin Gao Yuanjie Shao and 1 more

This technical report presents our first place winning solution for the temporal action detection task in the CVPR-2022 ActivityNet Challenge. The task aims to localize the temporal boundaries of action instances with specific classes in long untrimmed videos. Recent mainstream attempts are based on dense boundary matchings and enumerate all possible combinations to produce proposals. We argue that the generated proposals contain rich contextual information, which may benefit detection confidence prediction. To this end, our method mainly...

10.48550/arxiv.2206.09082 preprint EN other-oa arXiv (Cornell University) 2022-01-01

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models

OPENALEX - Publications

Rui Zhao Hangjie Yuan Yujie Wei Shiwei Zhang Yuchao Gu and 6 more

Recent advancements in generation models have showcased remarkable capabilities in generating fantastic content. However, most of them are trained on proprietary high-quality data, and some models withhold their parameters and only provide accessible application programming interfaces (APIs), limiting their benefits for downstream tasks. To explore the feasibility of training a text-to-image generation model comparable to advanced models using publicly available resources, we introduce EvolveDirector. This framework interacts with...

10.48550/arxiv.2410.07133 preprint EN arXiv (Cornell University) 2024-10-09

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

OPENALEX - Publications

Yujie Wei Shiwei Zhang Hangjie Yuan Wang Xiang Haonan Qiu and 7 more

Recent advances in customized video generation have enabled users to create videos tailored to both specific subjects and motion trajectories. However, existing methods often require complicated test-time fine-tuning and struggle with balancing subject learning and motion control, limiting their real-world applications. In this paper, we present DreamVideo-2, a zero-shot video customization framework capable of generating videos with a specific subject and motion trajectory, guided by a single image and a bounding box sequence, respectively, and without the need for...

10.48550/arxiv.2410.13830 preprint EN arXiv (Cornell University) 2024-10-17

One point is all you need for weakly supervised object detection

OPENALEX - Publications

Shiwei Zhang Zhengzheng Wang Wei Ke

10.1016/j.patcog.2024.111087 article EN Pattern Recognition 2024-10-01

A multi-strategy improved sparrow search algorithm for indoor AGV path planning

OPENALEX - Publications

Shiwei Zhang Jinzhuang Xiao Yingying Liu Meiya Dong Zhen Zhou

To address the problems of weak search ability, easily falling into local optimal solutions, and poor path quality of the sparrow search algorithm in AGV path planning, a multi-strategy improved sparrow search algorithm (MISSA) is proposed in this paper. MISSA improves the global search ability by improving the discoverer position update operator and introducing the sine cosine algorithm; adopts an adaptive number of vigilantes and an adaptive adjustment step size to improve the convergence speed; introduces the Levy flight variation strategy to reduce the probability of falling into a local optimal solution; optimizes...

10.3233/jifs-234357 article EN Journal of Intelligent & Fuzzy Systems 2024-10-25

Boosting Semi-supervised Crowd Counting with Scale-based Active Learning

OPENALEX - Publications

Shiwei Zhang Wei Ke Shuai Liu Xiaopeng Hong Tong Zhang

10.1145/3664647.3680976 article EN 2024-10-26

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing

OPENALEX - Publications

Lingling Cai Kang Zhao Hangjie Yuan Yingya Zhang Shiwei Zhang and 1 more

10.48550/arxiv.2409.20500 preprint EN arXiv (Cornell University) 2024-09-30

Gradient-Wettable Multiwedge Patterned Surface for Effective Transport of Droplets against the Temperature Gradient

OPENALEX - Publications

Jingjing Zhai Jie Zhang Liyuan Xu Qiankai Liu Liang Li and 3 more

With the rapid advancement of electronic integration technology, the requirements for the working environment and stability of heat dissipation equipment have become increasingly stringent. Consequently, studying a high-efficiency gas–liquid two-phase heat transfer surface holds significant importance. Aiming at the limited liquid transport performance caused by the temperature gradient in the heat transfer process, this paper combines the wetting gradient with the shape gradient and proposes a gradient-wettable multiwedge patterned surface, where droplets can be...

10.1021/acsami.4c13342 article EN ACS Applied Materials & Interfaces 2024-11-04

A fully automatic and generic method for classifying repeating array designs

OPENALEX - Publications

Dion King Ying Zhang Qijian Wan Ruizhi Hou Shiwei Zhang and 1 more

Optical proximity correction (OPC) plays a critical role in the entire semiconductor manufacturing process. The consistency of identical patterns within the same context becomes increasingly crucial to ensure high performance during OPC processing, especially in areas like SRAM regions. Consistency checking essentially involves the classification of repeated patterns and the comparison of pattern layers (i.e., OPC results). While mini-array designs can often be easily identified manually, there are still instances where...

10.1117/12.3052902 article EN 2024-12-10

TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection

OPENALEX - Publications

Lin Song Shiwei Zhang Gang Yu Hongbin Sun

Current state-of-the-art approaches for spatio-temporal action detection have achieved impressive results but remain unsatisfactory for temporal extent detection. The main reason is that some ambiguous states resemble the real actions and may be treated as target actions even by a well-trained network. In this paper, we define these ambiguous samples as "transitional states", and propose a Transition-Aware Context Network (TACNet) to distinguish transitional states. The proposed TACNet includes two...

10.48550/arxiv.1905.13417 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Mid-level parts mined by feature selection for action recognition

OPENALEX - Publications

Shiwei Zhang Nong Sang Changxin Gao FeiFei Chen Jing Hu

This paper develops a method to learn very few discriminative part detectors from training videos directly, for action recognition. We hold the opinion that being discriminative for action classification is of primary importance in selecting part detectors, not just being intuitive. For this purpose, a part selection method based on feature selection is proposed, employing the SVM method. Firstly, a large number of candidate part detectors are trained using k-means and Exemplar-LDA techniques in the whitened feature space. Secondly, each part detector is regarded as a visual feature, so part selection can be achieved...

10.1109/acpr.2015.7486577 article EN 2015-11-01

Research on PD-IoT Cloud Master Station Architecture Based on Blockchain Technology

OPENALEX - Publications

Zhu Hai-peng Lei Zhao Kun Qin He Li Di Wang and 1 more

Blockchain technology is a new type of distributed database solution, which has unique advantages in terms of decentralization, security and transparency. These characteristics of the blockchain can solve some typical technical problems in the construction of the current power distribution IoT cloud master station. In this context, the architecture design and implementation of the cloud master station based on blockchain are first discussed; then, blockchain technology is integrated from three aspects: performance, state estimation algorithm, and deep search engine. The...

10.1088/1757-899x/768/6/062052 article EN IOP Conference Series Materials Science and Engineering 2020-03-01

Research on government network public opinion monitoring algorithm under the background of sustainable smart government

OPENALEX - Publications

Shiwei Zhang

10.1504/ijnvo.2023.10054645 article EN International Journal of Networking and Virtual Organisations 2023-01-01
