- Human Pose and Action Recognition
- Multimodal Machine Learning Applications
- Mobile Ad Hoc Networks
- Energy Efficient Wireless Sensor Networks
- Gait Recognition and Analysis
- Anomaly Detection Techniques and Applications
- Video Surveillance and Tracking Methods
- Wireless Networks and Protocols
- Topic Modeling
- Advanced Image and Video Retrieval Techniques
- Advanced Vision and Imaging
- Advanced Neural Network Applications
- Explainable Artificial Intelligence (XAI)
- Mobile Agent-Based Network Management
- Advanced Graph Neural Networks
- Domain Adaptation and Few-Shot Learning
- Blind Source Separation Techniques
- Image and Signal Denoising Methods
- Wireless Communication Networks Research
- Advanced Algorithms and Applications
- Medical Image Segmentation Techniques
- Medical Imaging Techniques and Applications
- Advanced Computing and Algorithms
- Graph Theory and Algorithms
- Language and cultural evolution
Xuzhou University of Technology
2025
University of Hong Kong
2024
Wuhan University of Technology
2002-2024
Chinese University of Hong Kong
2024
Sinopec (China)
2024
Xidian University
2020-2023
Shandong First Medical University
2010-2016
Shandong Tumor Hospital
2010-2016
Xi'an Railway Survey and Design Institute
2012
Northwestern Polytechnical University
2003
Deep learning techniques have led to remarkable breakthroughs in the field of object detection and spawned a lot scene-understanding tasks recent years. Scene graph has been focus research because its powerful semantic representation applications scene understanding. Graph Generation (SGG) refers task automatically mapping an image or video into structural graph, which requires correct labeling detected objects their relationships. In this paper, comprehensive survey achievements is...
Various methods to deal with graph data have been proposed in recent years. However, most of these focus on feature aggregation rather than pooling. Besides, the existing top-k selection pooling a few problems. First, construct pooled topology, current evaluate importance node from single perspective only, which is simplistic and unobjective. Second, information unselected nodes directly lost during process, inevitably leads massive loss information. To solve problems mentioned above, we...
Abstract Aiming at the problems of many path inflection points, unsmooth paths, and poor local obstacle avoidance in planning inspection robots static-dynamic scenes under complex geological conditions coal mine roadways, a hybrid method based on improved A* algorithm dynamic window approach (DWA) is proposed. First, robot platform system model are constructed. An heuristic function that incorporates target weight information proposed global algorithm. Additionally, redundant nodes...
Recently, Large Language Models (LLMs) and Multimodal (MLLMs) have shown promise in instruction following image understanding. While these models are powerful, they not yet been developed to comprehend the more challenging 3D geometric physical scenes, especially when it comes sparse outdoor LiDAR data. In this paper, we introduce LiDAR-LLM, which takes raw data as input harnesses remarkable reasoning capabilities of LLMs gain a comprehensive understanding scenes. The central insight our...
For a given video-based Human-Object Interaction scene, modeling the spatio-temporal relationship between humans and objects is important cue to understand contextual information presented in video. With efficient modeling, it possible not only uncover each frame, but directly capture inter-frame dependencies as well. Capturing position changes of human over dimension more critical when significant appearance features may occur time. When utilizing features, spatial location semantic are...
Video-based human-object interaction recognition is a challenging task since the state of objects as well their correlations change constantly in video. Existing methods mainly use 3DCNN or separate components (e.g., GCN + RNN) to model spatial correlation temporal respectively, but ignore modeling spatio-temporal simultaneously and long-term dynamics objects. In this paper, we propose novel model, named Spatio-Temporal Interaction Graph Parsing Networks (STIGPN), for videos. STIGPN captures...
Node power management is one of the key problems in wireless sensor networks. This paper proposes a new method by using genetic algorithm which has characteristic auto-adapted global optimization probability searching. Under condition connectivity between nodes, this can calculate optimum route link from source node to destination entire network, thus reduces quantity communication nodes as well network power. The simulation results indicated that be applied perfectly and effect energy...
Flowcharts and mind maps, collectively known as flowmind, are vital in daily activities, with hand-drawn versions facilitating real-time collaboration. However, there's a growing need to digitize them for efficient processing. Automated conversion methods essential overcome manual challenges. Existing sketch recognition face limitations practical situations, being field-specific lacking digital steps. Our paper introduces the Flowmind2digital method hdFlowmind dataset address these...
Predicting the future motion of surrounding agents is essential for autonomous vehicles (AVs) to operate safely in dynamic, human-robot-mixed environments. Context information, such as road maps and agents' states, provides crucial geometric semantic information behavior prediction. To this end, recent works explore two-stage prediction frameworks where coarse trajectories are first proposed, then used select critical context trajectory refinement. However, they either incur a large amount...
Neuromorphic sensors, specifically event cameras, revolutionize visual data acquisition by capturing pixel intensity changes with exceptional dynamic range, minimal latency, and energy efficiency, setting them apart from conventional frame-based cameras. The distinctive capabilities of cameras have ignited significant interest in the domain event-based action recognition, recognizing their vast potential for advancement. However, development this field is currently slowed lack comprehensive,...
Heterogeneous phase combined flooding (HPCF) has been a promising technology used for enhancing oil recovery in heterogeneous mature reservoirs. However, the injectivity and propagation behavior of preformed particle gel (PPG) low–medium-permeability reservoir porous media is crucial HPCF treatment reservoir. Thus, were systematically studied by conducting series sand pack experiments. The matching factor (δ) was defined as ratio average size PPG particles to mean pore throats pressure...
Photo-realistic and controllable 3D avatars are crucial for various applications such as virtual mixed reality (VR/MR), telepresence, gaming, film production. Traditional methods avatar creation often involve time-consuming scanning reconstruction processes each avatar, which limits their scalability. Furthermore, these do not offer the flexibility to sample new identities or modify existing ones. On other hand, by learning a strong prior from data, generative models provide promising...
Predicting the future motion of surrounding agents is essential for autonomous vehicles (AVs) to operate safely in dynamic, human-robot-mixed environments. However, scarcity large-scale driving datasets has hindered development robust and generalizable prediction models, limiting their ability capture complex interactions road geometries. Inspired by recent advances natural language processing (NLP) computer vision (CV), self-supervised learning (SSL) gained significant attention community...
Video face swapping is becoming increasingly popular across various applications, yet existing methods primarily focus on static images and struggle with video because of temporal consistency complex scenarios. In this paper, we present the first diffusion-based framework specifically designed for swapping. Our approach introduces a novel image-video hybrid training that leverages both abundant image data sequences, addressing inherent limitations video-only training. The incorporates...