- Advanced Vision and Imaging
- 3D Shape Modeling and Analysis
- Human Pose and Action Recognition
- Computer Graphics and Visualization Techniques
- Advanced Battery Technologies Research
- Advancements in Battery Materials
- Human Motion and Animation
- Advanced Battery Materials and Technologies
- Robotics and Sensor-Based Localization
- Image and Video Quality Assessment
- Optical measurement and interference techniques
- Video Surveillance and Tracking Methods
- Spectroscopy and Chemometric Analyses
- Advanced Measurement and Detection Methods
- Astrophysics and Cosmic Phenomena
- EEG and Brain-Computer Interfaces
- Generative Adversarial Networks and Image Synthesis
- Advanced Image and Video Retrieval Techniques
- Visual Attention and Saliency Detection
- Facility Location and Emergency Management
- Virtual Reality Applications and Impacts
- Medical Coding and Health Information
- Smart Agriculture and AI
- Advanced Chemical Sensor Technologies
- Software Engineering Research
Tsinghua University
2015-2025
Fifth Affiliated Hospital of Zhengzhou University
2024
National Engineering Research Center for Information Technology in Agriculture
2023
Huaqiao University
2022
Harbin Institute of Technology
2021
Changchun University of Science and Technology
2020
Chinese People's Liberation Army
2020
Shenzhen Stock Exchange
2020
Shandong Jiaotong University
2019
Shanghai Jiao Tong University
2019
Recently, the Segment Anything Model (SAM) has rapidly gained attention due to its impressive segmentation performance on images. Despite its strong image segmentation ability and high interactivity with different prompts, we found that it performs poorly at consistent segmentation in videos. Therefore, in this report, we propose the Track Anything Model (TAM), which achieves high-performance interactive tracking and segmentation in videos. Specifically, given a video sequence, with only very little human participation, i.e., several clicks, people can track anything they...
Multi-person total motion capture is extremely challenging when it comes to handling severe occlusions, different reconstruction granularities for the body, face, and hands, drastically changing observation scales, and fast movements. To overcome these challenges, we contribute a lightweight system for multi-person interactive scenarios using only sparse multi-view cameras. By contributing a novel hand bootstrapping algorithm, our method is capable of efficient localization and accurate association of the...
Creating pose-driven human avatars is about modeling the mapping from the low-frequency driving pose to high-frequency dynamic appearances, so an effective encoding method that can encode high-fidelity details is essential to avatar modeling. To this end, we present PoseVocab, a novel encoding method that encourages the network to discover optimal embeddings for learning appearance. Given multi-view RGB videos of a character, PoseVocab constructs key poses and latent embeddings based on the training poses. To achieve generalization and temporal...
An agglomerate model for the impedance of a single micro-sized secondary particle, capable of accounting for internal electrochemical reactions and material transport, is deduced in three steps: (1) a set of basic equations for a porous battery electrode, adopting a refined structure of the electric double layer and considering its effect on the rate of the charge transfer reaction; (2) a model for primary particles, with the reaction occurring only at the surface, is developed. The model features a detailed description of lithium-ion transportation across the SEI...
In this paper, we propose an efficient method for robust 3D self-portraits using a single RGBD camera. Benefiting from the proposed PIFusion and a lightweight bundle adjustment algorithm, our method can generate detailed 3D self-portraits in seconds and shows the ability to handle subjects wearing extremely loose clothes. To achieve highly efficient and robust reconstruction, we propose PIFusion, which combines learning-based 3D recovery with volumetric non-rigid fusion to generate accurate sparse partial scans of the subject. Moreover, deformation is continuously refine...
The local key features in a video are important for improving the accuracy of human action recognition. However, most end-to-end methods focus on global feature learning from videos, while few works consider the enhancement of local information in a feature. In this article, we discuss how to automatically enhance the ability to discriminate local information and improve recognition accuracy. To address these problems, we assume that the critical level of each region for the recognition task is different and will not change with location shuffle. We therefore propose a novel...
We propose POse-guided SElective Fusion (POSEFusion), a single-view human volumetric capture method that leverages tracking-based methods and tracking-free inference to achieve high-fidelity and dynamic 3D reconstruction. By contributing a novel reconstruction framework which contains pose-guided keyframe selection and robust implicit surface fusion, our method fully utilizes the advantages of both methods and finally enables reconstruction of details even in invisible regions. We formulate keyframe selection as a dynamic programming problem...
Image translation for change detection or classification in bi-temporal remote sensing images is unique. Although it can acquire paired images, it is still unsupervised. Moreover, strict semantic preservation is always needed instead of multimodal outputs. In response to these problems, this paper proposes a new method, SRUIT (Semantically Robust Unsupervised Image-to-image Translation), which ensures semantically robust translation and produces deterministic output. Inspired by previous works, the method...
Severe deterioration of lithium-ion cells at low temperatures constitutes one of the bottlenecks for the wide adoption of electric vehicles.
The objective of this study is to determine the standard reference intervals for coagulation function and factors in children across various age groups.
Camera calibration, image feature detection, matching, and other aspects have become barriers that traditional 3D reconstruction methods find difficult to break through. The important role of deep learning in data detection and classification has had an impact on the real world, and it has become a research hotspot at home and abroad for dealing with this problem. In this paper, a 3D reconstruction method for image sequences based on depth learning is proposed. Firstly, the principle is introduced. Then, the new method is studied and discussed in combination with theory. Finally, the conclusion and prospect are given.
Creating high-fidelity 3D head avatars has always been a research hotspot, but there remains a great challenge under lightweight sparse-view setups. In this paper, we propose Gaussian Head Avatar, represented by controllable 3D Gaussians, for high-fidelity head avatar modeling. We optimize the neutral 3D Gaussians and a fully learned MLP-based deformation field to capture complex expressions. The two parts benefit each other, so our method can model fine-grained dynamic details while ensuring expression accuracy. Furthermore,...
Currently, user-based quality of experience (QoE) measurement methods (e.g., mean opinion score, MOS) are often employed. However, their results might be affected by human subjectivity and thoughts. Physiological methods can overcome these disadvantages. In the field of video models, the buffering problem caused by poor network conditions is an important factor that affects QoE. In this paper, a reasonable psychophysiological method, electroencephalography (EEG), is proposed to quantitatively analyze QoE changes when...
Entering the 5G era, the virtual reality (VR) business is developing rapidly, resulting in many new applications and scenarios. However, due to insufficient bandwidth and other reasons, VR video will undergo adaptive resolution reduction, which is one of the important factors affecting the quality of user experience (QoE). Due to the unique look and feel of visual VR, the evaluation model established on the basis of the 2D screen is not applicable. At the same time, because VR services are less popular, it is difficult to use big data to build a correct model in the short term...