NFDI4DS | UHH-SEMS - Publication Details

Sijie Yan

ORCID: 0000-0003-4398-0590

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100381761

Research Areas

Human Pose and Action Recognition
Human Motion and Animation
Advanced Vision and Imaging
3D Shape Modeling and Analysis
Generative Adversarial Networks and Image Synthesis
Advanced Surface Polishing Techniques
Manufacturing Process and Optimization
Optical measurement and interference techniques
Robotics and Sensor-Based Localization
Robotic Mechanisms and Dynamics
Advanced Measurement and Metrology Techniques
Face recognition and analysis
Gait Recognition and Analysis
Advanced machining processes and optimization
Domain Adaptation and Few-Shot Learning
Image and Object Detection Techniques
Adversarial Robustness in Machine Learning
Robot Manipulation and Learning
Context-Aware Activity Recognition Systems
Dynamics and Control of Mechanical Systems
Image Processing and 3D Reconstruction
Multimodal Machine Learning Applications
Gaussian Processes and Bayesian Inference
Hand Gesture Recognition Systems
Advanced Sensor Technologies Research

Huazhong University of Science and Technology
2008-2024

Lanzhou University
2024

Chinese University of Hong Kong
2016-2022

Wuxi Institute of Technology
2022

Amazon (United States)
2020-2021

Jilin Province Science and Technology Department
2021

Jilin University
2021

Guangxi University
2008

Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition

OPENALEX - Publications

Sijie Yan Yuanjun Xiong Dahua Lin

Dynamics of human body skeletons convey significant information for action recognition. Conventional approaches modeling usually rely on hand-crafted parts or traversal rules, thus resulting in limited expressive power and difficulties generalization. In this work, we propose a novel model dynamic called Spatial-Temporal Graph Convolutional Networks (ST-GCN), which moves beyond the limitations previous methods by automatically learning both spatial temporal patterns from data. This...

10.1609/aaai.v32i1.12328 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-27

Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition

OPENALEX - Publications

Sijie Yan Yuanjun Xiong Dahua Lin

10.48550/arxiv.1801.07455 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Convolutional Sequence Generation for Skeleton-Based Action Synthesis

OPENALEX - Publications

Sijie Yan Zhizhong Li Yuanjun Xiong Huahan Yan Dahua Lin

In this work, we aim to generate long actions represented as sequences of skeletons. The generated must demonstrate continuous, meaningful human actions, while maintaining coherence among body parts. Instead generating skeletons sequentially following an autoregressive model, propose a framework that generates the entire sequence altogether by transforming from latent vectors sampled Gaussian process (GP). This framework, named Convolutional Sequence Generation Network (CSGN), jointly models...

10.1109/iccv.2019.00449 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis

OPENALEX - Publications

Jingbo Wang Yu Rong Jingyuan Liu Sijie Yan Dahua Lin and 1 more

The ability to synthesize long-term human motion sequences in real-world scenes can facilitate numerous applications. Previous approaches for scene-aware synthesis are constrained by pre-defined target objects or positions and thus limit the diversity of human-scene interactions synthesized motions. In this paper, we focus on problem synthesizing diverse motions under guidance action sequences. To achieve this, first decompose into three aspects, namely interaction (e.g. sitting different...

10.1109/cvpr52688.2022.01981 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Hand–Eye Calibration in Visually-Guided Robot Grinding

OPENALEX - Publications

Wenlong Li He Xie Gang Zhang Sijie Yan Zhouping Yin

Visually-guided robot grinding is a novel and promising automation technique for blade manufacturing. One common problem encountered in hand-eye calibration, which establishes the pose relationship between end effector (hand) scanning sensor (eye). This paper proposes new calibration approach belt grinding. The main contribution of this its consideration both joint parameter errors equation. objective function built solved, from 30 compensated values (corresponding to 24 parameters six...

10.1109/tcyb.2015.2483740 article EN IEEE Transactions on Cybernetics 2015-10-20

3-D Shape Matching of a Blade Surface in Robotic Grinding Applications

OPENALEX - Publications

Wenlong Li He Xie Gang Zhang Sijie Yan Zhouping Yin

Robotic grinding is a promising technique to generate the final shape of blades. It can relieve human from participating in dirty and noisy environments, improve product quality, lower production costs. One important task robotic 3-D matching. However, existing matching methods do not consider requirements associated with different allowances, which potentially lead an unstable force. This paper proposes novel method for grinding. The goal define new objective function considering allowance...

10.1109/tmech.2016.2574813 article EN IEEE/ASME Transactions on Mechatronics 2016-06-02

Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks

OPENALEX - Publications

Sijie Yan Ziwei Liu Ping Luo Shi Qiu Xiaogang Wang and 1 more

Fashion landmarks are functional key points defined on clothes, such as corners of neckline, hemline, and cuff. They have been recently introduced [18]as an effective visual representation for fashion image understanding. However, detecting challenging due to background clutters, human poses, scales. To remove the above variations, previous works usually assumed bounding boxes clothes provided in training test additional annotations, which expensive obtain inapplicable practice. This work...

10.1145/3123266.3123276 article EN Proceedings of the 30th ACM International Conference on Multimedia 2017-10-19

A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder

OPENALEX - Publications

Yujun Cai Yiwei Wang Yiheng Zhu Tat‐Jen Cham Jianfei Cai and 8 more

We present a unified and flexible framework to address the generalized problem of 3D motion synthesis that covers tasks prediction, completion, interpolation, spatial-temporal recovery. Since these have different input constraints various fidelity diversity requirements, most existing approaches only cater specific task or use architectures tasks. Here we propose based on Conditional Variational Auto-Encoder (CVAE), where treat any arbitrary as masked series. Notably, by considering this...

10.1109/iccv48922.2021.01144 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Scene-aware Generative Network for Human Motion Synthesis

OPENALEX - Publications

Jingbo Wang Sijie Yan Bo Dai Dahua Lin

We revisit human motion synthesis, a task useful in various real-world applications, this paper. Whereas number of methods have been developed previously for task, they are often limited two aspects: 1) focus on the poses while leaving location movement behind, and 2) ignore impact environment motion. In paper, we propose new framework, with interaction between scene taken into account. Considering uncertainty motion, formulate as generative whose objective is to generate plausible...

10.1109/cvpr46437.2021.01203 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

A vision-based human-robot collaborative system for digital twin

OPENALEX - Publications

Shuming Yi Sichao Liu Xiaohu Xu Xi Vincent Wang Sijie Yan and 1 more

Flexible and safe human-robot collaboration depends on accurately capturing the three-dimensional motion of humans robots in field smart manufacturing. In this paper, a novel approach to developing collaborative assembly system is proposed applied digital twins. Within context, deep learning-based model explored develop depth camera-based human recognition for accurate prediction key points skeletons high-precision localisation setting. After functional mapping robot calibration, collision...

10.1016/j.procir.2022.05.024 article EN Procedia CIRP 2022-01-01

Conformal Slit Mapping Based Spiral Tool Trajectory Planning for Ball-End Milling on Complex Freeform Surfaces

OPENALEX - Publications

Changqing Shen Bingzhou Xu Xiaojian Zhang Sijie Yan Han Ding

10.2139/ssrn.5218043 preprint EN 2025-01-01

Positive-Congruent Training: Towards Regression-Free Model Updates

OPENALEX - Publications

Sijie Yan Yuanjun Xiong Kaustav Kundu Shuo Yang Siqi Deng and 3 more

Reducing inconsistencies in the behavior of different versions an AI system can be as important practice reducing its overall error. In image classification, sample-wise appear "negative flips": A new model incorrectly predicts output for a test sample that was correctly classified by old (reference) model. Positive-congruent (PC) training aims at error rate while same time negative flips, thus maximizing congruency with reference only on positive predictions, unlike distillation. We propose...

10.1109/cvpr46437.2021.01407 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

A case study of blade inspection based on optical scanning method

OPENALEX - Publications

Wenlong Li Liping Zhou Sijie Yan

AbstractBlades play an important role in aviation engine, gas turbine and jet engine. Inspecting the blade by optical method is a meaningful work manufacturing industry. During inspecting process, one common problem encountered that scanned point cloud large scale noisy. In this paper, we present systematic introduction of simplification, smoothing parameter extraction with respect to point-sampled blades. First, moving least square surface applied create geometric deviation, which used...

10.1080/00207543.2014.974851 article EN International Journal of Production Research 2014-10-30

Third-order trajectory planning for high accuracy point-to-point motion

OPENALEX - Publications

Haihua Mu Yunfei Zhou Sijie Yan Aiguo Han

10.1007/s11460-009-0017-y article EN Frontiers of Electrical and Electronic Engineering in China 2008-12-18

Spiral complete coverage path planning based on conformal slit mapping in multi-connected domains

OPENALEX - Publications

Changqing Shen Sihao Mao Bingzhou Xu Ziwei Wang Xiaojian Zhang and 2 more

The generation of smoother and shorter spiral complete coverage paths in multi-connected domains is a crucial research topic path planning for robotic cavity machining other related fields. Traditional methods typically incorporate subregion division procedure that leads to excessive bridging, requiring longer, more sharply turning, unevenly spaced spirals achieve coverage. To address this issue, paper proposes novel method using conformal slit mapping. It takes advantage the fact mapping...

10.1177/02783649241251385 article EN The International Journal of Robotics Research 2024-05-10

Motion Guided 3D Pose Estimation from Videos

OPENALEX - Publications

Jingbo Wang Sijie Yan Yuanjun Xiong Dahua Lin

We propose a new loss function, called motion loss, for the problem of monocular 3D Human pose estimation from 2D pose. In computing simple yet effective representation keypoint motion, pairwise encoding, is introduced. design graph convolutional network architecture, U-shaped GCN (UGCN). It captures both short-term and long-term information to fully leverage additional supervision loss. experiment training UGCN with on two large scale benchmarks: Human3.6M MPI-INF-3DHP. Our model surpasses...

10.48550/arxiv.2004.13985 preprint EN other-oa arXiv (Cornell University) 2020-01-01

STAC: a spatio-temporal approximate method in data collection applications

OPENALEX - Publications

Xiaohui Wei Sijie Yan Xingwang Wang Mohsen Guizani Xiaojiang Du

10.1016/j.pmcj.2021.101371 article EN Pervasive and Mobile Computing 2021-03-07

Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks

OPENALEX - Publications

Sijie Yan Ziwei Liu Ping Luo Shi Qiu Xiaogang Wang and 1 more

Fashion landmarks are functional key points defined on clothes, such as corners of neckline, hemline, and cuff. They have been recently introduced an effective visual representation for fashion image understanding. However, detecting challenging due to background clutters, human poses, scales. To remove the above variations, previous works usually assumed bounding boxes clothes provided in training test additional annotations, which expensive obtain inapplicable practice. This work addresses...

10.48550/arxiv.1708.02044 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Deep learning assisted pressure measurements using femtosecond laser-induced grating scattering technique

OPENALEX - Publications

Tongxun Zhao Pengji Ding Yishan Wang Renjun Yang Sijie Yan and 4 more

Abstract The femtosecond laser-induced grating scattering (fs-LIGS) technique has recently been developed and applied for temperature pressure measurements. In this work, we combined deep learning with the fs-LIGS to predict gas-phase from raw signals without data post-processing. Two models, a fully connected neural network convolutional network, were trained master hidden relationship between features of signal traces corresponding under which was recorded. Accurate predictions by both...

10.1088/1361-6463/ad1e27 article EN Journal of Physics D Applied Physics 2024-01-12

A Full-duplex Speech Dialogue Scheme Based On Large Language Models

OPENALEX - Publications

Peng Wang Songshuo Lu Yaohua Tang Sijie Yan Yuanjun Xiong and 1 more

We present a generative dialogue system capable of operating in full-duplex manner, allowing for seamless interaction. It is based on large language model (LLM) carefully aligned to be aware perception module, motor function and the concept simple finite state machine (called neural FSM) with two states. The modules operate simultaneously, simultaneously speak listen user. LLM generates textual tokens inquiry responses makes autonomous decisions start responding to, wait for, or interrupt...

10.48550/arxiv.2405.19487 preprint EN arXiv (Cornell University) 2024-05-29

RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians

OPENALEX - Publications

Bingling Li Shengyi Chen Luchao Wang Kaimin He Sijie Yan and 1 more

In this work, we explore the possibility of training high-parameter 3D Gaussian splatting (3DGS) models on large-scale, high-resolution datasets. We design a general model parallel method for 3DGS, named RetinaGS, which uses proper rendering equation and can be applied to any scene arbitrary distribution primitives. It enables us scaling behavior 3DGS in terms primitive numbers resolutions that were difficult before surpass previous state-of-the-art reconstruction quality. observe clear...

10.48550/arxiv.2406.11836 preprint EN arXiv (Cornell University) 2024-06-17

Weak defect and flatness discrepancy fused identification for robotic measuring of putty repair area

OPENALEX - Publications

Lina Wu Zeyuan Yang Yifan Yang Yue Jin Sijie Yan

10.1117/12.3037912 article EN 2024-10-22

Coming Soon ...