NFDI4DS | UHH-SEMS - Publication Details

Learning Trajectory Dependencies for Human Motion Prediction

OPENALEX - Publications

Wei Mao Miaomiao Liu Mathieu Salzmann Hongdong Li

Human motion prediction, i.e., forecasting future body poses given observed pose sequence, has typically been tackled with recurrent neural networks (RNNs). However, as evidenced by prior work, the resulted RNN models suffer from prediction errors accumulation, leading to undesired discontinuities in prediction. In this paper, we propose a simple feed-forward deep network for which takes into account both temporal smoothness and spatial dependencies among human joints. context, then encode...

10.1109/iccv.2019.00958 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Cost Volume Pyramid Based Depth Inference for Multi-View Stereo

OPENALEX - Publications

Jiayu Yang Wei Mao Jose M. Álvarez Miaomiao Liu

We propose a cost volume-based neural network for depth inference from multi-view images. demonstrate that building volume pyramid in coarse-to-fine manner instead of constructing at fixed resolution leads to compact, lightweight and allows us inferring high maps achieve better reconstruction results. To this end, we first build based on uniform sampling fronto-parallel planes across the entire range coarsest an image. Then, given current estimate, construct new volumes iteratively pixelwise...

10.1109/cvpr42600.2020.00493 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Multi-level Motion Attention for Human Motion Prediction

OPENALEX - Publications

Wei Mao Miaomiao Liu Mathieu Salzmann Hongdong Li

10.1007/s11263-021-01483-7 article EN International Journal of Computer Vision 2021-06-16

Generating Smooth Pose Sequences for Diverse Human Motion Prediction

OPENALEX - Publications

Wei Mao Miaomiao Liu Mathieu Salzmann

Recent progress in stochastic motion prediction, i.e., predicting multiple possible future human motions given a single past pose sequence, has led to producing truly diverse and even providing control over the of some body parts. However, achieve this, state-of-the-art method requires learning several mappings for diversity dedicated model controllable prediction. In this paper, we introduce unified deep generative network both To end, leverage intuition that realistic consist smooth...

10.1109/iccv48922.2021.01306 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Cost Volume Pyramid Based Depth Inference for Multi-View Stereo

OPENALEX - Publications

Jiayu Yang Wei Mao Jose M. Álvarez Miaomiao Liu

We propose a cost volume-based neural network for depth inference from multi-view images. demonstrate that building volume pyramid in coarse-to-fine manner instead of constructing at fixed resolution leads to compact, lightweight and allows us inferring high maps achieve better reconstruction results. To this end, we first build based on uniform sampling fronto-parallel planes across the entire range coarsest an image. Then, given current estimate, construct new volumes iteratively perform...

10.1109/tpami.2021.3082562 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2021-01-01

Clinical Features Evaluation of Myopic Fundus tessellation from OCTA and MfERG

OPENALEX - Publications

Yanyan Zhang Yan Zhong Wei Mao Zhenyu Zhang Yusheng Zhou and 3 more

10.1016/j.pdpdt.2025.104493 article EN cc-by-nc-nd Photodiagnosis and Photodynamic Therapy 2025-01-01

Weakly-supervised Action Transition Learning for Stochastic Human Motion Prediction

OPENALEX - Publications

Wei Mao Miaomiao Liu Mathieu Salzmann

We introduce the task of action-driven stochastic human motion prediction, which aims to predict multiple plausible future motions given a sequence action labels and short history. This differs from existing works, that either do not respect any specific category, or follow single label. In particular, addressing this requires tackling two challenges: The transitions between different actions must be smooth; length predicted depends on varies significantly across samples. As we cannot...

10.1109/cvpr52688.2022.00798 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Radar target recognition based on few-shot learning

OPENALEX - Publications

Yue Yang Zhuo Zhang Wei Mao Yang Li LV Chen-gang

10.1007/s00530-021-00832-3 article EN Multimedia Systems 2021-07-26

Structure of a semantic segmentation-based defect detection network for laser cladding infrared images

OPENALEX - Publications

Shiyi Deng Ruipeng Gao Yiran Wang Wei Mao Weikang Zheng

Abstract While selecting the most suitable infrared thermal imaging detection scheme for online inspection during laser cladding processing, this paper designs RespathU-net semantic segmentation defect network coating defects in images. The is based on U-net framework. It optimized and improved by redesigning coding structure, expanding perceptual field, connecting paths of residuals, thus enhancing effect defective areas melt addressing problems that original cannot realize end-to-end...

10.1088/1361-6501/acc7bd article EN Measurement Science and Technology 2023-03-27

VisFusion: Visibility-Aware Online 3D Scene Reconstruction from Videos

OPENALEX - Publications

Huiyu Gao Wei Mao Miaomiao Liu

We propose VisFusion, a visibility-aware online 3D scene reconstruction approach from posed monocular videos. In particular, we aim to reconstruct the volumetric features. Unlike previous methods which aggregate features for each voxel input views without considering its visibility, improve feature fusion by explicitly inferring visibility similarity matrix, computed projected in image pair. Following works, our model is coarse-to-fine pipeline including volume sparsification process....

10.1109/cvpr52729.2023.01661 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

WLIB-SIFT: A Distinctive Local Image Feature Descriptor

OPENALEX - Publications

Wei Mao Xiwei Peng

SIFT descriptor plays a great role in image mosaic, retrieval and target recognition for good invariance of translation, rotation zoom. However, the disadvantages are its high dimensionality complex computation. Besides, has poor performance when massive similar local features background exist matching image. In this paper, distinctive robust weighted intensity binary descriptor(WLIB-SIFT) is proposed. A WLIB-SIFT consists descriptor(B-SIFT) descriptor. The experimental results show that...

10.1109/icicsp48821.2019.8958587 article EN 2019-09-01

Identification and micro-motion parameter estimation of non-cooperative UAV targets

OPENALEX - Publications

Jiachen Yang Zhuo Zhang Wei Mao Yang Yue

10.1016/j.phycom.2021.101314 article EN Physical Communication 2021-03-07

Intelligent Life Prediction of Thermal Barrier Coating for Aero Engine Blades

OPENALEX - Publications

Ruipeng Gao Wei Mao Yiran Wang Shanshan Fan Wei Shao

The existing methods for thermal barrier coating (TBC) life prediction rely mainly on experience and formula derivation are inefficient inaccurate. By introducing deep learning into TBC analyses, a convolutional neural network (CNN) is used to extract the interface morphology analyze its information, which can achieve high-efficiency accurate judgment of life. In this thesis, an Adap–Alex algorithm proposed overcome problems related large training time, over-fitting, low accuracy in CNN...

10.3390/coatings11080890 article EN Coatings 2021-07-26

IoT-based critical infrastructure enabled radar information fusion

OPENALEX - Publications

Jiachen Yang Zhuo Zhang Wei Mao Yiwen Sun Yongjun Bao and 1 more

10.1016/j.compeleceng.2022.107723 article EN Computers & Electrical Engineering 2022-01-28

Learning Trajectory Dependencies for Human Motion Prediction

OPENALEX - Publications

Wei Mao Miaomiao Liu Mathieu Salzmann Hongdong Li

Human motion prediction, i.e., forecasting future body poses given observed pose sequence, has typically been tackled with recurrent neural networks (RNNs). However, as evidenced by prior work, the resulted RNN models suffer from prediction errors accumulation, leading to undesired discontinuities in prediction. In this paper, we propose a simple feed-forward deep network for which takes into account both temporal smoothness and spatial dependencies among human joints. context, then encode...

10.48550/arxiv.1908.05436 preprint EN other-oa arXiv (Cornell University) 2019-01-01

DeepSimHO: Stable Pose Estimation for Hand-Object Interaction via Physics Simulation

OPENALEX - Publications

Rong Wang Wei Mao Hongdong Li

This paper addresses the task of 3D pose estimation for a hand interacting with an object from single image observation. When modeling hand-object interaction, previous works mainly exploit proximity cues, while overlooking dynamical nature that must stably grasp to counteract gravity and thus preventing slipping or falling. These fail leverage constraints in consequently often produce unstable results. Meanwhile, refining configurations physics-based reasoning remains challenging, both by...

10.48550/arxiv.2310.07206 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Simulation and Analysis of a Streaming Media Congestion Control Algorithm Based on NS-2

OPENALEX - Publications

Wei Mao Dongyuan Qi Congming Wu Kangyi Zhang

NS(Simulator Network) is an object-oriented visual simulator based on large-scale discrete event. It simulates not only the transmission of network data and topology architecture, but also all kinds IP circumstance. This paper describes architecture characteristics NS, gives technique general process NS. The instance a streaming media applications-based adaptive congestion control algorithm implemented simulation results are analyzed. experimental result shows that when congested, delay,...

10.1109/iccis.2012.282 article EN 2012-08-01

A method for feature extraction based on SVD and machine learning

OPENALEX - Publications

Wei Mao Huang Shu-xian Xin Liu Hongyan Liu Jiaqi Liu and 1 more

By studying the shortcomings of feature, which extracted from Radar-Cross Section(RCS),using mathematical and statistical method, using idea extracting abstract features in image recognition speech by artificial intelligence for reference[2][3]. This paper explores possibility target's RCS sequence, proposes an feature extraction method sequence based on singular value decomposition(SVD) decomposition. Because poor interpretability features, four different machine learning algorithms are...

10.1088/1757-899x/569/5/052010 article EN IOP Conference Series Materials Science and Engineering 2019-07-01

Cost Volume Pyramid Based Depth Inference for Multi-View Stereo

OPENALEX - Publications

Jiayu Yang Wei Mao Jose M. Álvarez Miaomiao Liu

We propose a cost volume-based neural network for depth inference from multi-view images. demonstrate that building volume pyramid in coarse-to-fine manner instead of constructing at fixed resolution leads to compact, lightweight and allows us inferring high maps achieve better reconstruction results. To this end, we first build based on uniform sampling fronto-parallel planes across the entire range coarsest an image. Then, given current estimate, construct new volumes iteratively pixelwise...

10.48550/arxiv.1912.08329 preprint EN other-oa arXiv (Cornell University) 2019-01-01

History Repeats Itself: Human Motion Prediction via Motion Attention

OPENALEX - Publications

Wei Mao Miaomiao Liu Mathieu Salzmann

Human motion prediction aims to forecast future human poses given a past motion. Whether based on recurrent or feed-forward neural networks, existing methods fail model the observation that tends repeat itself, even for complex sports actions and cooking activities. Here, we introduce an attention-based network explicitly leverages this observation. In particular, instead of modeling frame-wise attention via pose similarity, propose extract capture similarity between current context...

10.48550/arxiv.2007.11755 preprint EN other-oa arXiv (Cornell University) 2020-01-01