NFDI4DS | UHH-SEMS - Publication Details

Matthieu Gaetan Lin

ORCID: 0009-0004-4265-6830

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5078405581

Research Areas

Reinforcement Learning in Robotics
Advanced Vision and Imaging
Generative Adversarial Networks and Image Synthesis
Mobile Crowdsensing and Crowdsourcing
Face recognition and analysis
Industrial Vision Systems and Defect Detection
3D Surveying and Cultural Heritage
Speech and Audio Processing
Robotics and Sensor-Based Localization
Image Processing and 3D Reconstruction
Advanced Bandit Algorithms Research
CCD and CMOS Imaging Sensors
Computer Graphics and Visualization Techniques
Mechanics and Biomechanics Studies
Fault Detection and Control Systems
Optical measurement and interference techniques
Astronomical Observations and Instrumentation
Machine Learning and Data Classification
Advanced Image and Video Retrieval Techniques

Tsinghua University
2023-2024

DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models

OPENALEX - Publications

Zhiyao Sun Tian Lv Sheng Ye Matthieu Gaetan Lin Jenny Sheng and 3 more

The generation of stylistic 3D facial animations driven by speech presents a significant challenge as it requires learning many-to-many mapping between speech, style, and the corresponding natural motion. However, existing methods either employ deterministic model for speech-to-motion or encode style using one-hot encoding scheme. Notably, approach fails to capture complexity thus limits generalization ability. In this paper, we propose DiffPoseTalk, generative framework based on diffusion...

10.1145/3658221 article EN ACM Transactions on Graphics 2024-07-19

SD-FSOD: Self-Distillation Paradigm via Distribution Calibration for Few-Shot Object Detection

OPENALEX - Publications

Han Chen Qi Wang Kailin Xie Lei Liang Matthieu Gaetan Lin and 3 more

Few-shot object detection (FSOD) aims to detect novel targets with only a few instances of the associated samples. Although combinations distillation techniques and meta-learning paradigms have been acknowledged as primary strategies for FSOD tasks, existing methods exhibit inherent biases sensitivity class variability. A critical hurdle is difficulty in ensuring appropriate knowledge learned from teacher model during fine-tuning stage. Furthermore, coarse procedures risk misalignment...

10.1109/tcsvt.2023.3343397 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-12-15

Indoor Scene Reconstruction with Fine-Grained Details Using Hybrid Representation and Normal Prior Enhancement

OPENALEX - Publications

Sheng Ye Yubin Hu Matthieu Gaetan Lin Yu‐Hui Wen Wang Zhao and 2 more

The reconstruction of indoor scenes from multi-view RGB images is challenging due to the coexistence flat and texture-less regions alongside delicate fine-grained regions. Recent methods leverage neural radiance fields aided by predicted surface normal priors recover scene geometry. These excel in producing complete smooth results for floor wall areas. However, they struggle capture complex surfaces with high-frequency structures inadequate representation inaccurately priors. This work aims...

10.1109/tvcg.2024.3444036 article EN IEEE Transactions on Visualization and Computer Graphics 2024-01-01

PCKRF: Point Cloud Completion and Keypoint Refinement With Fusion Data for 6D Pose Estimation

OPENALEX - Publications

Yiheng Han Irvin Haozhe Zhan Long Zeng Yu‐Ping Wang Ran Yi and 4 more

Some robust point cloud registration approaches with controllable pose refinement magnitude, such as ICP and its variants, are commonly used to improve 6D estimation accuracy. However, the effectiveness of these methods gradually diminishes advancement deep learning techniques enhancement initial accuracy, primarily due their lack specific design for refinement. In this paper, we propose Point Cloud Completion Keypoint Refinement Fusion Data (PCKRF), a new pipeline estimation. The consists...

10.1109/tvcg.2024.3390122 article EN IEEE Transactions on Visualization and Computer Graphics 2024-01-01

Emotional Neural Textures: Generating Talking-Face Videos with Continuously Controllable Emotions

OPENALEX - Publications

Bin Wan Zhiyao Sun Matthieu Gaetan Lin Zipeng Ye Yu‐Hui Wen and 1 more

10.2139/ssrn.4873630 preprint EN 2024-01-01

Generalizable Thermal-based Depth Estimation via Pre-trained Visual Foundation Model

OPENALEX - Publications

Ruoyu Fan Zhao Wang Matthieu Gaetan Lin Qi Wang Yong‐Jin Liu and 1 more

10.1109/icra57147.2024.10610394 article EN 2024-05-13

PVP-Recon: Progressive View Planning via Warping Consistency for Sparse-View Surface Reconstruction

OPENALEX - Publications

Sheng Ye Yuze He Matthieu Gaetan Lin Jenny Sheng Ruoyu Fan and 6 more

Neural implicit representations have revolutionized dense multi-view surface reconstruction, yet their performance significantly diminishes with sparse input views. A few pioneering works sought to tackle this challenge by leveraging additional geometric priors or multi-scene generalizability. However, they are still hindered the imperfect choice of views, using images under empirically determined viewpoints. We propose PVP-Recon , a novel and effective sparse-view reconstruction method that...

10.1145/3687896 article EN other-oa ACM Transactions on Graphics 2024-11-19

A Mixture of Surprises for Unsupervised Reinforcement Learning

OPENALEX - Publications

Andrew Zhao Matthieu Gaetan Lin Yangguang Li Yong‐Jin Liu Gao Huang

Unsupervised reinforcement learning aims at a generalist policy in reward-free manner for fast adaptation to downstream tasks. Most of the existing methods propose provide an intrinsic reward based on surprise. Maximizing or minimizing surprise drives agent either explore gain control over its environment. However, both strategies rely strong assumption: entropy environment's dynamics is high low. This assumption may not always hold real-world scenarios, where be unknown. Hence, choosing...

10.48550/arxiv.2210.06702 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Boosting Offline Reinforcement Learning with Action Preference Query

OPENALEX - Publications

Qisen Yang Shenzhi Wang Matthieu Gaetan Lin Shiji Song Gao Huang

Training practical agents usually involve offline and online reinforcement learning (RL) to balance the policy's performance interaction costs. In particular, fine-tuning has become a commonly used method correct erroneous estimates of out-of-distribution data learned in training phase. However, even limited interactions can be inaccessible or catastrophic for high-stake scenarios like healthcare autonomous driving. this work, we introduce an interaction-free scheme dubbed...

10.48550/arxiv.2306.03362 preprint EN cc-by-nc-nd arXiv (Cornell University) 2023-01-01

DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models

OPENALEX - Publications

Zhiyao Sun Tian Lv Sheng Ye Matthieu Gaetan Lin Jenny Sheng and 3 more

The generation of stylistic 3D facial animations driven by speech poses a significant challenge as it requires learning many-to-many mapping between speech, style, and the corresponding natural motion. However, existing methods either employ deterministic model for speech-to-motion or encode style using one-hot encoding scheme. Notably, approach fails to capture complexity thus limits generalization ability. In this paper, we propose DiffPoseTalk, generative framework based on diffusion...

10.48550/arxiv.2310.00434 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning

OPENALEX - Publications

Shenzhi Wang Qisen Yang Jiawei Gao Matthieu Gaetan Lin Hao Chen and 4 more

Offline-to-online reinforcement learning (RL) is a training paradigm that combines pre-training on pre-collected dataset with fine-tuning in an online environment. However, the incorporation of can intensify well-known distributional shift problem. Existing solutions tackle this problem by imposing policy constraint improvement objective both offline and learning. They typically advocate single balance between constraints across diverse data collections. This one-size-fits-all manner may not...

10.48550/arxiv.2310.17966 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Coming Soon ...