NFDI4DS | UHH-SEMS - Publication Details

Yihong Xu

ORCID: 0000-0003-1043-0656

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5089449284

Research Areas

Advanced Image and Video Retrieval Techniques
Video Surveillance and Tracking Methods
Statistical and numerical algorithms
Advanced Vision and Imaging
Time Series Analysis and Forecasting
Advanced Neural Network Applications
Robotics and Sensor-Based Localization
Autonomous Vehicle Technology and Safety
Visual perception and processing mechanisms
Advanced Optical Sensing Technologies
Remote-Sensing Image Classification
Photonic and Optical Devices
Optical Coherence Tomography Applications
Forecasting Techniques and Applications

Valeo (France)
2022-2024

TransCenter: Transformers With Dense Representations for Multiple-Object Tracking

OPENALEX - Publications

Yihong Xu Yutong Ban Guillaume Delorme Chuang Gan Daniela Rus and 1 more

Transformers have proven superior performance for a wide variety of tasks since they were introduced. In recent years, drawn attention from the vision community in such as image classification and object detection. Despite this wave, an accurate efficient multiple-object tracking (MOT) method based on transformers is yet to be designed. We argue that direct application transformer architecture with quadratic complexity insufficient noise-initialized sparse queries – not optimal MOT. propose...

10.1109/tpami.2022.3225078 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-11-28

Learning Kernel-Modulated Neural Representation for Efficient Light Field Compression

OPENALEX - Publications

Jinglei Shi Yihong Xu Christine Guillemot

Light fields capture 3D scene information by recording light rays emitted from a at various orientations. They offer more immersive perception, compared with classic 2D images, but the cost of huge data volumes. In this paper, we design compact neural network representation for field compression task. same vein as deep image prior, takes randomly initialized noise input and is trained in supervised manner order to best reconstruct target Sub-Aperture Images (SAIs). The composed two types...

10.1109/tip.2024.3418670 article EN IEEE Transactions on Image Processing 2024-01-01

Towards Motion Forecasting with Real-World Perception Inputs: Are End-to-End Approaches Competitive?

OPENALEX - Publications

Yihong Xu Loïck Chambon Éloi Zablocki Mickaël Chen Alexandre Alahi and 2 more

10.1109/icra57147.2024.10610201 article EN 2024-05-13

Annealed Winner-Takes-All for Motion Forecasting

OPENALEX - Publications

Yihong Xu Victor Letzelter Mickaël Chen Éloi Zablocki Matthieu Cord

In autonomous driving, motion prediction aims at forecasting the future trajectories of nearby agents, helping ego vehicle to anticipate behaviors and drive safely. A key challenge is generating a diverse set predictions, commonly addressed using data-driven models with Multiple Choice Learning (MCL) architectures Winner-Takes-All (WTA) training objectives. However, these methods face initialization sensitivity instabilities. Additionally, compensate for limited performance, some approaches...

10.48550/arxiv.2409.11172 preprint EN arXiv (Cornell University) 2024-09-17

PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting

OPENALEX - Publications

Yihong Xu Yuan Yin Tuan-Hung Vu Alexandre Boulch Éloi Zablocki and 1 more

Motion forecasting (MF) for autonomous driving aims at anticipating trajectories of surrounding agents in complex urban scenarios. In this work, we investigate a mixed strategy MF training that first pre-train motion forecasters on pseudo-labeled data, then fine-tune them annotated data. To obtain trajectories, propose simple pipeline leverages off-the-shelf single-frame 3D object detectors and non-learning trackers. The whole pre-training including pseudo-labeling is coined as PPT. Our...

10.48550/arxiv.2412.06491 preprint EN arXiv (Cornell University) 2024-12-09

Learning Kernel-Modulated Neural Representation for Efficient Light Field Compression

OPENALEX - Publications

Jinglei Shi Yihong Xu Christine Guillemot Fellow

Light field is a type of image data that captures the 3D scene information by recording light rays emitted from at various orientations. It offers more immersive perception than classic 2D images but cost huge volume. In this paper, we draw inspiration visual characteristics Sub-Aperture Images (SAIs) and design compact neural network representation for compression task. The backbone takes randomly initialized noise as input supervised on SAIs target field. composed two types complementary...

10.48550/arxiv.2307.06143 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Coming Soon ...