Zero-shot Object-Centric Instruction Following: Integrating Foundation Models with Traditional Navigation
FOS: Computer and information sciences
Robotics (cs.RO)
Computer Vision and Pattern Recognition (cs.CV)
DOI:
10.48550/arXiv.2411.07848
Publication Date:
2024-01-01
AUTHORS (6)
ABSTRACT
Large-scale scenes such as multi-floor homes can be robustly and efficiently mapped with a 3D graph of landmarks estimated jointly with robot poses in a factor graph, a technique commonly used in commercial robots such as drones and robot vacuums. In this work, we propose Language-Inferred Factor Graph for Instruction Following (LIFGIF), a zero-shot method to ground natural language instructions in such a map. LIFGIF also includes a policy for following natural language navigation instructions in a novel environment while the map is being constructed, enabling robust navigation performance in the physical world. To evaluate LIFGIF, we present a new dataset, Object-Centric VLN (OC-VLN), for assessing the grounding of object-centric natural language navigation instructions. We compare against two state-of-the-art zero-shot baselines from the related tasks of Object Goal Navigation and Vision Language Navigation, and show that LIFGIF outperforms them across all our evaluation metrics on OC-VLN. Finally, we demonstrate the effectiveness of LIFGIF for zero-shot object-centric instruction following in the real world on a Boston Dynamics Spot robot.
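For readers unfamiliar with the mapping backbone the abstract refers to, the sketch below illustrates jointly estimating robot poses and a landmark position in a factor graph. It is a generic planar-SLAM example written against the GTSAM Python bindings, not code from LIFGIF; the keys X(i)/L(j), the noise values, and the measurements are all illustrative assumptions rather than details from the paper.

# Minimal sketch, assuming the GTSAM Python bindings (pip install gtsam).
# Poses and a landmark are estimated jointly by optimizing one factor graph.
import numpy as np
import gtsam
from gtsam.symbol_shorthand import X, L  # X(i): robot poses, L(j): landmarks

graph = gtsam.NonlinearFactorGraph()

# Noise models for the prior, odometry, and landmark observations (illustrative values).
prior_noise = gtsam.noiseModel.Diagonal.Sigmas(np.array([0.3, 0.3, 0.1]))
odom_noise = gtsam.noiseModel.Diagonal.Sigmas(np.array([0.2, 0.2, 0.1]))
meas_noise = gtsam.noiseModel.Diagonal.Sigmas(np.array([0.1, 0.2]))

# Anchor the first pose, then chain poses with odometry factors.
graph.add(gtsam.PriorFactorPose2(X(1), gtsam.Pose2(0.0, 0.0, 0.0), prior_noise))
graph.add(gtsam.BetweenFactorPose2(X(1), X(2), gtsam.Pose2(2.0, 0.0, 0.0), odom_noise))
graph.add(gtsam.BetweenFactorPose2(X(2), X(3), gtsam.Pose2(2.0, 0.0, 0.0), odom_noise))

# Bearing-range factors tie poses to a landmark (e.g. a detected object).
graph.add(gtsam.BearingRangeFactor2D(X(1), L(1), gtsam.Rot2.fromDegrees(45.0),
                                     np.sqrt(8.0), meas_noise))
graph.add(gtsam.BearingRangeFactor2D(X(2), L(1), gtsam.Rot2.fromDegrees(90.0),
                                     2.0, meas_noise))

# Rough initial guesses; the optimizer refines poses and the landmark together.
initial = gtsam.Values()
initial.insert(X(1), gtsam.Pose2(0.1, 0.1, 0.0))
initial.insert(X(2), gtsam.Pose2(2.2, 0.1, 0.0))
initial.insert(X(3), gtsam.Pose2(4.1, -0.1, 0.0))
initial.insert(L(1), gtsam.Point2(2.1, 2.1))

result = gtsam.LevenbergMarquardtOptimizer(graph, initial).optimize()
print(result.atPose2(X(3)))   # refined final robot pose
print(result.atPoint2(L(1)))  # refined landmark position

Per the abstract, a landmark map of this kind, estimated jointly with robot poses, is the structure onto which LIFGIF grounds object-centric natural language instructions; how that grounding is performed is described in the paper itself.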