LaViRA: Language-Vision-Robot Actions Translation for Zero-Shot Vision Language Navigation in Continuous Environments Permalink
Published in IEEE International Conference on Robotics and Automation (ICRA), 2025
A zero-shot VLN framework that decomposes navigation into language, vision, and robot actions for stronger generalization in continuous environments.
Recommended citation: Hongyu Ding, Ziming Xu, Yudong Fang, You Wu, Zixuan Chen, Jieqi Shi, Jing Huo, Yifan Zhang, and Yang Gao. "LaViRA: Language-Vision-Robot Actions Translation for Zero-Shot Vision Language Navigation in Continuous Environments." In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2026.
Download Paper