CrowdSurfer: Sampling Optimization Augmented with Vector-Quantized Variational AutoEncoder for Dense Crowd Navigation

* equal contribution
1 Robotics Research Center, IIIT Hyderabad, India;
2 University of Tartu, Estonia
Note - Accepted to IEEE ICRA 2025
CrowdSurfer Teaser

CrowdSurfer in Action

Abstract

Navigation amongst densely packed crowds remains a challenge for mobile robots. The complexity increases further if the environment layout changes making the prior computed global plan infeasible. In this paper, we show that it is possible to dramatically enhance crowd navigation by just improving the local planner. Our approach combines generative modelling with inference time optimization to generate sophisticated long-horizon local plans at interactive rates. More specifically, we train a Vector Quantized Variational AutoEncoder to learn a prior over the expert trajectory distribution conditioned on the perception input. At run-time, this is used as an initialization for a sampling-based optimizer for further refinement. Our approach does not require any sophisticated prediction of dynamic obstacles and yet provides state-of-the- art performance. In particular, we compare against the recent DRL-VO approach and show a 40% improvement in success rate and a 6% improvement in travel time.

Wheelchair Test - Indoor Cluttered Environment - with Global Plan

Husky Test - Outdoors with Dynamic Obstacles - No Global Plan

Simulation Tests - Extremely Cluttered Environments

BibTeX

@INPROCEEDINGS{11128132,
              author={Kumar, Naman and Singha, Antareep and Nanwani, Laksh and Potdar, Dhruv and R, Tarun and Rastgar, Fatemeh and Idoko, Simon and Singh, Arun Kumar and Krishna, K. Madhava},
              booktitle={2025 IEEE International Conference on Robotics and Automation (ICRA)}, 
              title={CrowdSurfer: Sampling Optimization Augmented with Vector-Quantized Variational AutoEncoder for Dense Crowd Navigation}, 
              year={2025},
              volume={},
              number={},
              pages={16854-16860},
              keywords={Pedestrians;Navigation;Autoencoders;Layout;Vectors;Real-time systems;Trajectory;Mobile robots;Robotics and automation;Optimization},
              doi={10.1109/ICRA55743.2025.11128132}}