File size: 2,224 Bytes
935d608
1cb8460
935d608
 
 
 
 
1cb8460
8be41de
1cb8460
8be41de
1cb8460
8be41de
1cb8460
 
8be41de
1cb8460
 
8be41de
1cb8460
 
 
8be41de
1cb8460
8be41de
1cb8460
 
 
 
8be41de
1cb8460
 
 
8be41de
1cb8460
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
---
pipeline_tag: robotics
tags:
- model_hub_mixin
- pytorch_model_hub_mixin
---

# PointGoal Navigation Policy (FrodoBots8K)

PointGoal navigation policy trained with the FrodoBots8K dataset, as presented in the paper [Data Scaling for Navigation in Unknown Environments](https://arxiv.org/abs/2601.09444).

[**Project Page**](https://lasuomela.github.io/navigation_scaling/) | [**Code**](https://github.com/lasuomela/NavigationScaling)

## Authors
Lauri Suomela, Naoki Takahata, Sasanka Kuruppu Arachchige, Harry Edelman, Joni-Kristian Kämäräinen

## Model Description
This model is an end-to-end, map-free visual navigation policy designed for sidewalk robots. It was trained using imitation learning on crowd-sourced data to achieve zero-shot navigation in unknown environments. The research demonstrates that data diversity (geographical variety) is significantly more important than data quantity for real-world generalization.

- **Task:** Zero-shot PointGoal navigation.
- **Generalization:** Evaluated across 125 km of autonomous driving in four different countries.
- **Key Finding:** Doubling geographical locations in training decreases navigation errors by ~15%, while adding more data from existing locations leads to rapid performance saturation.

## Details

### Architecture
- **Type:** MLP-BC
- **Visual Encoder:** [Theia encoder](https://huggingface.co/theaiinstitute/theia-base-patch16-224-cddsv)
- **Action Head:** MLP action head

### Training Dataset
- **Dataset:** FrodoBots8K (subset)
- **Scale:** 1024 total trajectories (~32 hours of data) from 32 distinct geographical locations.

## Usage
The model is built for deployment on EarthRover Zero robots using ROS2. For detailed instructions on environment setup, training, and deployment, please refer to the [official GitHub repository](https://github.com/lasuomela/NavigationScaling).

## Citation
If you use this model or code in your research, please cite:
```bibtex
@misc{suomela2026data_scaling,
  title={Data Scaling for Navigation in Unknown Environments},
  author={Suomela, Lauri and Takahata, Naoki and Kuruppu Arachchige, Sasanka and Edelman, Harry and Kämäräinen, Joni-Kristian},
  journal={arXiv:2601.09444},
  year={2026},
}
```