Update model card with paper details and authors

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +32 -11
README.md CHANGED
@@ -1,26 +1,47 @@
1
  ---
 
2
  tags:
3
  - model_hub_mixin
4
  - pytorch_model_hub_mixin
5
- pipeline_tag: robotics
6
  ---
7
 
8
- PointGoal navigation policy trained with the FrodoBots8K dataset.
9
 
10
- Presented in the paper [Data scaling for navigation in unknown environments](https://arxiv.org/abs/2601.09444)
11
 
12
- [Project page](https://lasuomela.github.io/navigation_scaling/) | [Code](https://github.com/lasuomela/NavigationScaling)
13
 
14
- ## Details
 
15
 
16
- Architecture:
 
17
 
18
- MLP-BC with Theia encoder + MLP action head
 
 
19
 
20
- Dataset:
21
 
22
- 1024 total, 32H from 32 distinct geographical locations
 
 
 
23
 
24
- ## Usage
 
 
25
 
26
- See the Github repo
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ pipeline_tag: robotics
3
  tags:
4
  - model_hub_mixin
5
  - pytorch_model_hub_mixin
 
6
  ---
7
 
8
+ # PointGoal Navigation Policy (FrodoBots8K)
9
 
10
+ PointGoal navigation policy trained with the FrodoBots8K dataset, as presented in the paper [Data Scaling for Navigation in Unknown Environments](https://arxiv.org/abs/2601.09444).
11
 
12
+ [**Project Page**](https://lasuomela.github.io/navigation_scaling/) | [**Code**](https://github.com/lasuomela/NavigationScaling)
13
 
14
+ ## Authors
15
+ Lauri Suomela, Naoki Takahata, Sasanka Kuruppu Arachchige, Harry Edelman, Joni-Kristian Kämäräinen
16
 
17
+ ## Model Description
18
+ This model is an end-to-end, map-free visual navigation policy designed for sidewalk robots. It was trained using imitation learning on crowd-sourced data to achieve zero-shot navigation in unknown environments. The research demonstrates that data diversity (geographical variety) is significantly more important than data quantity for real-world generalization.
19
 
20
+ - **Task:** Zero-shot PointGoal navigation.
21
+ - **Generalization:** Evaluated across 125 km of autonomous driving in four different countries.
22
+ - **Key Finding:** Doubling geographical locations in training decreases navigation errors by ~15%, while adding more data from existing locations leads to rapid performance saturation.
23
 
24
+ ## Details
25
 
26
+ ### Architecture
27
+ - **Type:** MLP-BC
28
+ - **Visual Encoder:** [Theia encoder](https://huggingface.co/theaiinstitute/theia-base-patch16-224-cddsv)
29
+ - **Action Head:** MLP action head
30
 
31
+ ### Training Dataset
32
+ - **Dataset:** FrodoBots8K (subset)
33
+ - **Scale:** 1024 total trajectories (~32 hours of data) from 32 distinct geographical locations.
34
 
35
+ ## Usage
36
+ The model is built for deployment on EarthRover Zero robots using ROS2. For detailed instructions on environment setup, training, and deployment, please refer to the [official GitHub repository](https://github.com/lasuomela/NavigationScaling).
37
+
38
+ ## Citation
39
+ If you use this model or code in your research, please cite:
40
+ ```bibtex
41
+ @misc{suomela2026data_scaling,
42
+ title={Data Scaling for Navigation in Unknown Environments},
43
+ author={Suomela, Lauri and Takahata, Naoki and Kuruppu Arachchige, Sasanka and Edelman, Harry and Kämäräinen, Joni-Kristian},
44
+ journal={arXiv:2601.09444},
45
+ year={2026},
46
+ }
47
+ ```