VLA-RL-Study: What Can RL Bring to VLA Generalization? An Empirical Study

arXiv Website

This is the RL model, obtained by fine-tuning the warmed-up OpenVLA model with reinforcement learning. RL training takes about 1.5M environment steps. For more details, please refer to the codebase and the paper.
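As a rough illustration of how such a checkpoint might be loaded, the sketch below assumes that this model keeps the Hugging Face loading interface of the original OpenVLA release (`AutoProcessor` and `AutoModelForVision2Seq` with `trust_remote_code=True`). That is an assumption, not something this card confirms, and the project codebase may use its own loader. The helper name `load_policy` and the default device are hypothetical.

```python
# Repository name as it appears on this model card.
MODEL_ID = "gen-robot/openvla-7b-rlvla-rl"


def load_policy(model_id: str = MODEL_ID, device: str = "cuda:0"):
    """Load processor and model.

    ASSUMPTION: the checkpoint is loadable the same way as the original
    OpenVLA release. Check the project codebase before relying on this.
    """
    # Imported lazily so that defining this helper does not require
    # torch/transformers to be installed.
    import torch
    from transformers import AutoModelForVision2Seq, AutoProcessor

    processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForVision2Seq.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # the card lists BF16 weights
        low_cpu_mem_usage=True,
        trust_remote_code=True,
    ).to(device)
    return processor, model
```

Calling `load_policy()` downloads the full checkpoint, so it is only practical on a machine with enough disk space and accelerator memory.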

Safetensors · Model size: 8B params · Tensor type: BF16
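The listed size and tensor type give a quick estimate of the memory needed for the weights alone. The sketch below treats "8B params" as a rounded count and ignores activations, caches, and any optimizer state, so the result is a lower bound rather than a measured figure. The helper name `weight_memory_gib` is hypothetical.

```python
# Bytes used to store one parameter for a few common tensor types.
BYTES_PER_PARAM = {"bf16": 2, "fp16": 2, "fp32": 4}


def weight_memory_gib(num_params: float, dtype: str = "bf16") -> float:
    """Approximate memory for the weights only, in GiB (2**30 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


# 8B parameters in BF16: 8e9 * 2 bytes = 16e9 bytes
print(f"{weight_memory_gib(8e9):.1f} GiB")  # prints "14.9 GiB"
```

By this estimate, loading the same weights in FP32 would take roughly twice as much memory.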

Model ID: gen-robot/openvla-7b-rlvla-rl