calvin-openvla-oft-lora-epoch0

OpenVLA-OFT (OpenVLA-7B LoRA) fine-tuned on CALVIN task_D_D — first post-epoch-0 checkpoint (step25212).

Fine-tuned with LoRA on CALVIN task_D_D for the paper:

"Reasoning as a Double-Edged Sword: Vulnerability and Defense in VLA Pipelines" NeurIPS 2026 — University of Melbourne Physical AI Group

Code and probe results: https://github.com/uom-physical-ai/calvin-vla-adversarial

Checkpoint details

Field	Value
Base model	`openvla/openvla-7b`
Fine-tune dataset	CALVIN task_D_D (512,077 episodes)
Step	25212
Epoch (training)	0
Adversarial probe step	25212

Adversarial probe results (Gaussian image noise, n=300, seed=42)

σ	mean_RAS	p95_RAS	mean_RS
0.05	0.1183	0.7054	0.2484
0.10	0.2211	1.2795	0.4487
0.20	0.3588	1.5071	0.6699

RAS = Relative Action Shift (L2 deviation of predicted action, normalised). Lower = more robust.

Usage

from peft import PeftModel
from transformers import AutoModelForVision2Seq, AutoProcessor

base = AutoModelForVision2Seq.from_pretrained("openvla/openvla-7b")
model = PeftModel.from_pretrained(base, "uom-physical-ai/calvin-openvla-oft-lora-epoch0")
processor = AutoProcessor.from_pretrained("openvla/openvla-7b")

See the GitHub repo for the full fine-tuning and probe pipeline.

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Robotics

Model tree for uom-physical-ai/calvin-openvla-oft-lora-epoch0

Base model

openvla/openvla-7b

Adapter

(21)

this model