Instructions to use robometer/Robometer-4B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use robometer/Robometer-4B with Transformers:
# Load model directly from transformers import AutoProcessor, RFM processor = AutoProcessor.from_pretrained("robometer/Robometer-4B") model = RFM.from_pretrained("robometer/Robometer-4B") - Notebooks
- Google Colab
- Kaggle
Add robotics pipeline tag and improve model card
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,28 +1,30 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
base_model: Qwen/Qwen3-VL-4B-Instruct
|
| 4 |
-
tags:
|
| 5 |
-
- reward model
|
| 6 |
-
- robot learning
|
| 7 |
-
- foundation models
|
| 8 |
library_name: transformers
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
---
|
| 10 |
|
| 11 |
# Robometer 4B
|
| 12 |
|
| 13 |
-
**
|
| 14 |
|
| 15 |
-
**Robometer** is a general-purpose vision-language reward model for robotics. It is trained on [RBM-1M](https://huggingface.co/datasets/)
|
| 16 |
|
| 17 |
-
|
| 18 |
|
| 19 |
-
|
| 20 |
-
- **Per-frame
|
| 21 |
-
- **
|
|
|
|
| 22 |
|
| 23 |
### Usage
|
| 24 |
|
| 25 |
-
For full setup
|
| 26 |
|
| 27 |
**Option 1 — Run the model locally** (loads this checkpoint from Hugging Face):
|
| 28 |
|
|
@@ -56,10 +58,10 @@ uv run python scripts/example_inference.py \
|
|
| 56 |
If you use this model, please cite:
|
| 57 |
|
| 58 |
```bibtex
|
| 59 |
-
@
|
| 60 |
title = {Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons},
|
| 61 |
-
author={Anthony Liang and Yigit Korkmaz and Jiahui Zhang and Minyoung Hwang and Abrar Anwar and Sidhant Kaushik and Aditya Shah and Alex S. Huang and Luke Zettlemoyer and Dieter Fox and Yu Xiang and Anqi Li and Andreea Bobu and Abhishek Gupta and Stephen Tu and Erdem Biyik and Jesse Zhang},
|
| 62 |
-
year={2026},
|
| 63 |
-
|
| 64 |
}
|
| 65 |
-
```
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
base_model: Qwen/Qwen3-VL-4B-Instruct
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
library_name: transformers
|
| 4 |
+
license: apache-2.0
|
| 5 |
+
pipeline_tag: robotics
|
| 6 |
+
tags:
|
| 7 |
+
- reward model
|
| 8 |
+
- robot learning
|
| 9 |
+
- foundation models
|
| 10 |
---
|
| 11 |
|
| 12 |
# Robometer 4B
|
| 13 |
|
| 14 |
+
[**Project Page**](https://robometer.github.io/) | [**Paper**](https://arxiv.org/abs/2603.02115) | [**GitHub**](https://github.com/robometer/robometer)
|
| 15 |
|
| 16 |
+
**Robometer** is a general-purpose vision-language reward model for robotics. It is trained on [RBM-1M](https://huggingface.co/datasets/) (a dataset of over one million trajectories) using a **Qwen3-VL-4B** backbone to predict **per-frame progress**, **per-frame success**, and **trajectory preferences** from rollout videos.
|
| 17 |
|
| 18 |
+
The model combines frame-level progress supervision on expert data with trajectory-comparison preference supervision, allowing it to learn from both successful and failed rollouts and generalize across diverse robot embodiments and tasks.
|
| 19 |
|
| 20 |
+
Given a **task instruction** and a **rollout video** (or frame sequence), the model predicts:
|
| 21 |
+
- **Per-frame progress**: Continuous progress values over time (e.g., 0–1 or binned).
|
| 22 |
+
- **Per-frame success**: Success probability (or binary) at each timestep.
|
| 23 |
+
- **Preference / ranking**: Which of two trajectories is better for the task.
|
| 24 |
|
| 25 |
### Usage
|
| 26 |
|
| 27 |
+
For full setup and configurations, see the [GitHub repository](https://github.com/robometer/robometer).
|
| 28 |
|
| 29 |
**Option 1 — Run the model locally** (loads this checkpoint from Hugging Face):
|
| 30 |
|
|
|
|
| 58 |
If you use this model, please cite:
|
| 59 |
|
| 60 |
```bibtex
|
| 61 |
+
@inproceedings{liang2026robometer,
|
| 62 |
title = {Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons},
|
| 63 |
+
author = {Anthony Liang and Yigit Korkmaz and Jiahui Zhang and Minyoung Hwang and Abrar Anwar and Sidhant Kaushik and Aditya Shah and Alex S. Huang and Luke Zettlemoyer and Dieter Fox and Yu Xiang and Anqi Li and Andreea Bobu and Abhishek Gupta and Stephen Tu and Erdem Biyik and Jesse Zhang},
|
| 64 |
+
year = {2026},
|
| 65 |
+
booktitle = {Robotics: Science and Systems 2026},
|
| 66 |
}
|
| 67 |
+
```
|