RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies
Abstract
RoboChallenge is an online evaluation system for robotic control algorithms, particularly VLA models, that addresses the need for large-scale testing with scalability and reproducibility.
Testing on real machines is indispensable for robotic control algorithms. In the context of learning-based algorithms, especially VLA models, demand for large-scale evaluation, i.e. testing a large number of models on a large number of tasks, is becoming increasingly urgent. However, doing this right is highly non-trivial, especially when scalability and reproducibility is taken into account. In this report, we describe our methodology for constructing RoboChallenge, an online evaluation system to test robotic control algorithms, and our survey of recent state-of-the-art VLA models using our initial benchmark Table30.
Community
RoboChallenge is an online evaluation system designed to scalably and reproducibly test learning-based robotic control algorithms, including state-of-the-art VLA models, on large-scale benchmarks.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Robot Control Stack: A Lean Ecosystem for Robot Learning at Scale (2025)
- Bridging Embodiment Gaps: Deploying Vision-Language-Action Models on Soft Robots (2025)
- RoboManipBaselines: A Unified Framework for Imitation Learning in Robotic Manipulation across Real and Simulated Environments (2025)
- ManiAgent: An Agentic Framework for General Robotic Manipulation (2025)
- Ctrl-World: A Controllable Generative World Model for Robot Manipulation (2025)
- SITCOM: Scaling Inference-Time COMpute for VLAs (2025)
- SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper