Physical AI Bench Leaderboard 🤖 (Running; Agents; 20): Benchmark for Physical AI generation and understanding
Leaderboard: Physical Reasoning from Video 🏃 (Runtime error; Agents; 48): Submit model evaluations and view leaderboard results