Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs • Paper 2605.09063 • Published 27 days ago
Visualize Dataset (v2.0+ latest dataset format): Explore and visualize LeRobot datasets easily