The Smol Training Playbook • 3.2k • The secrets to building world-class LLMs
CP-Bench: Evaluating Large Language Models for Constraint Modelling Paper • 2506.06052 • Published Jun 6, 2025 • 3
Text2Zinc: A Cross-Domain Dataset for Modeling Optimization and Satisfaction Problems in MiniZinc Paper • 2503.10642 • Published Feb 22, 2025 • 2
Scaling test-time compute • 600 • Boost LLM answers with flexible test-time search strategies