Running 3.86k The Ultra-Scale Playbook 🌌 3.86k The ultimate guide to training LLM on large GPU Clusters
Running 599 Scaling test-time compute 📈 599 Run advanced search strategies to boost LLM problem solving
Running Agents 230 BigCodeBench Leaderboard 🥇 230 Explore code-generation model leaderboards and task details
Transforming and Combining Rewards for Aligning Large Language Models Paper • 2402.00742 • Published Feb 1, 2024 • 12