Running Featured 85 Distilling 100B+ Models 40x Faster with TRL 📝 85 TRL distillation for 100B+ teachers, 40x faster
Running on CPU Upgrade 239 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 239 Explore synthetic data experiments on an interactive bookshelf
Running on CPU Upgrade Featured 3.19k The Smol Training Playbook 📚 3.19k The secrets to building world-class LLMs
Running 3.86k The Ultra-Scale Playbook 🌌 3.86k The ultimate guide to training LLM on large GPU Clusters
Running on L40S Agents 612 MinerU Document Extraction Tools 📚 612 Easy converting PDF and Office docs into Markdown and JSON
Running 600 Scaling test-time compute 📈 600 Boost LLM answers with flexible test‑time search strategies
Running 6 PL-MTEB: Polish Massive Text Embedding Benchmark 📈 6 Display evaluation results in a leaderboard
Running Featured 1.05k Can You Run It? LLM version 🚀 1.05k Check if your GPU can run a chosen LLM model