-
The Ultra-Scale Playbook
3.86k · The ultimate guide to training LLM on large GPU Clusters
-
The Smol Training Playbook
3.2k · The secrets to building world-class LLMs
-
FineWeb: decanting the web for the finest text data at scale
1.35k · Explore and download the FineWeb web-scale text dataset
-
Unlocking On-Policy Distillation for Any Model Family
108 · Visualize on-policy distillation for any model family
Aditya Bhosale
croeasusking
AI & ML interests
None yet
Recent Activity
updated a collection 1 day ago: HF Books
liked a Space 1 day ago: dlouapre/eiffel-tower-llama
updated a collection 17 days ago: HF Books
Organizations
None yet