Minsitral and Devstral
Walter Troiani Vargas
eZWALT
·
AI & ML interests
None yet
Recent Activity
updated a collection 3 days ago
Mistral Compressions updated a collection 4 days ago
Mistral Compressions updated a collection 4 days ago
Mistral CompressionsOrganizations
Production LLMs
- RunningFeatured1.36k
FineWeb: decanting the web for the finest text data at scale
🍷1.36kExplore and download the FineWeb web‑scale text dataset
- Running3.88k
The Ultra-Scale Playbook
🌌3.88kThe ultimate guide to training LLM on large GPU Clusters
- RunningAgents109
Predict Memory
🧮109Estimate model memory usage and see detailed plots
Mistral Compressions
Minsitral and Devstral
Production LLMs
- RunningFeatured1.36k
FineWeb: decanting the web for the finest text data at scale
🍷1.36kExplore and download the FineWeb web‑scale text dataset
- Running3.88k
The Ultra-Scale Playbook
🌌3.88kThe ultimate guide to training LLM on large GPU Clusters
- RunningAgents109
Predict Memory
🧮109Estimate model memory usage and see detailed plots
models 6
eZWALT/NanoChimera-Qwen2.5-siglip2-VLM-pretrained
Updated
eZWALT/SmolLM2-135M-Pedantic-PPO
Text Generation • 0.1B • Updated • 1
eZWALT/SmolLM2-135M-Pedantic-DPO
Text Generation • 0.1B • Updated • 1
eZWALT/SmolLM2-135M-Pedantic-GRPO
Text Generation • 0.1B • Updated • 3
eZWALT/SmolLM2-135M-Pedantic-SFT-Instruct
Text Generation • 0.1B • Updated • 1
eZWALT/SmolLM2-135M-Pedantic-Reward-Model
Text Classification • 0.1B • Updated • 5