Minsitral and Devstral
Walter Troiani Vargas
eZWALT
·
AI & ML interests
None yet
Recent Activity
updated a collection 3 days ago
Mistral Compressions updated a collection 4 days ago
Mistral Compressions updated a collection 4 days ago
Mistral CompressionsOrganizations
Production LLMs
- RunningFeatured1.36k
FineWeb: decanting the web for the finest text data at scale
🍷1.36kExplore and download the FineWeb web‑scale text dataset
- Running3.88k
The Ultra-Scale Playbook
🌌3.88kThe ultimate guide to training LLM on large GPU Clusters
- RunningAgents109
Predict Memory
🧮109Estimate model memory usage and see detailed plots
Multimodal NanoChimera
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 22.1k • 587 -
google/siglip2-base-patch16-224
Zero-Shot Image Classification • 0.4B • Updated • 356k • 106 -
openai/clip-vit-base-patch32
Zero-Shot Image Classification • Updated • 21.3M • 958 -
google/siglip2-base-patch16-512
Zero-Shot Image Classification • 0.4B • Updated • 153k • 46
RLHF Resources
Mistral Compressions
Minsitral and Devstral
Production LLMs
- RunningFeatured1.36k
FineWeb: decanting the web for the finest text data at scale
🍷1.36kExplore and download the FineWeb web‑scale text dataset
- Running3.88k
The Ultra-Scale Playbook
🌌3.88kThe ultimate guide to training LLM on large GPU Clusters
- RunningAgents109
Predict Memory
🧮109Estimate model memory usage and see detailed plots
TFM
Multimodal NanoChimera
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 22.1k • 587 -
google/siglip2-base-patch16-224
Zero-Shot Image Classification • 0.4B • Updated • 356k • 106 -
openai/clip-vit-base-patch32
Zero-Shot Image Classification • Updated • 21.3M • 958 -
google/siglip2-base-patch16-512
Zero-Shot Image Classification • 0.4B • Updated • 153k • 46
Pretraining Corpora
RLHF Resources
Cursed Toxic Pretraining Corpora