Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step 5 days ago • 17
🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do Mar 10 • 38
DARWIN-Family, 비드래프트 (VIDraft)
FINAL-Bench/Metacognitive • Updated Feb 27 • 100 • 953 • 89
Leaderboard - FINAL Bench 'Metacognitive' 🚀 • Running • Featured • 49
ALL Bench Leaderboard 🚀 • Running • 79
FINAL-Bench/Darwin-4B-Genesis • Text Generation • 8B • Updated 4 days ago • 584 • 33