Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Open to Collab
16.5
TFLOPS
20
230
687
specimba
specimba
Follow
PhysiQuanty's profile picture
AgPerry's profile picture
K0D3IN's profile picture
10 followers
ยท
260 following
specimba
specimba
AI & ML interests
None yet
Recent Activity
new
activity
about 10 hours ago
build-small-hackathon/Darwin-35B-A3B-Opus:
Fix Darwin 35B load: CPU offload flag + spaces pkg + persistent cache
liked
a model
about 11 hours ago
FINAL-Bench/Darwin-60B-DUO
reacted
to
SeaWolf-AI
's
post
with โค๏ธ
about 15 hours ago
Darwin-60B-DUO: Two SOTAs, One Endpoint โ 88.38% on GPQA Diamond ๐ We're excited to release Darwin-60B-DUO, the Darwin family's first DUO model. Take two domain-verified specialists, hide them behind a single OpenAI-compatible endpoint, and let a router decide which one (or both) answers. You see one model, one API โ but get the best of both. The number that matters: on the full 198-question GPQA Diamond, Darwin-60B-DUO hits 88.38%. The constituents alone land at 69.70% (Darwin-28B-REASON) and 77.27% (AWAXIS-Think-31B); a naive cascade only reaches 83.84%. The DUO clears them all. Two small specialists, intelligently routed, beat one big generalist on cost and quality. Both are independently verified โ Darwin-28B-REASON is #3 on the HF GPQA Diamond leaderboard, AWAXIS-Think-31B is #1 on Korea's national K-AI Leaderboard (MSIT). The brains is a Hybrid-A router picking one of five strategies on the fly. Korean โ AWAXIS, English/STEM โ Darwin (single-backend, ~70% of traffic at 1ร cost). When a Korean answer needs rigorous English reasoning, split_refine fires โ Darwin drafts, AWAXIS polishes; MCQ/short-answer runs both with self-consistency + cross-verify. Net effective cost: only ~1.3ร a single 30B model. The part the community will care about: the gateway is model-agnostic and Apache-2.0. Point it at any two OpenAI-compatible backends and you've got a DUO in minutes โ teach router.py when to use which, and parallel calls, response merging, and routing transparency via _duo_route are handled for you. Fork it and tell us what you built. Painless deploy: docker compose up for both vLLM backends + gateway; FP8 ~30GB colocates on a single B200/H100. One git clone (~120GB). Text-only for now, streaming in v1.1. Two SOTAs, one endpoint. Come build your own on the Community tab. ๐ ๐ https://huggingface.co/FINAL-Bench/Darwin-60B-DUO
View all activity
Organizations
specimba
's models
3
Sort:ย Recently updated
specimba/nexus-os-v2-dualmode
Updated
18 days ago
โข
1
specimba/nexus-os-v2
Updated
20 days ago
specimba/chimera-router
Updated
Apr 30
โข
1