FINAL_Bench

company

https://www.vidraft.net

AI & ML interests

Contact: arxivgpt@gmail.com

Recent Activity

SeaWolf-AI updated a Space about 15 hours ago

FINAL-Bench/ax-diagnostic

SeaWolf-AI published a Space about 15 hours ago

FINAL-Bench/ax-diagnostic

SeaWolf-AI new activity 3 days ago

FINAL-Bench/JGOS-398B-fp8: fp8 of DARWIN 398b BF16?

Papers

Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

Articles

Quantum Cryptanalysis on Real Hardware: Pushing Symmetric-Structure Key Recovery Beyond the Published Frontier

Adding a GPU Without Building One

Chitos: From Detection to Proof — An Autonomous Security AI That Actually Exploits

FINAL-Bench Quantum: An Open, Neutral Benchmark for Quantum-Computing Methods

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion

"Darwin-27B-Opus: Surpassing the Foundation Model Without Training"

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

FINAL-Bench's collections (3)