The Verification Horizon: No Silver Bullet for Coding Agent Rewards Paper • 2606.26300 • Published 4 days ago • 38
FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation Paper • 2606.24876 • Published 5 days ago • 20
JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting Paper • 2606.18394 • Published 3 days ago • 27
Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models Paper • 2606.25041 • Published 5 days ago • 87
Ornith-1.0 Collection Ornith-1.0 is a family of open-source LLMs specialized for agentic coding. • 8 items • Updated about 13 hours ago • 179
view article Article Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel nvidia • 3 days ago • 26
CalVerT: Augmenting Agents with Calibrated Verifier Telemetry Improves Action and Learning in Knowledge-Intensive Tasks Paper • 2606.21777 • Published 9 days ago • 4
PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems Paper • 2606.22388 • Published 7 days ago • 95
HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry Paper • 2606.14249 • Published 16 days ago • 49
Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages Paper • 2606.20517 • Published 10 days ago • 60
S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence Paper • 2606.20515 • Published 10 days ago • 39
Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games Paper • 2606.19338 • Published 11 days ago • 48
OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation Paper • 2606.17628 • Published 12 days ago • 27
view article Article From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot amazon • 10 days ago • 15
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 12 days ago • 207
JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 18 days ago • 204
Sumi: Open Uniform Diffusion Language Model from Scratch Paper • 2606.19005 • Published 11 days ago • 11