SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search Paper • 2605.29796 • Published 5 days ago • 14
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models Paper • 2605.18879 • Published 13 days ago • 5
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 25 days ago • 69
G-Zero: Self-Play for Open-Ended Generation from Zero Data Paper • 2605.09959 • Published 22 days ago • 17
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data Paper • 2603.09206 • Published Mar 10 • 53
TTCS: Test-Time Curriculum Synthesis for Self-Evolving Paper • 2601.22628 • Published Jan 30 • 35 • 3
VisPlay: Self-Evolving Vision-Language Models from Images Paper • 2511.15661 • Published Nov 19, 2025 • 45