Beyond Clicking: A Step Towards Generalist GUI Grounding via Text Dragging Paper • 2601.06031 • Published Nov 7, 2025
QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks Paper • 2605.24218 • Published 9 days ago • 35
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents Paper • 2510.24702 • Published Oct 28, 2025 • 31
Mind2Web 2 Collection Evaluating Agentic Search with Agent-as-a-Judge • 2 items • Updated Jun 27, 2025 • 2
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published Jul 1, 2025 • 79
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Paper • 2506.21506 • Published Jun 26, 2025 • 52