arxiv:2602.12735
Yu Zeng
YuZeng260
AI & ML interests
VLMs, LLMs, RL, Agent, Reasoning
Recent Activity
upvoted a paper about 3 hours ago
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning
liked a Space 9 days ago
HuggingFaceH4/on-policy-distillation
upvoted a paper 13 days ago
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?