arxiv:2508.03680
Zhiyuan He
hzy46
AI & ML interests
None yet
Recent Activity
upvoted a paper 7 days ago
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?
upvoted a paper 7 months ago
ΔL Normalization: Rethink Loss Aggregation in RLVR
commented on a paper 7 months ago
ΔL Normalization: Rethink Loss Aggregation in RLVR
Organizations
None yet