Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues Paper • 2606.02754 • Published 2 days ago • 9
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 3 days ago • 133
CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-forward_k3-clipLow_inf-clipHigh_inf 2B • Updated 16 days ago • 62
CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-forward_k3-clipLow_inf-clipHigh_inf 2B • Updated 16 days ago • 62
CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-reverse_k3-clipLow_inf-clipHigh_inf 2B • Updated 16 days ago • 18
CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-reverse_k3-clipLow_inf-clipHigh_inf 2B • Updated 16 days ago • 18