Penghui Qi

QPHutu

·

QPHutu

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Rethinking the Divergence Regularization in LLM RL

upvoted a paper about 2 months ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

upvoted a paper about 2 months ago

Rethinking the Divergence Regularization in LLM RL

View all activity

Organizations

QPHutu 's papers 10

arxiv:2606.09821

arxiv:2602.04879

arxiv:2601.19362

arxiv:2510.26788

arxiv:2505.13438

arxiv:2503.20783

arxiv:2503.01328

arxiv:2411.05288

arxiv:2405.15362

arxiv:2401.10241