Penghui Qi

QPHutu

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Rethinking the Divergence Regularization in LLM RL

upvoted a paper about 2 months ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

upvoted a paper about 2 months ago

Rethinking the Divergence Regularization in LLM RL

Organizations

Collections 4

Papers 10

arxiv:2606.09821

arxiv:2602.04879

arxiv:2601.19362

arxiv:2510.26788

Models 0

None public yet

Datasets 0

None public yet