Bingxiang He

hbx

https://hbx-hbx.github.io/

AI & ML interests

NLP

Recent Activity

commented on a paper about 3 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

upvoted a paper about 3 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

submitted a paper about 3 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Organizations

commented on a paper about 3 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 1 day ago • 19

upvoted a paper about 3 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 1 day ago • 19

submitted a paper to Daily Papers about 3 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 1 day ago • 19

commented on a paper about 1 month ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 58

upvoted a paper about 1 month ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 58

submitted a paper to Daily Papers about 1 month ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 58

liked a model 2 months ago

openbmb/MiniCPM-SALA

Text Generation • 9B • Updated 12 days ago • 1.25k • 497

upvoted a paper 2 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

liked a model 2 months ago

openbmb/MiniCPM-o-4_5

Any-to-Any • 9B • Updated Mar 7 • 22.8k • 924

liked a model 3 months ago

openbmb/AgentCPM-Explore

Text Generation • 4B • Updated Jan 18 • 122 • 328

updated 2 models 4 months ago

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated Dec 29, 2025 • 233 • 3

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated Dec 29, 2025 • 2.08k • 10

upvoted a collection 4 months ago

JustRL

Collection

2 items • Updated Nov 1, 2025 • 5

New activity in hbx/JustRL-Nemotron-1.5B 4 months ago

Add Hugging Face paper link badge to model card

#1 opened 4 months ago by nielsr

New activity in hbx/JustRL-DeepSeek-1.5B 4 months ago

Improve model card: Update title, add paper link, correct license and citation

#1 opened 4 months ago by nielsr

commented on a paper 4 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

upvoted a paper 4 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

submitted a paper to Daily Papers 4 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

upvoted a paper 5 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

liked a model 5 months ago

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated Dec 29, 2025 • 2.08k • 10
