Hugging Face
Open to Collab
Jiayu (Mila) Wang
PRO
MilaWang
3 followers · 5 following
http://jiayuww.github.io
jiayuwang111
jiayuww
jiayu-mila-wang
jiayuwang.bsky.social
AI & ML interests
Large Language Model, Multimodal Large Language Model, Agentic System, Reasoning, Efficiency
MilaWang's activity
updated a model about 16 hours ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-handrolled-v2 (Updated 9 minutes ago)

published a model about 16 hours ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-handrolled-v2 (Updated 9 minutes ago)

updated a model about 17 hours ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-hp (Updated 1 minute ago)

updated a model about 19 hours ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-handrolled-rin0.1-hier (Updated 28 minutes ago)

published a model about 19 hours ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-handrolled-rin0.1-hier (Updated 28 minutes ago)

updated a model about 22 hours ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-handrolled-rin0.1 (Updated 10 minutes ago)

updated 3 models about 23 hours ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-answeronly-handrolled (Updated 9 minutes ago)
MilaWang/grpo-fullparam-qwen2-5-math-7b-answeronly (Updated about 5 hours ago)
MilaWang/grpo-fullparam-qwen2-5-math-7b-hierarchical (Updated 14 minutes ago)

published a model about 23 hours ago
MilaWang/grpo-fullparam-qwen2-5-math-7b-hierarchical (Updated 14 minutes ago)

published a model 1 day ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-handrolled-rin0.1 (Updated 10 minutes ago)

updated a collection 1 day ago
LiveResearchBench (Collection, 3 items, Updated 1 day ago)

published 2 models 2 days ago
MilaWang/grpo-fullparam-qwen2-5-math-7b-answeronly (Updated about 5 hours ago)
MilaWang/lirpg-fullparam-qwen2-5-math-7b-hp (Updated 1 minute ago)

updated a model 3 days ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-handrolled (Updated 2 days ago)

published a model 3 days ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-handrolled (Updated 2 days ago)

updated a model 3 days ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-longseq-handrolled (Updated 2 days ago)

published 2 models 3 days ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-longseq-handrolled (Updated 2 days ago)
MilaWang/lirpg-fullparam-qwen2-5-math-7b-answeronly-handrolled (Updated 9 minutes ago)

updated a model 4 days ago
MilaWang/lirpg-fullparam-qwen2-5-math-7b-answeronly (Updated 3 days ago)