Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
10
2
Jiahe Jin
zizi-0123
Follow
henryhe0123's profile picture
jzguo's profile picture
2 followers
·
3 following
zizi0123
AI & ML interests
None yet
Organizations
None yet
Papers
2
arxiv:
2505.13909
arxiv:
2412.17589
models
35
Sort: Recently updated
zizi-0123/mhqa_llama_grpo
Updated
Jan 8
zizi-0123/web_llama_sft_correct
Text Generation
•
3B
•
Updated
Jan 8
•
1
zizi-0123/web_llama_sft_correct_grpo
Updated
Jan 8
zizi-0123/mhqa_llama_sft_behavior
Text Generation
•
3B
•
Updated
Jan 8
•
1
zizi-0123/mhqa_llama_sft_behavior_grpo
Updated
Jan 8
zizi-0123/OLMo2-1B-midtrain-run1
1B
•
Updated
Dec 15, 2025
•
3
zizi-0123/mhqa_llama_sft_random_grpo
Updated
Nov 16, 2025
zizi-0123/mhqa_llama_sft_correct_grpo
Updated
Nov 16, 2025
zizi-0123/web_qwen_sft_singlebehavior_grpo
Updated
Nov 7, 2025
zizi-0123/web_llama_sft_random_grpo
Updated
Nov 7, 2025
View 35 models
datasets
0
None public yet