Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
3
CHENyongxi
chenyongxi
Follow
0 followers
·
5 following
icyyymilk
AI & ML interests
RL rollout、Sampling
Recent Activity
published
a model
about 2 months ago
chenyongxi/Qwen2.5-0.5B-SFT-Safe
updated
a model
about 2 months ago
chenyongxi/Qwen2.5-1.5B-SFT-DPO-InfinityPreference
published
a model
about 2 months ago
chenyongxi/Qwen2.5-1.5B-SFT-DPO-InfinityPreference
View all activity
Organizations
None yet
chenyongxi
's models
44
Sort: Recently updated
chenyongxi/Qwen2.5-0.5B-SFT-Safe
Updated
Apr 3
chenyongxi/Qwen2.5-1.5B-SFT-DPO-InfinityPreference
Text Generation
•
2B
•
Updated
Apr 3
•
4
•
chenyongxi/Qwen2.5-1.5B-SFT-PPO-InfinityPreference
2B
•
Updated
Apr 3
chenyongxi/Qwen2.5-1.5B-SFT-InfinityPreference
2B
•
Updated
Apr 3
•
1
chenyongxi/Qwen2.5-1.5B-SFT-PPO-IP
Updated
Apr 2
chenyongxi/Qwen2.5-1.5B-DPO-1.5B
Text Generation
•
2B
•
Updated
Apr 2
•
7
•
chenyongxi/Qwen2-1.5B-SFT-PPO-IP
Updated
Apr 2
chenyongxi/Qwen2.5-1.5B-SFT-IP
Text Generation
•
2B
•
Updated
Apr 1
•
4
•
chenyongxi/Qwen2-1.5B-SFT-IF
Text Generation
•
2B
•
Updated
Mar 31
•
5
•
chenyongxi/Qwen2.5-1.5B-PPO-IP
Updated
Mar 30
chenyongxi/Qwen2.5-1.5B-RM-IP
Text Classification
•
2B
•
Updated
Mar 30
•
4
chenyongxi/DRPO_tldr
Updated
Mar 26
chenyongxi/Qwen2.5-0.5B-RM-HH
Text Classification
•
0.5B
•
Updated
Mar 25
•
4
chenyongxi/DRPO_tldr_exp
Updated
Mar 25
chenyongxi/Qwen2-0.5B-SFT-HH
Text Generation
•
0.5B
•
Updated
Mar 25
•
4
•
chenyongxi/DPO_TLDR_1B_checkpoint_2178
1B
•
Updated
Mar 25
•
4
chenyongxi/DPO_TLDR_1B_checkpoint_2000
1B
•
Updated
Mar 25
•
4
chenyongxi/DPO_TLDR_1B_checkpoint_1500
1B
•
Updated
Mar 25
•
4
chenyongxi/DPO_TLDR_1B_checkpoint_1000
1B
•
Updated
Mar 25
•
1
chenyongxi/DPO_TLDR_1B_checkpoint_500
1B
•
Updated
Mar 25
•
3
chenyongxi/DRPO_TLDR_1B_checkpoint_9000
1B
•
Updated
Mar 24
•
3
chenyongxi/DRPO_TLDR_1B_checkpoint_8000
1B
•
Updated
Mar 24
•
1
chenyongxi/DRPO_TLDR_1B_checkpoint_7000
1B
•
Updated
Mar 24
•
2
chenyongxi/DRPO_TLDR_1B_checkpoint_6000
1B
•
Updated
Mar 24
•
1
chenyongxi/DRPO_TLDR_1B_checkpoint_5000
1B
•
Updated
Mar 24
•
1
chenyongxi/DRPO_TLDR_1B_checkpoint_4000
1B
•
Updated
Mar 24
•
1
chenyongxi/DRPO_TLDR_1B_checkpoint_3000
1B
•
Updated
Mar 24
•
1
chenyongxi/DRPO_TLDR_1B_checkpoint_2000
1B
•
Updated
Mar 24
•
1
chenyongxi/DRPO_TLDR_1B_checkpoint_1000
1B
•
Updated
Mar 24
•
3
chenyongxi/TLDR_trl_DPO_1B_checkpoint_2000
1B
•
Updated
Mar 24
•
5
Previous
1
2
Next