Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
3
Ilze Amanda Auzina
iaa01
Follow
John6666's profile picture
juliane-v's profile picture
2 followers
·
2 following
https://ilzeamandaa.github.io/
AI & ML interests
RL Post-Training | Reasoning and Exploration | Open-ended
Organizations
Papers
1
arxiv:
2502.04313
models
11
Sort: Recently updated
iaa01/CIA-1.7B
2B
•
Updated
Feb 13
•
13
•
1
iaa01/CIA-4B
4B
•
Updated
Feb 13
•
34
•
3
iaa01/qwen3-4b-elicit-pos-ckpt72
4B
•
Updated
Jan 10
iaa01/qwen3-4b-elicit-pos
4B
•
Updated
Jan 7
•
3
iaa01/llama-8b-merge-alpha1-freq10
8B
•
Updated
Nov 28, 2025
iaa01/llama-8b-grpo-no-kl
8B
•
Updated
Nov 28, 2025
•
2
iaa01/llama-8b-grpo-kl
8B
•
Updated
Nov 28, 2025
•
2
iaa01/llama-8b-merge-alpha08-freq10
8B
•
Updated
Nov 28, 2025
•
1
iaa01/llama-8b-merge-alpha05-freq10
8B
•
Updated
Nov 28, 2025
iaa01/qwen3-1.7b-sft-grpo
2B
•
Updated
May 20, 2025
View 11 models
datasets
0
None public yet