Ilze Amanda Auzina

iaa01

3

·

https://ilzeamandaa.github.io/

AI & ML interests

RL Post-Training | Reasoning and Exploration | Open-ended

Organizations

Papers 1

arxiv:2502.04313

models 11

iaa01/CIA-1.7B

2B • Updated Feb 13 • 6 • 1

iaa01/CIA-4B

4B • Updated Feb 13 • 7 • 3

iaa01/qwen3-4b-elicit-pos-ckpt72

4B • Updated Jan 10 • 2

iaa01/qwen3-4b-elicit-pos

4B • Updated Jan 7 • 3

iaa01/llama-8b-merge-alpha1-freq10

8B • Updated Nov 28, 2025 • 2

iaa01/llama-8b-grpo-no-kl

8B • Updated Nov 28, 2025 • 3

iaa01/llama-8b-grpo-kl

8B • Updated Nov 28, 2025 • 1

iaa01/llama-8b-merge-alpha08-freq10

8B • Updated Nov 28, 2025 • 1

iaa01/llama-8b-merge-alpha05-freq10

8B • Updated Nov 28, 2025 • 1

iaa01/qwen3-1.7b-sft-grpo

2B • Updated May 20, 2025 • 4

datasets 0

None public yet