This collection contains all the GRPO-trained models for our paper "A Rising Tide Lifts All Boats". Please consider citing us!
Ishika Agarwal
ishikaa
·
AI & ML interests
active learning, reinforcement learning, reasoning, planning, NLP
Recent Activity
updated a model 19 days ago
ishikaa/UAS_student_qwen7b_tulu_minimax9 published a model 19 days ago
ishikaa/UAS_student_qwen7b_tulu_minimax9 updated a model 19 days ago
ishikaa/UAS_qwen7b_tulu_minimax9