arxiv:2501.02629
Yang Ouyang
OriDragon2000
AI & ML interests
Safety and Efficiency
Recent Activity
liked a dataset about 1 month ago
hotpotqa/hotpot_qa upvoted a paper 3 months ago
The Art of Efficient Reasoning: Data, Reward, and Optimization liked a model 5 months ago
google/gemma-2-2b-itOrganizations
None yet