XuQixin

Racktic

·

Racktic

AI & ML interests

NLP, mutimodel

Recent Activity

updated a model about 22 hours ago

Racktic/alchemy-qwen3-ckpt

updated a dataset 6 days ago

Racktic/alchemy-eval-logs

published a model 10 days ago

Racktic/alchemy-qwen3-ckpt

View all activity

Organizations

authored a paper 26 days ago

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

Paper • 2606.00408 • Published May 29 • 65

authored a paper about 2 months ago

MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI

Paper • 2605.08678 • Published May 9 • 9

authored 2 papers 10 months ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Paper • 2509.03646 • Published Sep 3, 2025 • 33

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 151

authored a paper over 1 year ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3, 2025 • 62