Xin-Qiang Cai's picture

2

Xin-Qiang Cai

caixq

https://caixq1996.github.io/

caixq1996

AI & ML interests

RL, RLHF, Learning under Weak Supervision, Diffusion Model

Recent Activity

upvoted a paper 3 months ago

VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

authored a paper 8 months ago

PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models

authored a paper 8 months ago

Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

View all activity

Organizations

None yet

upvoted a paper 3 months ago

VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

Paper • 2602.12579 • Published Feb 13 • 2

authored 2 papers 8 months ago

PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models

Paper • 2507.17220 • Published Jul 23, 2025 • 1

Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

Paper • 2510.00915 • Published Oct 1, 2025 • 3

upvoted a paper 8 months ago

Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

Paper • 2510.00915 • Published Oct 1, 2025 • 3