jackpan's picture

6 1

jackpan

jackpang

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts

upvoted a paper 14 days ago

RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution

upvoted a paper 14 days ago

Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation

View all activity

Organizations

None yet

upvoted 4 papers 14 days ago

ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts

Paper • 2505.10010 • Published May 15, 2025 • 3

RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution

Paper • 2603.20799 • Published Mar 21 • 1

Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation

Paper • 2302.09368 • Published Feb 18, 2023 • 1

EDCO: Dynamic Curriculum Orchestration for Domain-specific Large Language Model Fine-tuning

Paper • 2601.03725 • Published Jan 7 • 1

upvoted a paper 2 months ago

Reinforcement Learning with Promising Tokens for Large Language Models

Paper • 2602.03195 • Published Feb 3 • 1

upvoted a paper 8 months ago

Language Model Self-improvement by Reinforcement Learning Contemplation

Paper • 2305.14483 • Published May 23, 2023 • 1

liked a dataset about 1 year ago

NJU-RLer/ImagineBench

Viewer • Updated Dec 21, 2025 • 747k • 547 • 2