Sergio Paniego PRO
AI & ML interests
None yet
Recent Activity
updated a dataset about 5 hours ago
agents-course/final-certificates updated a dataset about 5 hours ago
agents-course/course-certificates-of-excellence posted an update about 22 hours ago
If you have a github repo, you basically have an RL training environment
We're introducing Repo2RLEnv (built by @AdithyaSK), a tool that mines PRs, commits, CVEs and turns them into verifiable sandboxed tasks with real reward signals, automatically
Outputs to Harbor spec so you can plug it straight into RL training or coding-agent eval
> repo: https://github.com/huggingface/Repo2RLEnv
> collection with envs: https://huggingface.co/collections/AdithyaSK/repo2rlenv-verifiable-rl-environments