Hugging Face
Open to Collab
ben burtenshaw
burtenshaw
4,671 followers · 486 following
ben_burtenshaw
burtenshaw
ben-burtenshaw
AI & ML interests
None yet
Recent Activity
reacted to sergiopaniego's post with 🔥 about 2 hours ago
OpenEnv has a new home: github.com/huggingface/OpenEnv

Starting today, it's coordinated by a committee that includes Meta-PyTorch, Reflection, Unsloth, Modal, Prime Intellect, Nvidia, Mercor, Fleet AI, and Hugging Face

frontier labs train their models and their harnesses together. Claude knows Claude Code. GPT-5.5 knows Codex. that's not an accident, it's training.

open-source models deserve the same magic, but pulling that off requires infrastructure that belongs to everyone, not one lab

OpenEnv is that layer. one api, any harness, any trainer, any environment

Rewards and training loops stay in TRL, Unsloth, wherever you already work. OpenEnv is the socket they all plug into

Get involved! Full announcement: https://huggingface.co/blog/openenv-agentic-rl
updated a dataset about 2 hours ago
huggingface-course/supervised-finetuning_quiz_student_responses
upvoted an article about 3 hours ago
The Open Source Community is backing OpenEnv for Agentic RL
Organizations
burtenshaw's models (97)
burtenshaw/gemma-4-12b-sdpo-pi-mono-brevity-topk-v5
Updated
4 days ago
•
27
•
1
burtenshaw/gemma-4-12b-sdpo-pi-mono-brevity-topk-v4
Updated
4 days ago
burtenshaw/gemma-4-12b-sdpo-pi-mono-trace-feedback-v3
Updated
4 days ago
•
29
•
1
burtenshaw/gemma-4-12b-sdpo-pi-mono-trace-feedback-v2
Updated
4 days ago
•
29
•
1
burtenshaw/gemma-4-12b-sdpo-pi-mono-trace-feedback
Updated
4 days ago
•
22
burtenshaw/qwen2.5-0.5b-sdpo-pi-mono-smoke
Updated
5 days ago
burtenshaw/terminus-pi-trl-qwen3-5-4b-hard-22213506
Updated
6 days ago
burtenshaw/terminus-pi-trl-qwen3-5-4b-hard-22213429
Updated
6 days ago
burtenshaw/terminus-pi-trl-qwen3-5-4b-stream-22213278
5B
•
Updated
6 days ago
•
16
burtenshaw/terminus-pi-trl-qwen3-5-4b-stream-22213260
Updated
6 days ago
burtenshaw/terminus-pi-trl-qwen3-5-4b-rollout-22213219
Updated
6 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212516
27B
•
Updated
7 days ago
•
16
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212504
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212483
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212455
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212393
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212321
27B
•
Updated
7 days ago
•
22
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212308
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212292
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212265
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212263
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212232
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-6-27b-rollout-22212226
Updated
7 days ago
burtenshaw/terminus-pi-trl-qwen3-4b-rollout-22189657
Text Generation
•
4B
•
Updated
9 days ago
•
17
burtenshaw/terminus-pi-trl-qwen3-4b-rollout-22189531
Updated
9 days ago
burtenshaw/terminus-pi-trl-qwen3-4b-rollout-22189522
Updated
9 days ago
burtenshaw/terminus-pi-trl-qwen3-4b-rollout-22189282
Updated
9 days ago
burtenshaw/terminus-pi-trl-async-grpo-qwen3-4b
Text Generation
•
4B
•
Updated
10 days ago
•
32
burtenshaw/terminus-pi-trl-qwen35-4b-200-beta
Text Generation
•
4B
•
Updated
10 days ago
•
72
burtenshaw/terminus-pi-trl-qwen35-4b-200-alpha
Text Generation
•
4B
•
Updated
10 days ago
•
99