11 10

Sdeerk

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

upvoted an article 3 days ago

Ulysses Sequence Parallelism: Training with Million-Token Contexts

liked a Space 2 months ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

liked a Space 2 months ago

The Smol Training Playbook

📚

3.08k

The secrets to building world-class LLMs

liked a Space 6 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.32k

Read a detailed overview of the FineWeb web‑scale text dataset

liked a model 6 months ago

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 10 days ago • 7.48k • 1.58k

liked a model 7 months ago

baidu/ERNIE-4.5-21B-A3B-Thinking

Text Generation • 22B • Updated Nov 26, 2025 • 651 • 777

liked 2 datasets 8 months ago

Jofthomas/hermes-function-calling-thinking-V1

Viewer • Updated Feb 16, 2025 • 3.57k • 412 • 74

NousResearch/hermes-function-calling-v1

Viewer • Updated Jan 3 • 11.6k • 9.8k • 391

liked 2 Spaces 9 months ago

Awesome O1 R1

💻

[Keep updating]Collect everything about o1 and r1!

The Ultra-Scale Playbook

🌌

3.76k

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 9 months ago

openai/gsm8k

Benchmark • Updated 13 days ago • 17.6k • 758k • 1.23k

liked a dataset 11 months ago

K-and-K/knights-and-knaves

Viewer • Updated Oct 31, 2024 • 6.9k • 794 • 35