AdithyaSK/data-agent-eval-traces
1.34 GB
Explore model evaluation results with interactive heatmaps
Building and scaling RL environments for LLM training
Single-tool E2B-backed coding environment
SETA-style multi-tool coding environment backed by E2B
Stateful Jupyter notebook environment backed by E2B
Cloud Linux desktop with computer-use tools, exposed via ORS