2-layer truncated models used for Bertha CI regression tests.
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 52
hyper-accel/ci-random-qwen2-moe-a3b
Text Generation • 2B • Updated • 584
hyper-accel/qwen2-moe-a3b-2layer
2B • Updated • 5
hyper-accel/ci-2layer-llama2-7b
0.7B • Updated • 2.32k • 1
hyper-accel/ci-random-bfloat16-llama3-3b
Text Generation • 0.6B • Updated • 268
hyper-accel/ci-random-llama3-3b
Text Generation • 0.6B • Updated • 88
hyper-accel/ci-random-bfloat16-llama3-8b
Text Generation • 1B • Updated • 7
hyper-accel/ci-random-bfloat16-llama2-7b
Text Generation • 0.7B • Updated • 4
hyper-accel/ci-random-solar-100b
Text Generation • 6B • Updated • 6
hyper-accel/tiny-random-nemotron-h
Text Generation • 0.1B • Updated • 9
hyper-accel/tiny-random-minimax-m25
Text Generation • 0.2B • Updated • 8
datasets 0
None public yet