inference-optimization/Qwen3-8B-speculator.dflash.fullattn-qwen235b-instruct-bs16-ckpt0 2B • Updated about 9 hours ago
inference-optimization/Qwen3-8B-speculator.dflash.fullattn-qwen235b-instruct-bs16-ckpt0 2B • Updated about 9 hours ago
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt4 2B • Updated 5 days ago • 21
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt4 2B • Updated 5 days ago • 21
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt3 2B • Updated 6 days ago • 225
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt3 2B • Updated 6 days ago • 225
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step210040 2B • Updated 7 days ago • 277
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step210040 2B • Updated 7 days ago • 277
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt2 2B • Updated 7 days ago • 7
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt2 2B • Updated 7 days ago • 7
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step189036 2B • Updated 10 days ago • 129
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt5 0.6B • Updated 10 days ago • 1.11k
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step189036 2B • Updated 10 days ago • 129
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt0 2B • Updated 10 days ago • 112
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt0 2B • Updated 10 days ago • 112
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step126024 2B • Updated 11 days ago • 330
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step84016 2B • Updated 11 days ago • 122
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step84016 2B • Updated 11 days ago • 122