Deepseek deepseek-ai/DeepSeek-V4-Pro Text Generation • 862B • Updated 13 days ago • 1.24M • 5.15k deepseek-ai/DeepSeek-V3 Text Generation • 685B • Updated Mar 27, 2025 • 1.1M • • 4.09k
transformer KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11, 2025 • 40
KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11, 2025 • 40
glm nvidia/GLM-5.2-NVFP4 Text Generation • 381B • Updated 8 days ago • 237k • 228 huihui-ai/Huihui-GLM-5.2-abliterated-GGUF Text Generation • 754B • Updated 5 days ago • 4.7k • 162
video Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published Apr 22, 2025 • 14
Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published Apr 22, 2025 • 14
microsoft phi 4 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 541k • 1.61k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 541k • 1.61k
foolingaround google/flan-t5-large 0.8B • Updated Jul 17, 2023 • 510k • 890 stepfun-ai/GOT-OCR-2.0-hf Image-Text-to-Text • 0.6B • Updated Jan 31, 2025 • 174k • 235
Deepseek deepseek-ai/DeepSeek-V4-Pro Text Generation • 862B • Updated 13 days ago • 1.24M • 5.15k deepseek-ai/DeepSeek-V3 Text Generation • 685B • Updated Mar 27, 2025 • 1.1M • • 4.09k
glm nvidia/GLM-5.2-NVFP4 Text Generation • 381B • Updated 8 days ago • 237k • 228 huihui-ai/Huihui-GLM-5.2-abliterated-GGUF Text Generation • 754B • Updated 5 days ago • 4.7k • 162
transformer KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11, 2025 • 40
KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11, 2025 • 40
video Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published Apr 22, 2025 • 14
Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published Apr 22, 2025 • 14
microsoft phi 4 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 541k • 1.61k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 541k • 1.61k
foolingaround google/flan-t5-large 0.8B • Updated Jul 17, 2023 • 510k • 890 stepfun-ai/GOT-OCR-2.0-hf Image-Text-to-Text • 0.6B • Updated Jan 31, 2025 • 174k • 235