Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
253.3
TFLOPS
93
10
Mukul
mtcl
Follow
Gargaz's profile picture
AlexGS74's profile picture
21world's profile picture
5 followers
·
23 following
mtcl
mtcl
AI & ML interests
None yet
Recent Activity
new
activity
4 days ago
nvidia/nemotron-3.5-asr-streaming-0.6b:
vllm support ?
new
activity
5 days ago
cyankiwi/gemma-4-12B-it-qat-AWQ-INT4:
Vllm and SgLang command please
liked
a model
6 days ago
MisoLabs/MisoTTS
View all activity
Organizations
None yet
mtcl
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
nvidia/nemotron-3.5-asr-streaming-0.6b
4 days ago
vllm support ?
👍
8
1
#6 opened 7 days ago by
sdd5125
New activity in
cyankiwi/gemma-4-12B-it-qat-AWQ-INT4
5 days ago
Vllm and SgLang command please
👍
1
2
#1 opened 5 days ago by
mtcl
liked
a model
6 days ago
MisoLabs/MisoTTS
Text-to-Speech
•
8B
•
Updated
10 days ago
•
195
New activity in
nvidia/DeepSeek-V4-Pro-NVFP4
9 days ago
nvidia/DeepSeek-V4-flash-NVFP4
6
#1 opened 16 days ago by
mtcl
New activity in
canada-quant/DeepSeek-V4-Flash-NVFP4-FP8-MTP
16 days ago
Docker Image
8
#1 opened 16 days ago by
mtcl
New activity in
unsloth/DeepSeek-V4-Flash
16 days ago
Worse than (smaller) MiniMax M2.7??
17
#2 opened about 2 months ago by
deleted
New activity in
deepseek-ai/DeepSeek-V4-Flash
about 1 month ago
Unable to run on 2x RTX Pro 6000 (DEEP_GEMM problem)
➕
10
17
#15 opened about 2 months ago by
stev236
New activity in
mistralai/Mistral-Medium-3.5-128B
about 1 month ago
Running on 2 RTX Pro 6000 Blackwell GPUs at ~30 tps (Instructions that worked for me)
👍
❤️
7
10
#17 opened about 1 month ago by
CarouselAether
New activity in
RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8
about 1 month ago
2x Nvidia 6000 Pros
3
#2 opened about 1 month ago by
mtcl
New activity in
lukealonso/MiMo-V2.5-NVFP4
about 1 month ago
Will it work on 2X6000 Pros
6
#1 opened about 1 month ago by
mtcl
New activity in
Intel/DeepSeek-V4-Flash-W4A16-AutoRound
about 2 months ago
Can I deploy it with sglang at my 8*4090 ubuntu sever?
9
#1 opened about 2 months ago by
marshal007
New activity in
nvidia/MiniMax-M2.7-NVFP4
about 2 months ago
Context Length for 2X6000 Pros (2x96 = 192GB VRAM)
3
#2 opened about 2 months ago by
mtcl
New activity in
ubergarm/Kimi-K2.6-GGUF
about 2 months ago
really awesome speeds! running at 256k context.
🔥
1
5
#11 opened about 2 months ago by
mtcl
New activity in
Qwen/Qwen3.6-27B
about 2 months ago
MOE 122b and 397b please!
🚀
24
14
#7 opened about 2 months ago by
jesleocizi
New activity in
ubergarm/Kimi-K2.6-GGUF
about 2 months ago
How to disable thinking?
4
#9 opened about 2 months ago by
Hansi2024
New activity in
demon-zombie/MiniMax-M2.7-AWQ-4bit
about 2 months ago
These are NOT actual AWQ-quantized models.
2
#1 opened about 2 months ago by
cai-cai
New activity in
NinjaBoffin/MiniMax-M2.7-NVFP4
about 2 months ago
max context
#2 opened about 2 months ago by
mtcl
New activity in
ubergarm/Kimi-K2.6-GGUF
about 2 months ago
No think tags.
10
#4 opened about 2 months ago by
DrRos
New activity in
nvidia/MiniMax-M2.5-NVFP4
about 2 months ago
Minimax M2.7 NVFP4
👀
🔥
5
4
#4 opened 2 months ago by
mtcl
New activity in
lukealonso/MiniMax-M2.7-NVFP4
about 2 months ago
Unable to use full 192k context in SGLang with MiniMax-M2.7-NVFP4 (runtime capped at ~80,964 tokens)
3
#9 opened about 2 months ago by
mtcl
Load more