Inference Optimization

AI & ML interests

None defined yet.

Recent Activity

RelaxingSnorlax  updated a model 29 minutes ago
inference-optimization/Qwen3.5-0.8B-responses
RelaxingSnorlax  published a model 29 minutes ago
inference-optimization/Qwen3.5-0.8B-responses
MeganEFlynn  updated a model 44 minutes ago
inference-optimization/DFlash-SWA-Causal-Qwen3-8B-Magpie-Ultrachat

Members: Alexandre Marques, Megan Flynn, Dipika, Krishna Teja Chitty-Venkata, Helen Zhao, Fynn Schmitt-Ulms, Neural Magic Research, Chibueze Ukachi, Eldar Kurtić, Rahul Tuli, Kyle Sayers, Brian Dellabetta, Linghao Kong, Michael Goin, Reed Meyerson, HDCharles, Orestis Zambounis, Ryan Fernandes, Weifan Jiang

inference-optimization's models (369)

inference-optimization/Qwen3-Next-80B-A3B-Instruct-quantized.w8a8

Updated Dec 9, 2025

inference-optimization/Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_DYNAMIC-gate_up_proj-all

7B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_DYNAMIC-down_proj-all

6B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_DYNAMIC-qkv_proj-all

5B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_DYNAMIC-out_proj-all

5B • Updated Dec 4, 2025 • 2

inference-optimization/Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_BLOCK-gate_up_proj-all

7B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_BLOCK-down_proj-all

6B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_BLOCK-qkv_proj-all

5B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_BLOCK-out_proj-all

5B • Updated Dec 4, 2025 • 1