Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated a model 3 days ago
RedHatAI/gemma-4-26B-A4B-it-NVFP4 upvoted a paper about 2 months ago
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation new activity about 2 months ago
GadflyII/GLM-4.7-Flash-MXFP4:Update MXFP4 format to compressed-tensors