ISTA-DASLab/Kimi-K2.6-2Bit-GSQ
Image-Text-to-Text • 84B • Updated • 66
None defined yet.
MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning
GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling