ISTA-DASLab/Llama-3.1-8B-HIGGS-4bit
Text Generation • 3B • Updated
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers