ISTA-DASLab/Meta-Llama-3-8B-AQLM-PV-2Bit-1x16
Text Generation • 2B • Updated • 47 • 4
None defined yet.
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers