ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-MXFP4
2B • Updated
None defined yet.
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers