nm-testing/Apertus-70B-Instruct-2509-NVFP4
41B
•
Updated
nm-testing/Apertus-8B-Instruct-2509-NVFP4
5B
•
Updated
nm-testing/Llama-4-Scout-17B-16E-Instruct-FP8-BLOCK
108B
•
Updated
nm-testing/tinysmokellama-3.2
354k
•
Updated
•
72.8k
nm-testing/Qwen3-Next-80B-A3B-Instruct-NVFP4
Updated
•
1.46k
•
2
nm-testing/Llama-3.2-1B-Instruct-quip-w4a16
0.8B
•
Updated
•
2.15k
nm-testing/Llama-3.2-1B-Instruct-group-activations
1B
•
Updated
•
2
nm-testing/qwen3-80b-fp8-dynamic
80B
•
Updated
•
1
nm-testing/gemma-3-4b-it-s_q-W4A8-G512
5B
•
Updated
nm-testing/llama3.3-70B-speculators.09-10-2025-eagle3
2B
•
Updated
•
1
nm-testing/Llama-3.2-1B-Instruct-quipv-w4a16
0.7B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-quip
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2-online
0.7B
•
Updated
nm-testing/Qwen3-Coder-30B-A3B-Instruct-W4A16-awq
5B
•
Updated
•
628
•
3
nm-testing/llama4-scout-17b-eagle3-dummy-drafter
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2R4-w4a16
0.7B
•
Updated
•
2.18k
nm-testing/Llama-3.1-8B-Instruct-quip-w4a16
2B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3-FP8_asym-attn
8B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3
8B
•
Updated
nm-testing/gemma-3n-2b-quantized.w4a16-test
4B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-FP8-Dynamic
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-FP8-Dynamic
0.8B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-hadamard-w4a16
0.7B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-hadamard-w4a16
0.7B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-eye-w4a16
0.7B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-eye-w4a16
0.7B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-quip-w4a16
nm-testing/gemma-3n-E2B-it-W4A16-G128
4B
•
Updated
•
1
nm-testing/block-quantization-fp8-qwen3-0.6B
0.8B
•
Updated
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
•
1.0B
•
Updated
•
2