Nous-1-4B-f32-GGUF
Nous-V1 4B is a cutting-edge 4 billion parameter language model developed by Apexion AI, based on the architecture of Qwen3-4B. Designed for versatility across diverse NLP tasks, Nous-V1 4B delivers strong performance in conversational AI, knowledge reasoning, code generation, and content creation.
Model Files
| File Name | Quant Type | Size |
|---|---|---|
| Nous-1-4B.BF16.gguf | BF16 | 8.05 GB |
| Nous-1-4B.F16.gguf | F16 | 8.05 GB |
| Nous-1-4B.F32.gguf | F32 | 16.1 GB |
| Nous-1-4B.Q2_K.gguf | Q2_K | 1.67 GB |
| Nous-1-4B.Q3_K_L.gguf | Q3_K_L | 2.24 GB |
| Nous-1-4B.Q3_K_M.gguf | Q3_K_M | 2.08 GB |
| Nous-1-4B.Q3_K_S.gguf | Q3_K_S | 1.89 GB |
| Nous-1-4B.Q4_K_M.gguf | Q4_K_M | 2.5 GB |
| Nous-1-4B.Q4_K_S.gguf | Q4_K_S | 2.38 GB |
| Nous-1-4B.Q5_K_M.gguf | Q5_K_M | 2.89 GB |
| Nous-1-4B.Q5_K_S.gguf | Q5_K_S | 2.82 GB |
| Nous-1-4B.Q6_K.gguf | Q6_K | 3.31 GB |
| Nous-1-4B.Q8_0.gguf | Q8_0 | 4.28 GB |
Quants Usage
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):
- Downloads last month
- 26
Hardware compatibility
Log In
to view the estimation
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
32-bit
