New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.
AI & ML interests
Open Source AI 💚
Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats.
- unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF • Image-Text-to-Text • 31B • Updated • 141k • 75
- unsloth/Qwen3-VL-30B-A3B-Thinking-GGUF • Image-Text-to-Text • 31B • Updated • 25.3k • 32
- unsloth/Qwen3-VL-4B-Instruct-GGUF • Image-Text-to-Text • 4B • Updated • 18.2k • 33
- unsloth/Qwen3-VL-4B-Thinking-GGUF • Image-Text-to-Text • 4B • Updated • 5.01k • 18
Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes.
DeepSeek's new 3.1 update to their V3 models!
Run or fine-tune embedding models with Unsloth.
- unsloth/embeddinggemma-300m • Sentence Similarity • 0.3B • Updated • 10.5k • 7
- unsloth/embeddinggemma-300m-GGUF • Sentence Similarity • 0.3B • Updated • 5.44k • 46
- unsloth/Qwen3-Embedding-0.6B • Feature Extraction • 0.6B • Updated • 299 • 3
- unsloth/Qwen3-Embedding-4B • Feature Extraction • 4B • Updated • 437 • 1
All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats.
- unsloth/gemma-3-270m-it-GGUF • Text Generation • 0.3B • Updated • 18.1k • 147
- unsloth/gemma-3-270m-it-qat-GGUF • Text Generation • 0.3B • Updated • 6k • 11
- unsloth/gemma-3-270m-it • Text Generation • 0.3B • Updated • 32.7k • 22
- unsloth/gemma-3-270m-it-unsloth-bnb-4bit • Text Generation • 0.3B • Updated • 11.9k • 5
Google's Gemma 3n models in all versions, including Dynamic GGUF, 4-bit and 16-bit formats!
- unsloth/gemma-3n-E4B-it-GGUF • Image-Text-to-Text • 7B • Updated • 20.5k • 186
- unsloth/gemma-3n-E2B-it-GGUF • Image-Text-to-Text • 4B • Updated • 23.8k • 57
- unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit • Image-Text-to-Text • 8B • Updated • 19.2k • 9
- unsloth/gemma-3n-E4B-unsloth-bnb-4bit • Image-Text-to-Text • 8B • Updated • 859 • 4
Microsoft's Phi-4 models, including Reasoning, Reasoning Plus and mini. Includes Dynamic 2.0 GGUF, 4-bit and 16-bit versions, with Unsloth's bug fixes.
DeepSeek-V3-0324 and V3, available in original and Dynamic GGUF formats, with support for 2-bit to 8-bit quantized versions.
A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!
- unsloth/Mistral-Small-3.2-24B-Instruct-2506-GGUF • Image-Text-to-Text • 24B • Updated • 50k • 152
- unsloth/Mistral-Small-3.2-24B-Instruct-2506 • Image-Text-to-Text • 24B • Updated • 1.33k • 11
- unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8 • Image-Text-to-Text • Updated • 43 • 6
- unsloth/Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit • Image-Text-to-Text • 25B • Updated • 2.26k • 12
Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions.
Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.
Meta's Llama 3.2 vision models in 11B and 90B sizes. Includes 4-bit bnb and original versions.
- unsloth/Llama-3.2-11B-Vision-Instruct • Image-to-Text • 11B • Updated • 27.9k • 88
- unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit • Image-to-Text • 11B • Updated • 5.6k • 80
- unsloth/Llama-3.2-11B-Vision • Image-to-Text • 11B • Updated • 487 • 34
- unsloth/Llama-3.2-11B-Vision-bnb-4bit • Image-to-Text • 11B • Updated • 835 • 16
Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.
- unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit • Text Generation • 8B • Updated • 233k • 92
- unsloth/Llama-3.1-8B-Instruct-unsloth-bnb-4bit • Text Generation • 8B • Updated • 46.3k • 4
- unsloth/Meta-Llama-3.1-8B-Instruct-unsloth-bnb-4bit • Text Generation • 8B • Updated • 157k • 4
- unsloth/Meta-Llama-3.1-8B-bnb-4bit • Text Generation • 8B • Updated • 51.6k • 109
Native bitsandbytes 4-bit pre-quantized models.
- unsloth/Llama-3.2-3B-bnb-4bit • Text Generation • 3B • Updated • 27.9k • 21
- unsloth/Meta-Llama-3.1-8B-bnb-4bit • Text Generation • 8B • Updated • 51.6k • 109
- unsloth/llama-3-8b-Instruct-bnb-4bit • Text Generation • 8B • Updated • 56.4k • 133
- unsloth/gemma-2-9b-bnb-4bit • Text Generation • 10B • Updated • 8.76k • 31
Find GGUFs and other variants of diffusion based Qwen-Image and FLUX models.
OpenAI's gpt-oss-20b and gpt-oss-120b are here! These powerful open models are available in GGUF, original & 4-bit formats.
- unsloth/gpt-oss-20b-GGUF • Text Generation • 21B • Updated • 132k • 548
- unsloth/gpt-oss-120b-GGUF • Text Generation • 117B • Updated • 81.7k • 200
- unsloth/gpt-oss-20b-unsloth-bnb-4bit • Text Generation • 21B • Updated • 146k • 35
- unsloth/gpt-oss-120b-unsloth-bnb-4bit • Text Generation • 117B • Updated • 19.4k • 12
Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants.
DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
- unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF • Text Generation • 8B • Updated • 49.8k • 364
- unsloth/DeepSeek-R1-0528-GGUF • Text Generation • 671B • Updated • 4.52k • 193
- unsloth/DeepSeek-R1-0528-Qwen3-8B-unsloth-bnb-4bit • Text Generation • 8B • Updated • 6.48k • 13
- unsloth/DeepSeek-R1-0528 • Text Generation • 685B • Updated • 23 • 15
The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-480B-A35B.
- unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF • Text Generation • 31B • Updated • 104k • 406
- unsloth/Qwen3-Coder-30B-A3B-Instruct-1M-GGUF • Text Generation • 31B • Updated • 12.8k • 136
- unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF • Text Generation • 480B • Updated • 3.8k • 165
- unsloth/Qwen3-Coder-480B-A35B-Instruct-1M-GGUF • Text Generation • 480B • Updated • 1.74k • 40
IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth.
Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!
- unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF • Image-to-Text • 108B • Updated • 22.1k • 129
- unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF • Image-to-Text • 401B • Updated • 6.14k • 42
- unsloth/Llama-4-Scout-17B-16E-Instruct • Image-to-Text • 109B • Updated • 676 • 56
- unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit • Image-to-Text • 112B • Updated • 1.37k • 80
Unsloth's Dynamic 4-bit quants selectively skip quantizing certain parameters, greatly improving accuracy while using <10% more VRAM than BnB 4-bit.
- unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit • Text Generation • 5B • Updated • 37.8k • 36
- unsloth/DeepSeek-R1-Distill-Qwen-14B-unsloth-bnb-4bit • Text Generation • 15B • Updated • 2.69k • 30
- unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit • Text Generation • 5B • Updated • 3.69k • 24
- unsloth/gemma-3-12b-it-unsloth-bnb-4bit • Image-to-Text • 12B • Updated • 45.5k • 24
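To make the VRAM trade-off of skipping some parameters concrete, here is a rough back-of-the-envelope sketch. It is not Unsloth's selection method, which this page does not describe. The layer names and parameter counts are hypothetical, and the estimate counts only raw weight storage (0.5 bytes per 4-bit parameter, 2 bytes per 16-bit parameter), ignoring quantization constants and activations.

```python
# Toy estimate of weight memory when some tensors are left unquantized.
# Assumption: 4-bit weights cost 0.5 bytes/param, 16-bit weights 2 bytes/param.
BYTES_4BIT = 0.5
BYTES_16BIT = 2.0


def weight_bytes(param_counts, skip=frozenset()):
    """Total weight bytes; tensors whose names are in `skip` stay in 16-bit."""
    total = 0.0
    for name, count in param_counts.items():
        total += count * (BYTES_16BIT if name in skip else BYTES_4BIT)
    return total


def overhead_percent(param_counts, skip):
    """Extra memory of selective quantization versus quantizing everything."""
    full = weight_bytes(param_counts)
    selective = weight_bytes(param_counts, skip)
    return 100.0 * (selective - full) / full


# Hypothetical parameter counts per tensor, chosen only for illustration.
layers = {
    "attn.q_proj": 16_000_000,
    "attn.k_proj": 16_000_000,
    "mlp.up_proj": 48_000_000,
    "mlp.down_proj": 48_000_000,
    "vision.proj": 2_000_000,
}

# Suppose one small tensor is judged too sensitive to quantize.
skipped = {"vision.proj"}
print(f"{overhead_percent(layers, skipped):.1f}% more weight memory")
```

With these made-up numbers the overhead is about 4.6%: the skipped tensor holds only about 1.5% of the parameters, and because each 16-bit parameter takes four times the storage of a 4-bit one, keeping a small fraction unquantized still leaves the total under a 10% increase.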
A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now!
Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.
- unsloth/Llama-3.2-1B-Instruct-GGUF • Text Generation • 1B • Updated • 63.9k • 52
- unsloth/Llama-3.2-1B-Instruct • Text Generation • 1B • Updated • 89.2k • 87
- unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit • Text Generation • 0.8B • Updated • 54.3k • 4
- unsloth/Llama-3.2-1B-Instruct-bnb-4bit • Text Generation • 1B • Updated • 21.4k • 22
All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!
- unsloth/Qwen2.5-VL-3B-Instruct-GGUF • Image-Text-to-Text • 3B • Updated • 12.3k • 19
- unsloth/Qwen2.5-VL-7B-Instruct-GGUF • Image-Text-to-Text • 8B • Updated • 79k • 133
- unsloth/Qwen2.5-VL-32B-Instruct-GGUF • Image-Text-to-Text • 33B • Updated • 556 • 7
- unsloth/Qwen2.5-VL-72B-Instruct-GGUF • Image-Text-to-Text • 73B • Updated • 1.28k • 7
Collection of the most popular vision models, including Llama 3.2, LLaVA, Qwen2 VL, Pixtral, PaliGemma and more!
- unsloth/Llama-3.2-11B-Vision-Instruct • Image-to-Text • 11B • Updated • 27.9k • 88
- unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit • Image-to-Text • 11B • Updated • 5.6k • 80
- unsloth/Llama-3.2-11B-Vision-Instruct-unsloth-bnb-4bit • Image-to-Text • 11B • Updated • 4.52k • 28
- unsloth/Qwen2-VL-7B-Instruct-bnb-4bit • Image-Text-to-Text • 9B • Updated • 2.13k • 6
Complete collection of the code-specific Qwen2.5 model series in bnb 4-bit, 16-bit and GGUF formats.
- unsloth/Llama-3.2-3B-Instruct-bnb-4bit • Text Generation • 3B • Updated • 33.1k • 33
- unsloth/Llama-3.2-1B-Instruct-bnb-4bit • Text Generation • 1B • Updated • 21.4k • 22
- unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit • Image-to-Text • 11B • Updated • 5.6k • 80
- unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit • Text Generation • 8B • Updated • 233k • 92