Unsloth AI

Team

company

Verified

https://unsloth.ai

UnslothAI

unslothai

AI & ML interests

Open Source AI 💚

Recent Activity

danielhanchen new activity 32 minutes ago

unsloth/ERNIE-4.5-21B-A3B-PT-GGUF:update the gguf

danielhanchen updated a model 37 minutes ago

unsloth/GLM-4.7-Flash

shimmyshimmer new activity about 17 hours ago

unsloth/GLM-4.7-Flash-GGUF:High CPU Usage / Slow Context Processing

unsloth's collections (30)

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.

unsloth/GLM-4.7-Flash-GGUF

Text Generation • 30B • Updated 1 day ago • 196k • 313
unsloth/Nemotron-3-Nano-30B-A3B-GGUF

Text Generation • 32B • Updated 25 days ago • 103k • 233
unsloth/GLM-4.7-GGUF

Text Generation • 358B • Updated 29 days ago • 128k • 184
unsloth/MiniMax-M2.1-GGUF

Text Generation • 229B • Updated 29 days ago • 147k • 154

Qwen3-VL

Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats.

unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF

Image-Text-to-Text • 31B • Updated 24 days ago • 141k • 75
unsloth/Qwen3-VL-30B-A3B-Thinking-GGUF

Image-Text-to-Text • 31B • Updated 24 days ago • 25.3k • 32
unsloth/Qwen3-VL-4B-Instruct-GGUF

Image-Text-to-Text • 4B • Updated Oct 31, 2025 • 18.2k • 33
unsloth/Qwen3-VL-4B-Thinking-GGUF

Image-Text-to-Text • 4B • Updated Oct 31, 2025 • 5.01k • 18

Ministral 3

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes.

unsloth/Ministral-3-14B-Instruct-2512-GGUF

14B • Updated Dec 4, 2025 • 22.4k • 58
unsloth/Ministral-3-14B-Reasoning-2512-GGUF

14B • Updated Dec 4, 2025 • 13.1k • 33
unsloth/Ministral-3-8B-Instruct-2512-GGUF

8B • Updated Dec 4, 2025 • 13.4k • 16
unsloth/Ministral-3-8B-Reasoning-2512-GGUF

8B • Updated Dec 4, 2025 • 5.24k • 7

DeepSeek-V3.1

DeepSeek's new 3.1 update to their V3 models!

unsloth/DeepSeek-V3.1-Terminus-GGUF

671B • Updated Sep 24, 2025 • 9.32k • 67
unsloth/DeepSeek-V3.1-GGUF

671B • Updated Sep 22, 2025 • 9.46k • 93
unsloth/DeepSeek-V3.1

Text Generation • 685B • Updated Aug 21, 2025 • 18 • 3
unsloth/DeepSeek-V3.1-BF16

Text Generation • 684B • Updated Aug 21, 2025 • 264 • 1

Embedding Models

Run or fine-tune embedding models with Unsloth.

unsloth/embeddinggemma-300m

Sentence Similarity • 0.3B • Updated 3 days ago • 10.5k • 7
unsloth/embeddinggemma-300m-GGUF

Sentence Similarity • 0.3B • Updated Sep 4, 2025 • 5.44k • 46
unsloth/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated 3 days ago • 299 • 3
unsloth/Qwen3-Embedding-4B

Feature Extraction • 4B • Updated 3 days ago • 437 • 1

Gemma 3

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats.

unsloth/gemma-3-270m-it-GGUF

Text Generation • 0.3B • Updated Aug 15, 2025 • 18.1k • 147
unsloth/gemma-3-270m-it-qat-GGUF

Text Generation • 0.3B • Updated Aug 15, 2025 • 6k • 11
unsloth/gemma-3-270m-it

Text Generation • 0.3B • Updated Aug 14, 2025 • 32.7k • 22
unsloth/gemma-3-270m-it-unsloth-bnb-4bit

Text Generation • 0.3B • Updated Aug 14, 2025 • 11.9k • 5

Gemma 3n

Google's Gemma 3n models, all versions, including Dynamic GGUF, 4-bit, and 16-bit formats!

unsloth/gemma-3n-E4B-it-GGUF

Image-Text-to-Text • 7B • Updated Jun 30, 2025 • 20.5k • 186
unsloth/gemma-3n-E2B-it-GGUF

Image-Text-to-Text • 4B • Updated Jul 17, 2025 • 23.8k • 57
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit

Image-Text-to-Text • 8B • Updated Jul 11, 2025 • 19.2k • 9
unsloth/gemma-3n-E4B-unsloth-bnb-4bit

Image-Text-to-Text • 8B • Updated Jul 11, 2025 • 859 • 4

Phi-4 (All Versions)

Microsoft's Phi-4 models, including Reasoning, Reasoning Plus, and mini. Includes Dynamic 2.0 GGUF, 4-bit, and 16-bit versions, plus Unsloth's bug fixes.

unsloth/Phi-4-reasoning-plus-GGUF

Text Generation • 15B • Updated May 1, 2025 • 3.56k • 77
unsloth/Phi-4-mini-reasoning-GGUF

Text Generation • 4B • Updated May 1, 2025 • 5.4k • 57
unsloth/Phi-4-reasoning-GGUF

Text Generation • 15B • Updated May 1, 2025 • 1.57k • 19
unsloth/phi-4-GGUF

Text Generation • 15B • Updated Jan 13, 2025 • 3.73k • 181

DeepSeek V3 (All Versions)

DeepSeek-V3-0324 and V3, available in original and Dynamic GGUF formats, with support for 2-bit to 8-bit quantized versions.

unsloth/DeepSeek-V3-0324-GGUF-UD

Text Generation • 671B • Updated Apr 28, 2025 • 1.02k • 21
unsloth/DeepSeek-V3-0324-GGUF

Text Generation • 671B • Updated May 22, 2025 • 4k • 197
unsloth/DeepSeek-V3-0324

Text Generation • 684B • Updated Apr 21, 2025 • 13 • 7
unsloth/DeepSeek-V3-0324-BF16

Text Generation • 684B • Updated Jul 14, 2025 • 28.3k • 4

Mistral Small 3 (All Versions)

A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!

unsloth/Mistral-Small-3.2-24B-Instruct-2506-GGUF

Image-Text-to-Text • 24B • Updated Aug 26, 2025 • 50k • 152
unsloth/Mistral-Small-3.2-24B-Instruct-2506

Image-Text-to-Text • 24B • Updated Aug 26, 2025 • 1.33k • 11
unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8

Image-Text-to-Text • Updated Jun 21, 2025 • 43 • 6
unsloth/Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit

Image-Text-to-Text • 25B • Updated Jun 23, 2025 • 2.26k • 12

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions.

unsloth/Llama-3.3-70B-Instruct-GGUF

Text Generation • 71B • Updated May 10, 2025 • 8.54k • 91
unsloth/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Nov 25, 2025 • 2.66k • 48
unsloth/Llama-3.3-70B-Instruct-bnb-4bit

Text Generation • 71B • Updated Nov 25, 2025 • 9.19k • 52

Qwen QwQ-32B Collection

Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.

unsloth/QwQ-32B-GGUF

Text Generation • 33B • Updated Apr 27, 2025 • 1.05k • 86
unsloth/QwQ-32B-unsloth-bnb-4bit

Text Generation • 34B • Updated Mar 7, 2025 • 693 • 47
unsloth/QwQ-32B

Text Generation • 33B • Updated Apr 27, 2025 • 25 • 17
unsloth/QwQ-32B-bnb-4bit

Text Generation • 34B • Updated Mar 5, 2025 • 125 • 4

Llama 3.2 Vision

Meta's Llama 3.2 vision models in 11B and 90B sizes. Includes 4-bit bnb and original versions.

unsloth/Llama-3.2-11B-Vision-Instruct

Image-to-Text • 11B • Updated Dec 10, 2024 • 27.9k • 88
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit

Image-to-Text • 11B • Updated Dec 10, 2024 • 5.6k • 80
unsloth/Llama-3.2-11B-Vision

Image-to-Text • 11B • Updated Nov 22, 2024 • 487 • 34
unsloth/Llama-3.2-11B-Vision-bnb-4bit

Image-to-Text • 11B • Updated Nov 22, 2024 • 835 • 16

Llama 3.1 Collection

Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.

unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 233k • 92
unsloth/Llama-3.1-8B-Instruct-unsloth-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 46.3k • 4
unsloth/Meta-Llama-3.1-8B-Instruct-unsloth-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 157k • 4
unsloth/Meta-Llama-3.1-8B-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 51.6k • 109

Load 4bit models 4x faster

Native bitsandbytes 4-bit pre-quantized models.

unsloth/Llama-3.2-3B-bnb-4bit

Text Generation • 3B • Updated Jun 2, 2025 • 27.9k • 21
unsloth/Meta-Llama-3.1-8B-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 51.6k • 109
unsloth/llama-3-8b-Instruct-bnb-4bit

Text Generation • 8B • Updated Nov 22, 2024 • 56.4k • 133
unsloth/gemma-2-9b-bnb-4bit

Text Generation • 10B • Updated Jul 22, 2025 • 8.76k • 31
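
As a rough illustration of how the pre-quantized bitsandbytes checkpoints listed above might be loaded, here is a minimal sketch using the Hugging Face `transformers` library. It assumes `transformers`, `bitsandbytes`, and `accelerate` are installed, that a CUDA GPU is available, and that the repository stores its own quantization settings so none need to be passed by hand. The helper name `load_prequantized` is hypothetical and not part of any library.

```python
def load_prequantized(repo_id: str = "unsloth/Llama-3.2-3B-bnb-4bit"):
    """Load a pre-quantized 4-bit checkpoint together with its tokenizer.

    The imports are deferred so that defining this helper does not itself
    require transformers to be installed.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # Assumption: the checkpoint's config already carries the bitsandbytes
    # quantization settings, so the weights are loaded as 4-bit without
    # re-quantizing a 16-bit model at load time.
    model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")
    return model, tokenizer
```

Calling `load_prequantized()` downloads the weights on first use, so it needs network access and enough GPU memory for the chosen model.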

Unsloth Diffusion GGUFs

Find GGUFs and other variants of diffusion-based Qwen-Image and FLUX models.

unsloth/Qwen-Image-2512-GGUF

Text-to-Image • 20B • Updated 18 days ago • 131k • 275
unsloth/LTX-2-GGUF

Image-to-Video • 19B • Updated 3 days ago • 22.4k • 77
unsloth/FLUX.2-klein-9B-GGUF

Image-to-Image • 9B • Updated 9 days ago • 35.6k • 57
unsloth/Qwen-Image-Edit-2511-GGUF

Image-to-Image • 20B • Updated 16 days ago • 174k • 316

gpt-oss

OpenAI's gpt-oss-20b and gpt-oss-120b are here! The powerful open models are available in GGUF, original, and 4-bit formats.

unsloth/gpt-oss-20b-GGUF

Text Generation • 21B • Updated Dec 19, 2025 • 132k • 548
unsloth/gpt-oss-120b-GGUF

Text Generation • 117B • Updated Aug 25, 2025 • 81.7k • 200
unsloth/gpt-oss-20b-unsloth-bnb-4bit

Text Generation • 21B • Updated Aug 8, 2025 • 146k • 35
unsloth/gpt-oss-120b-unsloth-bnb-4bit

Text Generation • 117B • Updated Aug 8, 2025 • 19.4k • 12

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants.

unsloth/Qwen3-30B-A3B-Instruct-2507-GGUF

31B • Updated Jul 31, 2025 • 35.6k • 280
unsloth/Qwen3-4B-Instruct-2507-GGUF

4B • Updated Aug 20, 2025 • 61.2k • 133
unsloth/Qwen3-4B-Thinking-2507-GGUF

4B • Updated Sep 11, 2025 • 11.9k • 86
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF

Text Generation • 480B • Updated Jul 31, 2025 • 3.8k • 165

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.

unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF

Text Generation • 8B • Updated Jun 16, 2025 • 49.8k • 364
unsloth/DeepSeek-R1-0528-GGUF

Text Generation • 671B • Updated Jun 15, 2025 • 4.52k • 193
unsloth/DeepSeek-R1-0528-Qwen3-8B-unsloth-bnb-4bit

Text Generation • 8B • Updated Jun 10, 2025 • 6.48k • 13
unsloth/DeepSeek-R1-0528

Text Generation • 685B • Updated Jun 10, 2025 • 23 • 15

Qwen3-Coder

The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-480B-A35B.

unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF

Text Generation • 31B • Updated Aug 8, 2025 • 104k • 406
unsloth/Qwen3-Coder-30B-A3B-Instruct-1M-GGUF

Text Generation • 31B • Updated Aug 5, 2025 • 12.8k • 136
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF

Text Generation • 480B • Updated Jul 31, 2025 • 3.8k • 165
unsloth/Qwen3-Coder-480B-A35B-Instruct-1M-GGUF

Text Generation • 480B • Updated Jul 23, 2025 • 1.74k • 40

Granite 4.0

IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth.

unsloth/granite-4.0-350m-GGUF

0.4B • Updated Oct 28, 2025 • 1.03k • 4
unsloth/granite-4.0-h-350m-GGUF

0.3B • Updated Oct 28, 2025 • 1.35k • 8
unsloth/granite-4.0-h-1b-GGUF

1B • Updated Oct 28, 2025 • 1.82k • 14
unsloth/granite-4.0-1b-GGUF

2B • Updated Oct 28, 2025 • 750 • 3

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!

unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF

Image-to-Text • 108B • Updated Jun 17, 2025 • 22.1k • 129
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF

Image-to-Text • 401B • Updated Jun 18, 2025 • 6.14k • 42
unsloth/Llama-4-Scout-17B-16E-Instruct

Image-to-Text • 109B • Updated Jun 17, 2025 • 676 • 56
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit

Image-to-Text • 112B • Updated Apr 12, 2025 • 1.37k • 80

Unsloth 4-bit Dynamic Quants

Unsloth's Dynamic 4-bit quants selectively skip quantizing certain parameters, greatly improving accuracy while using <10% more VRAM than BnB 4-bit.

unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit

Text Generation • 5B • Updated Jul 18, 2025 • 37.8k • 36
unsloth/DeepSeek-R1-Distill-Qwen-14B-unsloth-bnb-4bit

Text Generation • 15B • Updated Feb 14, 2025 • 2.69k • 30
unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit

Text Generation • 5B • Updated Feb 14, 2025 • 3.69k • 24
unsloth/gemma-3-12b-it-unsloth-bnb-4bit

Image-to-Text • 12B • Updated May 12, 2025 • 45.5k • 24
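
To make the "<10% more VRAM" figure in the collection description concrete, the sketch below works through the arithmetic of keeping a small fraction of parameters in 16-bit while quantizing the rest to 4-bit. It is a back-of-the-envelope model only: it counts weight storage, ignores quantization metadata such as scales, and says nothing about which parameters Unsloth actually skips. The function names are hypothetical.

```python
# Bytes per parameter for each storage format. Quantization metadata
# (scales, zero points) is ignored, which is a simplifying assumption.
BYTES_4BIT = 0.5
BYTES_16BIT = 2.0


def mixed_precision_bytes(num_params: int, skipped_fraction: float) -> float:
    """Weight memory when `skipped_fraction` of parameters stay in 16-bit."""
    if not 0.0 <= skipped_fraction <= 1.0:
        raise ValueError("skipped_fraction must be between 0 and 1")
    kept_16bit = num_params * skipped_fraction
    quantized = num_params - kept_16bit
    return quantized * BYTES_4BIT + kept_16bit * BYTES_16BIT


def overhead_vs_full_4bit(skipped_fraction: float) -> float:
    """Relative extra memory compared with quantizing every parameter."""
    baseline = mixed_precision_bytes(1_000_000, 0.0)
    mixed = mixed_precision_bytes(1_000_000, skipped_fraction)
    return mixed / baseline - 1.0


# Largest skipped fraction that keeps the overhead at 10%:
# 0.5 + 1.5 * f = 0.55  =>  f = 1/30, roughly 3.3% of parameters.
MAX_SKIPPED_FRACTION = 0.1 * BYTES_4BIT / (BYTES_16BIT - BYTES_4BIT)
```

Under these assumptions the overhead is three times the skipped fraction, so leaving 2% of parameters in 16-bit costs about 6% more weight memory than full 4-bit quantization.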

Text-to-Speech (TTS) models

A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now!

unsloth/orpheus-3b-0.1-ft-GGUF

Text-to-Speech • 3B • Updated Jul 9, 2025 • 1.47k • 11
unsloth/orpheus-3b-0.1-ft-unsloth-bnb-4bit

Text-to-Speech • 3B • Updated Mar 24, 2025 • 35.6k • 16
unsloth/csm-1b

Text-to-Speech • 2B • Updated May 15, 2025 • 6.11k • 19
unsloth/whisper-large-v3

Automatic Speech Recognition • 2B • Updated May 14, 2025 • 5.98k • 14

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.

unsloth/Llama-3.2-1B-Instruct-GGUF

Text Generation • 1B • Updated May 9, 2025 • 63.9k • 52
unsloth/Llama-3.2-1B-Instruct

Text Generation • 1B • Updated May 9, 2025 • 89.2k • 87
unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit

Text Generation • 0.8B • Updated Apr 26, 2025 • 54.3k • 4
unsloth/Llama-3.2-1B-Instruct-bnb-4bit

Text Generation • 1B • Updated Jan 23, 2025 • 21.4k • 22

Qwen2.5-VL (All Versions)

All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!

unsloth/Qwen2.5-VL-3B-Instruct-GGUF

Image-Text-to-Text • 3B • Updated May 12, 2025 • 12.3k • 19
unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12, 2025 • 79k • 133
unsloth/Qwen2.5-VL-32B-Instruct-GGUF

Image-Text-to-Text • 33B • Updated May 12, 2025 • 556 • 7
unsloth/Qwen2.5-VL-72B-Instruct-GGUF

Image-Text-to-Text • 73B • Updated May 18, 2025 • 1.28k • 7

Vision/multimodal Models

Collection of the most popular vision models, including Llama 3.2, LLaVA, Qwen2 VL, Pixtral, PaliGemma and more!

unsloth/Llama-3.2-11B-Vision-Instruct

Image-to-Text • 11B • Updated Dec 10, 2024 • 27.9k • 88
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit

Image-to-Text • 11B • Updated Dec 10, 2024 • 5.6k • 80
unsloth/Llama-3.2-11B-Vision-Instruct-unsloth-bnb-4bit

Image-to-Text • 11B • Updated Dec 4, 2024 • 4.52k • 28
unsloth/Qwen2-VL-7B-Instruct-bnb-4bit

Image-Text-to-Text • 9B • Updated Nov 22, 2024 • 2.13k • 6

Qwen 2.5 Coder

Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats.

unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF

33B • Updated Nov 15, 2024 • 1.41k • 74
unsloth/Qwen2.5-Coder-14B-Instruct-128K-GGUF

15B • Updated Nov 14, 2024 • 1.45k • 34
unsloth/Qwen2.5-Coder-7B-Instruct-128K-GGUF

8B • Updated Nov 14, 2024 • 2.11k • 20
unsloth/Qwen2.5-Coder-3B-Instruct-128K-GGUF

3B • Updated Nov 15, 2024 • 538 • 14

Qwen 2.5

unsloth/Qwen2.5-7B-Instruct-bnb-4bit

Text Generation • 8B • Updated Apr 28, 2025 • 78.1k • 19
unsloth/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Apr 28, 2025 • 49.9k • 21
unsloth/Qwen2.5-14B-bnb-4bit

Text Generation • 15B • Updated Apr 28, 2025 • 1.38k • 5
unsloth/Qwen2.5-7B-bnb-4bit

Text Generation • 8B • Updated Apr 28, 2025 • 6.6k • 6

4bit Instruct Models

unsloth/Llama-3.2-3B-Instruct-bnb-4bit

Text Generation • 3B • Updated Jun 2, 2025 • 33.1k • 33
unsloth/Llama-3.2-1B-Instruct-bnb-4bit

Text Generation • 1B • Updated Jan 23, 2025 • 21.4k • 22
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit

Image-to-Text • 11B • Updated Dec 10, 2024 • 5.6k • 80
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 233k • 92