Active filters: sglang
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation • 14B • Updated • 352k • 7
Image-Text-to-Text • 138B • Updated • 3.45k • 2
QuantTrio/Qwen3-Coder-Next-E400
Text Generation • 63B • Updated • 1.19k • 1
SurfaceData/llava-v1.6-mistral-7b-sglang
Image-Text-to-Text • 8B • Updated • 4 • 9
SurfaceData/llava-v1.6-vicuna-7b-sglang
Image-Text-to-Text • 7B • Updated • 3 • 1
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation • 73B • Updated • 22 • 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation • 69B • Updated • 52
alvarobartt/grok-2-tokenizer
Text Generation • Updated • 14 • 3
VibeStudio/MiniMax-M2-THRIFT
173B • Updated • 1.98k • 35
mradermacher/MiniMax-M2-THRIFT-GGUF
JasmineBBB/Kimi-Linear-48B-A3B-Instruct-bnb-4bit
Text Generation • 49B • Updated • 9 • 1
mradermacher/MiniMax-M2-THRIFT-i1-GGUF
173B • Updated • 203 • 10
bartowski/VibeStudio_MiniMax-M2-THRIFT-GGUF
Text Generation • 173B • Updated • 301 • 8
VibeStudio/MiniMax-M2-THRIFT-55
106B • Updated • 172 • 5
JinnP/SGLang-EAGLE3-Qwen3-Coder-30B-A3B-Instruct
Text Generation • 0.2B • Updated • 178 • 1
mradermacher/MiniMax-M2-THRIFT-55-GGUF
106B • Updated • 28 • 2
mradermacher/MiniMax-M2-THRIFT-55-i1-GGUF
106B • Updated • 404 • 2
VibeStudio/MiniMax-M2-THRIFT-55-MLX-4bit
106B • Updated • 149 • 2
VibeStudio/MiniMax-M2-THRIFT-55-MLX-6bit
106B • Updated • 114
Doradus-AI/MiroThinker-v1.0-30B-FP8
Text Generation • 31B • Updated • 15 • 4
Doradus-AI/Hermes-4.3-36B-FP8
Text Generation • 36B • Updated • 79 • 2
Doradus-AI/RnJ-1-Instruct-FP8
Text Generation • 9B • Updated • 4 • 4
QuantTrio/Qwen3-Coder-Next-E336
Text Generation • 53B • Updated • 85