RedHatAI/Meta-Llama-3.1-70B-FP8 • Text Generation • 71B • Updated • 602 • 2
RedHatAI/Mistral-Large-Instruct-2407-FP8 • Text Generation • 123B • Updated • 309
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w8a16 • Text Generation • 19B • Updated • 264 • 5
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8 • Text Generation • 8B • Updated • 193k • 43
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w8a8 • Text Generation • 7B • Updated • 15 • 2
RedHatAI/Qwen2-72B-Instruct-quantized.w8a8 • Text Generation • 73B • Updated • 26 • 2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w8a8 • Text Generation • 71B • Updated • 40
RedHatAI/Qwen2-7B-Instruct-quantized.w8a8 • Text Generation • 8B • Updated • 20
RedHatAI/Phi-3-medium-128k-instruct-quantized.w4a16 • Text Generation • 2B • Updated • 795 • 3
RedHatAI/Qwen2-0.5B-Instruct-quantized.w8a8 • Text Generation • 0.6B • Updated • 121
RedHatAI/Phi-3-mini-128k-instruct-quantized.w4a16 • Text Generation • 0.7B • Updated • 49 • 1
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a8 • Text Generation • 2B • Updated • 270
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a8 • Text Generation • 4B • Updated • 27
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w8a8 • Text Generation • 8B • Updated • 4.05k • 2
RedHatAI/Llama-2-7b-chat-quantized.w8a8 • Text Generation • 7B • Updated • 46 • 1
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a16 • Text Generation • 1B • Updated • 14
RedHatAI/Phi-3-mini-128k-instruct-FP8 • Text Generation • 4B • Updated • 53
RedHatAI/Llama-3.2-3B-Instruct-FP8-dynamic • Text Generation • 4B • Updated • 54 • 3
RedHatAI/Llama-3.2-1B-Instruct-FP8-dynamic • Text Generation • 1B • Updated • 438k • 3
RedHatAI/gemma-2-9b-it-quantized.w8a8 • Text Generation • 10B • Updated • 24 • 2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a8 • Text Generation • 14B • Updated • 16 • 2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a16 • Text Generation • 4B • Updated • 15 • 2
RedHatAI/Phi-3-medium-128k-instruct-FP8 • Text Generation • 14B • Updated • 21 • 5
RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a16 • 9B • Updated • 7
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a16 • 3B • Updated • 3.33k
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a16 • 0.4B • Updated • 9
RedHatAI/Qwen2.5-72B-Instruct-quantized.w8a8 • 73B • Updated • 30
RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a8 • 33B • Updated • 1.86k
RedHatAI/Qwen2.5-32B-quantized.w8a8 • 33B • Updated • 6
RedHatAI/Meta-Llama-3.1-405B-Instruct-FP8 • Text Generation • 406B • Updated • 347 • 31