Better Perplexity Alternative GGUFs
#13 opened about 3 hours ago by ubergarm
chat template is broken
2
#12 opened about 13 hours ago by grapevine-AI
Is it possible to release a version with low-bit quantization?
2
#11 opened 2 days ago by lan0004
How do I run it using Oobabooga? I'm getting the following error
1
#10 opened 2 days ago by TeaDiffusion
What are the benchmarks of the 4-bit model vs the FP8 model?
2
#9 opened 3 days ago by Grossor
Make this model more visible on the hub
🚀 5
1
#8 opened 4 days ago by victor
INT8 quantization for KVCache on DGX Spark/GB10
2
#6 opened 5 days ago by JDWarner
Int8/Q8 gguf when?
#5 opened 5 days ago by e1732a364fed
config.json file needed at root?
1
#4 opened 5 days ago by pathosethoslogos
cool model !!
👍 1
3
#3 opened 5 days ago by gopi87
great job! thanks!
#1 opened 5 days ago by semon017