which rocm vllm docker to use for AMD GPU?
#58 opened about 5 hours ago
by
vivekag-ai
Less Context Length than Expected (600k)
#57 opened about 6 hours ago
by
Forcewithme
Add Terminal-Bench evaluation result (52.4%)
#56 opened about 7 hours ago
by
burtenshaw
Quick question: is Qwen/Qwen2.5-Math-1.5B-Instruct derived from Qwen/Qwen2.5-Math-1.5B?
1
#55 opened 1 day ago
by
dqdw
Technical question: Lineage of Qwen/Qwen2.5-Coder-1.5B-Instruct
#54 opened 1 day ago
by
dqdw
Add SWE-Bench Verified evaluation result
#53 opened 2 days ago
by
nielsr
fine-tune
1
#52 opened 6 days ago
by
m-hasnain-sabqi
Add MathArena evaluation result for hmmt/hmmt_feb_2026
#51 opened 7 days ago
by
JasperDekoninck
Upload IMG-20260223-WA0006.jpg
#50 opened 7 days ago
by
Awepeter
How to benchmark MMMU properly in SGLang?
#49 opened 7 days ago
by
JacobChang
Update .gitattributes
#48 opened 9 days ago
by
Uzef
how to use it
#47 opened 11 days ago
by
nownownhan
GLM-5-Flash
❤️ 11
1
#46 opened 11 days ago
by
exxxistent
I have reasoning datasets from this
2
#45 opened 12 days ago
by
crownelius
Add MathArena evaluation result for aime/aime_2026
#44 opened 12 days ago
by
JasperDekoninck
"Hierarchical Context Management" Reproducibility?
➕ 3
#43 opened 12 days ago
by
pandemo
How to navigate through
➕ 1
2
#41 opened 12 days ago
by
Thatboydan
Add arXiv metadata and update citation with paper links
1
#40 opened 12 days ago
by
nielsr
what the best ai to work on
4
#39 opened 12 days ago
by
ldeath131416
Will there be a glm-5-air?
➕ 3
1
#38 opened 13 days ago
by
ianncity
Ola
1
#37 opened 13 days ago
by
Olasconefo
I like this model but...
➕ 2
#36 opened 14 days ago
by
Carnyzzle
Question about training data sources for GLM-5
1
#33 opened 14 days ago
by
Mustina
Module not Found Error
#32 opened 14 days ago
by
aniket2025
New discussion here
#26 opened 16 days ago
by
bytecypher
Upload Screenshot_20260213-040717.jpg
#25 opened 17 days ago
by
janesjohn
Update README.md
#20 opened 17 days ago
by
janesjohn
Preshcyptz
#19 opened 17 days ago
by
janesjohn
Great Model but not accessible anymore
➕ 🔥 4
8
#17 opened 18 days ago
by
darkstar3537
Availability
1
#16 opened 18 days ago
by
kashifo
Action Model
#15 opened 18 days ago
by
CrypoCrackid
prime
2
#14 opened 18 days ago
by
dcmark74gmail
how to create a bot in 1min
1
#13 opened 18 days ago
by
0xcrypticmilex
How to Run GLM-5 Locally Guide! 🔥
❤️ 4
#12 opened 18 days ago
by
danielhanchen
Decode Context Parallel not working
#11 opened 18 days ago
by
pratiknarola
Update README.md
#10 opened 18 days ago
by
UnicornChan
We're sooooo back!!!!
2 reactions
1
#9 opened 19 days ago
by
NickupAI
GLM-5 Thorough Testing Video - Thanks
1 reaction
#8 opened 19 days ago
by
fahdmirzac
Thank you so much for this!
13 reactions (including 🔥)
1
#7 opened 19 days ago
by
SicariusSicariiStuff
Native Context Window
#6 opened 19 days ago
by
akumaburn
Please release a model with native 4-bit quantization
10 reactions (including ➕)
4
#4 opened 19 days ago
by
calycekr
We need Air and we need Flash
73 reactions (including ➕)
21
#3 opened 19 days ago
by
jacek2024
Apple silicon
2 reactions (including ➕)
#2 opened 19 days ago
by
ox-ox
Base model
18 reactions (including 🔥)
#1 opened 19 days ago
by
FriskyFennec