Zixi "Oz" Li
PRO
OzTianlu
19 followers · 21 following
https://github.com/lizixi-0x2F
lizixi-0x2F
AI & ML interests
My research focuses on deep reasoning with small language models, Transformer architecture innovation, and knowledge distillation for efficient alignment and transfer.
Recent Activity
New activity about 1 hour ago in mradermacher/Kai-3B-Instruct-GGUF: [Update Request] Please re-pull weights for Kai-3B-Instruct (v1.1 fixes mode collapse)
Replied to their post about 1 hour ago

🚨 URGENT: To the 13k+ users downloading Kai-3B-Instruct - Please update to v1.1! (Official Q8_0 GGUF inside) https://huggingface.co/OzTianlu/Kai-3B-Instruct-Q8_0-GGUF

Wow. Waking up to see over 13,000 combined downloads for the Kai-3B-Instruct GGUFs is absolutely mind-blowing. Thank you so much to the community and to the awesome creators (@SimplySara & @mradermacher) for the auto-quantization!

However, we have a slight "suffering from success" situation here.

⚠️ THE ISSUE: You are likely running the v1.0 "Logic-Poisoned" weights. If your model is acting like a cold, emotionless robot that only replies with a rigid Analysis -> Approach -> Solution template even when you just say "Hello", you have v1.0. In our initial release, the model overfitted to its reasoning corpus, causing a complete "conversational mode collapse."

THE FIX: Official v1.1 is Live! We have completed a 4000-step annealing phase to restore its sanity.
Posted an update about 1 hour ago
Organizations
OzTianlu's models (1)
OzTianlu/Kai-3B-Instruct-Q8_0-GGUF
Text Generation • 3B • Updated about 2 hours ago • 38