Zixi "Oz" Li's picture

Building on HF

Zixi "Oz" Li PRO

OzTianlu

NoesisLab

·

https://github.com/lizixi-0x2F

lizixi-0x2F

AI & ML interests

My research focuses on deep reasoning with small language models, Transformer architecture innovation, and knowledge distillation for efficient alignment and transfer.

Recent Activity

new activity about 1 hour ago

mradermacher/Kai-3B-Instruct-GGUF:[Update Request] Please re-pull weights for Kai-3B-Instruct (v1.1 fixes mode collapse)

replied to their post about 1 hour ago

🚨 URGENT: To the 13k+ users downloading Kai-3B-Instruct — Please update to v1.1! (Official Q8_0 GGUF inside) https://huggingface.co/OzTianlu/Kai-3B-Instruct-Q8_0-GGUF Wow. Waking up to see over 13,000 combined downloads for the Kai-3B-Instruct GGUFs is absolutely mind-blowing. Thank you so much to the community and to the awesome creators (@SimplySara & @mradermacher) for the auto-quantization! However, we have a slight "suffering from success" situation here. 😅 ⚠️ THE ISSUE: You are likely running the v1.0 "Logic-Poisoned" weights. If your model is acting like a cold, emotionless robot that only replies with a rigid Analysis -> Approach -> Solution template even when you just say "Hello", you have v1.0. In our initial release, the model overfitted to its reasoning corpus, causing a complete "conversational mode collapse." 🚀 THE FIX: Official v1.1 is Live! We have completed a 4000-step annealing phase to restore its sanity.

posted an update about 1 hour ago

🚨 URGENT: To the 13k+ users downloading Kai-3B-Instruct — Please update to v1.1! (Official Q8_0 GGUF inside) https://huggingface.co/OzTianlu/Kai-3B-Instruct-Q8_0-GGUF Wow. Waking up to see over 13,000 combined downloads for the Kai-3B-Instruct GGUFs is absolutely mind-blowing. Thank you so much to the community and to the awesome creators (@SimplySara & @mradermacher) for the auto-quantization! However, we have a slight "suffering from success" situation here. 😅 ⚠️ THE ISSUE: You are likely running the v1.0 "Logic-Poisoned" weights. If your model is acting like a cold, emotionless robot that only replies with a rigid Analysis -> Approach -> Solution template even when you just say "Hello", you have v1.0. In our initial release, the model overfitted to its reasoning corpus, causing a complete "conversational mode collapse." 🚀 THE FIX: Official v1.1 is Live! We have completed a 4000-step annealing phase to restore its sanity.

View all activity

Organizations

OzTianlu 's models 1

OzTianlu/Kai-3B-Instruct-Q8_0-GGUF

Text Generation • 3B • Updated about 2 hours ago • 38