Zhu Lin

czl

https://czl.my/

AI & ML interests

Computer Vision, LLM

Recent Activity

updated a dataset about 22 hours ago

czl/nangang_sports_center

updated a dataset about 22 hours ago

czl/xinyi_public_gym

updated a dataset about 22 hours ago

czl/zhongshan_public_gym

upvoted an article 3 days ago

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Article • 4 days ago • 49

upvoted a paper 5 days ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published 8 days ago • 31

upvoted an article about 1 month ago

Forge: Scalable Agent RL Framework and Algorithm

Article • Feb 13 • 139

upvoted 2 articles 3 months ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Article • Oct 30, 2025 • 43

What makes good reasoning data

Article • Oct 30, 2025 • 44

upvoted 3 articles 4 months ago

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Article • Oct 23, 2025 • 151

There is no such thing as a tokenizer-free lunch

Article • Sep 25, 2025 • 95

Evaluate Your Own RAG: Why Best Practices Failed Us

Article • Nov 5, 2025 • 14

upvoted a paper 4 months ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published Nov 11, 2025 • 35

upvoted an article 5 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

Article • Oct 30, 2025 • 78

upvoted a collection 5 months ago

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated about 18 hours ago • 126

upvoted a paper 5 months ago

SSDD: Single-Step Diffusion Decoder for Efficient Image Tokenization

Paper • 2510.04961 • Published Oct 6, 2025 • 5

upvoted a collection 7 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated about 18 hours ago • 102

upvoted an article 7 months ago

You could have designed state of the art positional encoding

Article • Nov 25, 2024 • 454

upvoted a collection 7 months ago

Instruct datasets

5 items • Updated May 5, 2025 • 5

upvoted 2 collections 9 months ago

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 14 items • Updated Oct 9, 2025 • 97

Gemma 3n

4 items • Updated 9 days ago • 269

upvoted a collection about 1 year ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 10 days ago • 266

upvoted a paper over 1 year ago

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

Paper • 2410.11795 • Published Oct 15, 2024 • 18

upvoted a paper about 2 years ago

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Paper • 2402.13616 • Published Feb 21, 2024 • 49