-
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models
Paper • 2602.16609 • Published • 6 -
lightonai/ColBERT-Zero
Sentence Similarity • 0.1B • Updated • 722 • 24 -
lightonai/ColBERT-Zero-supervised
Sentence Similarity • 0.1B • Updated • 56 • 3 -
lightonai/ColBERT-Zero-unsupervised
Sentence Similarity • 0.1B • Updated • 34 • 1
Collections
Discover the best community collections!
Collections trending this week
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 76.6k • 82 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • Updated • 54k • • 398 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 759k • 146 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • Updated • 153k • • 763
-
cerebras/Qwen3-Coder-REAP-363B-A35B-FP8
Text Generation • Updated • 41 • 15 -
cerebras/Qwen3-Coder-REAP-246B-A35B-FP8
Text Generation • 246B • Updated • 678 • 21 -
cerebras/Qwen3-Coder-REAP-363B-A35B
Text Generation • 363B • Updated • 16 • 5 -
cerebras/Qwen3-Coder-REAP-246B-A35B
Text Generation • 246B • Updated • 15 • 8
-
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction • Updated • 20.1k • 214 -
facebook/dinov3-vits16-pretrain-lvd1689m
Image Feature Extraction • 21.6M • Updated • 110k • 67 -
facebook/dinov3-convnext-small-pretrain-lvd1689m
Image Feature Extraction • 49.5M • Updated • 31.6k • 22 -
facebook/dinov3-vitb16-pretrain-lvd1689m
Image Feature Extraction • 85.7M • Updated • 582k • 102
-
Qwen3 VL Demo
😻385Chat with an AI that understands text, images, and videos
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 2.76M • • 378 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 349k • • 370 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 25.9k • 27
-
TeichAI/claude-4.5-opus-high-reasoning-250x
Viewer • Updated • 250 • 5.55k • 290 -
Qwen3 Claude Opus
🚀24Chat with an AI for various inquiries
-
TeichAI/Nemotron-Cascade-14B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF
15B • Updated • 3.41k • 10 -
TeichAI/Nemotron-Cascade-14B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill
Text Generation • Updated • 223 • 6
-
tokyotech-llm/GPT-OSS-Swallow-20B-RL-v0.1
Text Generation • 21B • Updated • 4.03k • 13 -
tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1
Text Generation • 117B • Updated • 2.05k • 9 -
tokyotech-llm/GPT-OSS-Swallow-20B-SFT-v0.1
Text Generation • 21B • Updated • 1.97k • 5 -
tokyotech-llm/GPT-OSS-Swallow-120B-SFT-v0.1
Text Generation • 117B • Updated • 3.06k • 2
-
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models
Paper • 2602.16609 • Published • 6 -
lightonai/ColBERT-Zero
Sentence Similarity • 0.1B • Updated • 722 • 24 -
lightonai/ColBERT-Zero-supervised
Sentence Similarity • 0.1B • Updated • 56 • 3 -
lightonai/ColBERT-Zero-unsupervised
Sentence Similarity • 0.1B • Updated • 34 • 1
-
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction • Updated • 20.1k • 214 -
facebook/dinov3-vits16-pretrain-lvd1689m
Image Feature Extraction • 21.6M • Updated • 110k • 67 -
facebook/dinov3-convnext-small-pretrain-lvd1689m
Image Feature Extraction • 49.5M • Updated • 31.6k • 22 -
facebook/dinov3-vitb16-pretrain-lvd1689m
Image Feature Extraction • 85.7M • Updated • 582k • 102
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 76.6k • 82 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • Updated • 54k • • 398 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 759k • 146 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • Updated • 153k • • 763
-
Qwen3 VL Demo
😻385Chat with an AI that understands text, images, and videos
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 2.76M • • 378 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 349k • • 370 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 25.9k • 27
-
TeichAI/claude-4.5-opus-high-reasoning-250x
Viewer • Updated • 250 • 5.55k • 290 -
Qwen3 Claude Opus
🚀24Chat with an AI for various inquiries
-
TeichAI/Nemotron-Cascade-14B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF
15B • Updated • 3.41k • 10 -
TeichAI/Nemotron-Cascade-14B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill
Text Generation • Updated • 223 • 6
-
cerebras/Qwen3-Coder-REAP-363B-A35B-FP8
Text Generation • Updated • 41 • 15 -
cerebras/Qwen3-Coder-REAP-246B-A35B-FP8
Text Generation • 246B • Updated • 678 • 21 -
cerebras/Qwen3-Coder-REAP-363B-A35B
Text Generation • 363B • Updated • 16 • 5 -
cerebras/Qwen3-Coder-REAP-246B-A35B
Text Generation • 246B • Updated • 15 • 8
-
tokyotech-llm/GPT-OSS-Swallow-20B-RL-v0.1
Text Generation • 21B • Updated • 4.03k • 13 -
tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1
Text Generation • 117B • Updated • 2.05k • 9 -
tokyotech-llm/GPT-OSS-Swallow-20B-SFT-v0.1
Text Generation • 21B • Updated • 1.97k • 5 -
tokyotech-llm/GPT-OSS-Swallow-120B-SFT-v0.1
Text Generation • 117B • Updated • 3.06k • 2