Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

2,378

Full-text search

Active filters: multimodal

Mungert/Qwen2.5-VL-3B-Instruct-GGUF

Image-Text-to-Text • 3B • Updated Sep 24 • 19.2k • 25

Qwen/Qwen2.5-Omni-3B

Any-to-Any • 6B • Updated Apr 30 • 294k • 311

imageomics/bioclip-2

Zero-Shot Image Classification • Updated Oct 16 • 16.1k • 23

unsloth/Qwen2.5-Omni-7B-GGUF

Any-to-Any • 8B • Updated May 28 • 10.1k • 47

mispeech/midashenglm-7b-0804-fp32

Audio-Text-to-Text • 8B • Updated Oct 31 • 33.1k • 76

Qwen/Qwen3-Omni-30B-A3B-Captioner

Any-to-Any • 32B • Updated Sep 22 • 22.1k • 177

Qwen/Qwen3-Omni-30B-A3B-Thinking

Any-to-Any • 32B • Updated Sep 22 • 50k • 230

Kwai-Keye/Keye-VL-671B-A37B

Video-Text-to-Text • 672B • Updated 18 days ago • 126 • 17

yasserrmd/Fara-TARS-7B

Image-Text-to-Text • 8B • Updated 12 days ago • 209 • 4

Cognitive-Lab/NetraEmbed

Visual Document Retrieval • 4B • Updated about 3 hours ago • 257 • 2

lijiayangCS/DiTFuse

Image-to-Image • Updated 6 days ago • 2

Lewdiculous/Nyanade_Stunna-Maid-7B-v0.2-GGUF-IQ-Imatrix

7B • Updated May 4, 2024 • 2.18k • 59

lmms-lab/llava-onevision-qwen2-0.5b-ov

Text Generation • 0.9B • Updated Sep 2, 2024 • 27.3k • 26

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 1.55M • • 1.24k

Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4

Image-Text-to-Text • 13B • Updated Sep 24, 2024 • 345 • 29

unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit

Image-to-Text • 6B • Updated Dec 10, 2024 • 517k • 80

unsloth/Llama-3.2-11B-Vision-Instruct

Image-to-Text • 11B • Updated Dec 10, 2024 • 21k • 86

nvidia/NVLM-D-72B

Image-Text-to-Text • 79B • Updated Jan 14 • 54.8k • 775

bartowski/Qwen2-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated Dec 17, 2024 • 6.55k • 41

OpenGVLab/InternVideo2_5_Chat_8B

Video-Text-to-Text • 8B • Updated Aug 4 • 45.5k • 86

ByteDance-Seed/UI-TARS-72B-DPO

Image-Text-to-Text • 73B • Updated Jan 25 • 2.27k • 147

Qwen/Qwen2.5-VL-72B-Instruct-AWQ

Image-Text-to-Text • 13B • Updated Mar 7 • 60k • 69

Qwen/Qwen2.5-VL-7B-Instruct-AWQ

Image-Text-to-Text • 3B • Updated Apr 6 • 162k • 94

sbintuitions/sarashina2-vision-8b

Image-to-Text • 8B • Updated Mar 27 • 4.99k • 10

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 258k • • 469

openbmb/AgentCPM-GUI

Image-Text-to-Text • 8B • Updated Jun 14 • 240 • 128

unsloth/Qwen2.5-VL-3B-Instruct-GGUF

Image-Text-to-Text • 3B • Updated May 12 • 5.6k • 18

BAAI/Video-XL-2

Video-Text-to-Text • 8B • Updated Jun 6 • 400 • 55

lingshu-medical-mllm/Lingshu-32B

Image-Text-to-Text • 33B • Updated Sep 17 • 1.11k • 69

Mungert/Qwen2.5-Omni-3B-GGUF

Any-to-Any • 3B • Updated Sep 24 • 703 • 3