Curated models for AI infrastructure, LLM deployment, and edge computing. Optimized for NVIDIA DGX Spark and Docker Swarm clusters.
- Qwen/Qwen2.5-Coder-32B-Instruct
  Text Generation • 33B params • 739k downloads • 2k likes
- sentence-transformers/all-MiniLM-L6-v2
  Sentence Similarity • 22.7M params • 171M downloads • 4.51k likes
- BAAI/bge-large-en-v1.5
  Feature Extraction • 5.39M downloads • 631 likes
- meta-llama/Llama-3.3-70B-Instruct
  Text Generation • 749k downloads • 2.67k likes