Dotanoob7

Dotanoob

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

liked a model 3 days ago

Qwen/Qwen3.5-9B

liked a Space 6 months ago

HuggingFaceFW/blogpost-fineweb-v1

Organizations

None yet

upvoted a paper 1 day ago

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

Paper • 2602.23866 • Published 6 days ago • 55

liked a model 3 days ago

Qwen/Qwen3.5-9B

Image-Text-to-Text • 10B • Updated 3 days ago • 172k • 413

liked 2 Spaces 6 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.3k

Generate a curated web‑text dataset for LLM training

The Ultra-Scale Playbook

🌌

3.72k

The ultimate guide to training LLM on large GPU Clusters

upvoted 5 papers 6 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 268

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 214

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 272

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 298

upvoted a collection 6 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 45 items • Updated 3 days ago • 106

upvoted an article 8 months ago

Article

Upskill your LLMs With Gradio MCP Servers

Jul 9, 2025

liked a model 8 months ago

black-forest-labs/FLUX.1-Kontext-dev

Image-to-Image • Updated Jan 1 • 124k • 2.56k

liked a Space 8 months ago

GPU Poor LLM Arena

🏆

356

Compact LLM Battle Arena: Frugal AI Face-Off!

upvoted a paper 8 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 251

upvoted 2 papers 10 months ago

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7, 2025 • 29

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 98

upvoted 2 papers 11 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9, 2025 • 77

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7, 2025 • 110

liked a model 12 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 270k • 3.09k

upvoted a paper about 1 year ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22, 2025 • 90