SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published 8 days ago • 35
view article Article Train AI models with Unsloth and Hugging Face Jobs for FREE +4 2 days ago • 56
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception Paper • 2602.11858 • Published 10 days ago • 58
BitDance Collection BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. • 10 items • Updated 3 days ago • 8
Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments Paper • 2602.11964 • Published 9 days ago • 12
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning Paper • 2602.11748 • Published 10 days ago • 30
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published 9 days ago • 93
GENIUS: Generative Fluid Intelligence Evaluation Suite Paper • 2602.11144 • Published 10 days ago • 53
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published 9 days ago • 56
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing Paper • 2602.12205 • Published 9 days ago • 78
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies Paper • 2602.09877 • Published 11 days ago • 189
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling 9 days ago • 44
view article Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments +3 10 days ago • 25
PhyCritic: Multimodal Critic Models for Physical AI Paper • 2602.11124 • Published 10 days ago • 51