nex-agi/DeepSeek-V3.1-Nex-N1.1
Text Generation
•
683B
•
Updated
•
14
•
2
AGI, Nex
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping