Do Vision-Language Models Respect Contextual Integrity in Location Disclosure? Paper • 2602.05023 • Published Feb 4, 2026 • 2
Growing Through Experience: Scaling Episodic Grounding in Language Models Paper • 2506.01312 • Published Jun 2, 2025
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment Paper • 2411.10606 • Published Nov 15, 2024 • 1
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement Paper • 2504.16053 • Published Apr 22, 2025
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs Paper • 2510.05069 • Published Oct 6, 2025 • 13
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models Paper • 2507.14204 • Published Jul 14, 2025
Superficial Self-Improved Reasoners Benefit from Model Merging Paper • 2503.02103 • Published Mar 3, 2025
Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners Paper • 2510.04454 • Published Oct 6, 2025
Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning Paper • 2510.04072 • Published Oct 5, 2025 • 4
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Paper • 2506.18349 • Published Jun 23, 2025 • 13