FastWan Collection: models trained with video sparse attention (https://arxiv.org/abs/2505.13389) and distillation • 9 items • Updated 20 days ago • 10
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization Paper • 2406.05981 • Published Jun 10, 2024 • 16
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Paper • 2407.12772 • Published Jul 17, 2024 • 35