How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published Apr 22, 2024 • 45
A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms Paper • 2409.16694 • Published Sep 25, 2024
QVGen: Pushing the Limit of Quantized Video Generative Models Paper • 2505.11497 • Published May 16, 2025 • 4
DB-LLM: Accurate Dual-Binarization for Efficient LLMs Paper • 2402.11960 • Published Feb 19, 2024 • 3
LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation Paper • 2510.08318 • Published Oct 9, 2025
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention Paper • 2602.04789 • Published 13 days ago • 3
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit Paper • 2405.06001 • Published May 9, 2024
MC#: Mixture Compressor for Mixture-of-Experts Large Models Paper • 2510.10962 • Published Oct 13, 2025
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published Dec 23, 2025 • 50
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 181
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17, 2025 • 91
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO Paper • 2505.13031 • Published May 19, 2025 • 4
EmbRACE-3K: Embodied Reasoning and Action in Complex Environments Paper • 2507.10548 • Published Jul 14, 2025 • 37
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26, 2025 • 188