Efficient Intelligence and Systems

community

Efficient-ML

Activity Feed

AI & ML interests

Low-bit Quantization of Large Language Models (LLMs)

Recent Activity

mack-williams authored a paper 11 days ago

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

mack-williams authored a paper 11 days ago

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

mack-williams authored a paper 11 days ago

QVGen: Pushing the Limit of Quantized Video Generative Models

View all activity

mack-williams

authored 8 papers 11 days ago

LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation

Paper • 2510.08318 • Published Oct 9, 2025

Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention

Paper • 2602.04789 • Published 13 days ago • 3

PTQ4SAM: Post-Training Quantization for Segment Anything

Paper • 2405.03144 • Published May 6, 2024

LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit

Paper • 2405.06001 • Published May 9, 2024

AaronHuangWei

authored 2 papers about 2 months ago

MC#: Mixture Compressor for Mixture-of-Experts Large Models

Paper • 2510.10962 • Published Oct 13, 2025

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 50

AaronHuangWei

submitted a paper to Daily Papers about 2 months ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 50

AaronHuangWei

authored 5 papers 4 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 181

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 91

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

Paper • 2505.13031 • Published May 19, 2025 • 4

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

Paper • 2507.10548 • Published Jul 14, 2025 • 37

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 188

HaotongQin

authored a paper 5 months ago

Quantized Visual Geometry Grounded Transformer

Paper • 2509.21302 • Published Sep 25, 2025 • 9

nightYue

authored a paper 5 months ago

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published May 4, 2025 • 25

AaronHuangWei

authored a paper 7 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 160

HaotongQin

authored a paper 9 months ago

QVGen: Pushing the Limit of Quantized Video Generative Models

Paper • 2505.11497 • Published May 16, 2025 • 4

AI & ML interests

Recent Activity

Team members 9

Efficient-ML's activity