Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ruisi Cai's picture
1 4

Ruisi Cai

CCCCRS
·

AI & ML interests

None yet

Organizations

DeepMamba's profile picture

upvoted a paper 8 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 93
upvoted a paper 11 months ago

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

Paper • 2501.00658 • Published Dec 31, 2024 • 7
upvoted a paper about 1 year ago

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

Paper • 2410.19123 • Published Oct 24, 2024 • 15
upvoted a paper over 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs