Pre-computed Q-Filters for efficient KV cache compression.
Nathan Godey
nthngdy
AI & ML interests
None yet
Recent Activity
updated
a model
6 days ago
nthngdy/bttl_2B
new activity
16 days ago
facebook/blt-7b:hf_integration
published
a model
about 1 month ago
nthngdy/bttl_2B