Prune Once for All: Sparse Pre-Trained Language Models Paper • 2111.05754 • Published Nov 10, 2021 • 2
Article DeepMath: A lightweight math reasoning Agent with smolagents +1 Dec 4, 2025 • 39
Article Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models +3 Sep 29, 2025 • 24
Article Breaking Language Barriers in Mathematical AI: Introducing Hebrew Math Tutor Sep 7, 2025 • 3
Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 Apr 16, 2025 • 42
Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques Mar 24, 2025 • 20
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Paper • 2502.09390 • Published Feb 13, 2025 • 16
Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 5 items • Updated Jan 15 • 10