Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 20 days ago • 134
Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads Paper • 2511.06209 • Published 28 days ago • 17
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 7 days ago • 96
view article Article Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio Jul 31 • 60
LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs Paper • 2504.14655 • Published Apr 20 • 20
PaperBench: Evaluating AI's Ability to Replicate AI Research Paper • 2504.01848 • Published Apr 2 • 36
view article Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! Mar 7 • 88
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 60
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published Jan 17 • 53
view article Article Docmatix - a huge dataset for Document Visual Question Answering Jul 18, 2024 • 78
view article Article Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning Feb 20, 2024 • 30