Open to Collab


dev7halo PRO

dev7halo

HaloKim

AI & ML interests

None yet

Recent Activity

liked a model 20 days ago

Sehyo/Qwen3.5-122B-A10B-NVFP4

commented on a paper 20 days ago

Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers



Organizations

upvoted a paper 21 days ago

Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers

Paper • 2602.18292 • Published 25 days ago • 10

upvoted an article 22 days ago

Article

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

24 days ago


upvoted a paper about 1 month ago

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Paper • 2602.03619 • Published Feb 3 • 26

upvoted a paper about 2 months ago

Post-LayerNorm Is Back: Stable, ExpressivE, and Deep

Paper • 2601.19895 • Published Jan 27 • 25

upvoted an article about 2 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27


upvoted a paper 2 months ago

Over-Searching in Search-Augmented Large Language Models

Paper • 2601.05503 • Published Jan 9 • 7

upvoted a paper 3 months ago

MemLoRA: Distilling Expert Adapters for On-Device Memory Systems

Paper • 2512.04763 • Published Dec 4, 2025 • 5

upvoted an article 3 months ago

Article

I Built a RAG System That Listens to Live BBC News and Answers Questions About "What Happened 10 Minutes Ago"

Dec 9, 2025


upvoted 3 papers 4 months ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 97

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17, 2025 • 137

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

Paper • 2511.06209 • Published Nov 9, 2025 • 19

upvoted a collection 6 months ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 15 days ago • 96

upvoted 2 articles 7 months ago

Article

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio

Jul 31, 2025


Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8, 2025


upvoted an article 8 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025 • 764

upvoted a paper 9 months ago

Orthogonal Finetuning Made Scalable

Paper • 2506.19847 • Published Jun 24, 2025 • 11

upvoted a paper 11 months ago

LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs

Paper • 2504.14655 • Published Apr 20, 2025 • 21

upvoted a paper 12 months ago

PaperBench: Evaluating AI's Ability to Replicate AI Research

Paper • 2504.01848 • Published Apr 2, 2025 • 37

upvoted 2 articles about 1 year ago

Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

Mar 7, 2025

•

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024

