In a Training Loop 🔄

David Andrews PRO

Broyojo

https://broyojo.com

AI & ML interests

Transformer models, diffusion models, reinforcement learning, AI accelerators, computer architecture, VLSI

Recent Activity

liked a model 8 days ago

Zyphra/ZUNA

Organizations

upvoted a paper 15 days ago

Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop

Paper • 2506.10968 • Published Jun 12, 2025 • 1

upvoted a paper 16 days ago

GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics

Paper • 2602.12617 • Published 19 days ago • 20

upvoted a paper about 2 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 148

upvoted a paper 9 months ago

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Paper • 2505.24298 • Published May 30, 2025 • 29

upvoted a collection about 1 year ago

🧠 Reasoning Models

Collection

8 items • Updated Jan 4 • 42

upvoted an article about 1 year ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Dec 4, 2024

upvoted a collection about 1 year ago

Skywork-o1-Open

Collection

Skywork o1 open model collections • 3 items • Updated Jun 12, 2025 • 22

upvoted a collection over 1 year ago

Llama-3.1-Nemotron-70B

Collection

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated about 18 hours ago • 155

upvoted a paper over 2 years ago

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 86
