Ai2 Open Coding Agents - Django, Sphinx, Sympy Data
AI & ML interests
Building breatkthrough AI to solve the world's biggest problems.
Recent Activity
Papers
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics
How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs
Organization Card
spaces 13
pinned
Running
20
AstaBench Leaderboard
🥇
View benchmark leaderboards
pinned
Running
422
Reward Bench Leaderboard
📐
Explore RewardBench model rankings and scores
pinned
Sleeping
2
HREF Leaderboard
📐
Browse and search HREF leaderboard data
pinned
Running
91
Zebra Logic Bench
🦓
Show leaderboard and explore model puzzle results
pinned
Running
3
SUPER Leaderboard
🤖
Display a static leaderboard from a JSON file
pinned
Running
53
ZeroEval Leaderboard
📊
Embed ZeroEval for evaluation
models 854
allenai/FlexOlmo-7x7B-1T-RT
Text Generation • 33B • Updated
• 52 • 7
allenai/FlexOlmo-7x7B-1T
Text Generation • 33B • Updated
• 194 • 38
allenai/Flex-public-7B-1T
Text Generation • 7B • Updated
• 286 • 5
allenai/Flex-reddit-2x7B-1T
Text Generation • 12B • Updated
• 4.65k • 7
allenai/Flex-pes2o-2x7B-1T
Text Generation • 12B • Updated
• 187 • 2
allenai/Flex-news-2x7B-1T
Text Generation • 12B • Updated
• 182 • 2
allenai/Flex-creative-2x7B-1T
Text Generation • 12B • Updated
• 289 • 5
allenai/Flex-code-2x7B-1T
Text Generation • 12B • Updated
• 400 • 2
allenai/Flex-math-2x7B-1T
Text Generation • 12B • Updated
• 388 • 3
allenai/olmo-3.2-tokenizer-think-release
Updated
• 2
datasets 381
allenai/molmospaces
Viewer
• Updated
• 772k • 6.17k • 39
allenai/Dolci-Instruct-SFT-Tool-Use-SA
Viewer
• Updated
• 1.6k • 56 • 3
allenai/Dolci-Think-SFT-32B
Viewer
• Updated
• 2.25M • 1.46k • 24
allenai/asta-summary-citation-counts
Viewer
• Updated
• 49.2M • 439 • 8
allenai/code_fresh_0825_1225
Viewer
• Updated
• 66.5k • 183 • 3
allenai/Molmo2-VideoPoint
Viewer
• Updated
• 1.32M • 360 • 5
allenai/SimpleToM
Viewer
• Updated
• 4.59k • 256 • 10
allenai/asta-user-interactions
Viewer
• Updated
• 14M • 52 • 6
allenai/dolma3_pool_staging
Viewer
• Updated
• 1 • 200 • 1
allenai/prescience
Viewer
• Updated
• 839k • 81 • 17