Boxi Cao

Bowieee

AI & ML interests

None yet

Recent Activity

upvoted an article about 20 hours ago

Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance

published an article about 20 hours ago

Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance

liked a model about 2 months ago

Lite-Coder/LiteCoder-4b-Terminal-preview

Organizations

upvoted an article about 20 hours ago

Article

Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance

about 20 hours ago

published an article about 20 hours ago

Article

Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance

about 20 hours ago

liked a model about 2 months ago

Lite-Coder/LiteCoder-4b-Terminal-preview

4B • Updated Dec 17, 2025 • 4 • 5

upvoted an article about 2 months ago

Article

Announcing LiteCoder-Terminal: Lightweight Terminal Agents with <1k Synthesized Trajectories

Dec 18, 2025

upvoted a paper 3 months ago

When Models Outthink Their Safety: Mitigating Self-Jailbreak in Large Reasoning Models with Chain-of-Guardrails

Paper • 2510.21285 • Published Oct 24, 2025 • 4

liked 2 datasets 10 months ago

agentica-org/DeepCoder-Preview-Dataset

Viewer • Updated Apr 9, 2025 • 25k • 1.65k • 97

inclusionAI/AReaL-boba-Data

Preview • Updated Mar 29, 2025 • 37 • 23

liked a dataset 11 months ago

open-r1/codeforces-cots

Viewer • Updated Mar 28, 2025 • 254k • 1.3k • 201

liked a dataset about 1 year ago

allenai/olmo-mix-1124

Viewer • Updated Aug 19, 2025 • 621M • 9.22k • 86

upvoted a paper about 1 year ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 24

authored a paper about 1 year ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 24

liked a dataset over 1 year ago

GAIR/o1-journey

Viewer • Updated Oct 16, 2024 • 327 • 49 • 134

updated a Space over 1 year ago

StructEval Leaderboard

🥇

Display StructEval leaderboard with customizable columns

New activity in Bowieee/StructEval_leaderboard over 1 year ago

Link Space to the paper

#1 opened over 1 year ago by

nielsr

authored a paper over 1 year ago

StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation

Paper • 2408.03281 • Published Aug 6, 2024 • 10

liked a Space over 1 year ago

StructEval Leaderboard

🥇

Display StructEval leaderboard with customizable columns

updated a collection over 1 year ago

Leaderboard

Collection

4 items • Updated Aug 26, 2024

upvoted a paper over 1 year ago

StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation

Paper • 2408.03281 • Published Aug 6, 2024 • 10
