Running 230 BigCodeBench Leaderboard π₯ 230 Explore code-generation model leaderboards and task details
Leaderboards and benchmarks β¨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... β’ 88 items β’ Updated 13 days ago β’ 116
view article Article ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models Jul 27, 2024 β’ 35
Running on CPU Upgrade 591 GAIA Leaderboard π¦Ύ 591 Submit your model answers to GAIA benchmark and view leaderboard
Running Featured 560 Vision Arena (Testing VLMs side-by-side) πΌ 560 Analyze images with multiple vision models for labels and boxes
Running 232 AI2 WildBench Leaderboard (V2) π¦ 232 Display and explore a leaderboard of language models
lauralex/distilbert-base-uncased-finetuned-emotion Text Classification β’ Updated Jan 31, 2023 β’ 4