Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Evaluation datasets
community
Activity Feed
Follow
74
AI & ML interests
None defined yet.
Recent Activity
alozowski
authored
a paper
3 days ago
YourBench: Easy Custom Evaluation Sets for Everyone
SaylorTwift
new
activity
9 days ago
OpenEvals/SimpleQA:
adds_eval_yaml
SaylorTwift
updated
a dataset
9 days ago
OpenEvals/SimpleQA
View all activity
Team members
8
lighteval
's datasets
192
Sort: Recently updated
lighteval/piqa
Viewer
•
Updated
20 days ago
•
21k
•
762
•
1
lighteval/logiqa_harness
Updated
Aug 19
•
32
lighteval/sacrebleu_manual
Viewer
•
Updated
Aug 19
•
936k
•
9k
lighteval/lextreme
Viewer
•
Updated
Aug 19
•
194k
•
723
lighteval/bbh
Viewer
•
Updated
Aug 18
•
78.3k
•
614
•
1
lighteval/synthetic_reasoning
Viewer
•
Updated
Aug 18
•
33k
•
826
•
7
lighteval/covid_dialogue
Viewer
•
Updated
Aug 18
•
614
•
87
•
1
lighteval/numeracy
Viewer
•
Updated
Aug 18
•
1.6k
•
292
•
1
lighteval/synthetic_reasoning_natural
Viewer
•
Updated
Aug 18
•
22k
•
104
•
15
lighteval/hendrycks_ethics
Viewer
•
Updated
Aug 18
•
116k
•
185
lighteval/civil_comments_helm
Viewer
•
Updated
Aug 18
•
623k
•
1.68k
•
1
lighteval/TwitterAAE
Viewer
•
Updated
Aug 18
•
100k
•
1.58k
lighteval/EntityMatching
Viewer
•
Updated
Aug 18
•
153k
•
441
•
7
lighteval/me_q_sum
Viewer
•
Updated
Aug 18
•
1.5k
•
14
lighteval/DyckLanguage
Viewer
•
Updated
Aug 18
•
1.51k
•
165
lighteval/lexglue
Viewer
•
Updated
Aug 18
•
473k
•
643
lighteval/wmt_14
Viewer
•
Updated
Aug 18
•
126k
•
243
lighteval/copyright_helm
Viewer
•
Updated
Aug 18
•
17.8k
•
165
lighteval/med_dialog
Viewer
•
Updated
Aug 18
•
257k
•
160
•
8
lighteval/mutual_harness
Viewer
•
Updated
Aug 18
•
17.7k
•
45
•
2
lighteval/boolq_helm
Viewer
•
Updated
Aug 18
•
12.7k
•
702
•
2
lighteval/legal_summarization
Viewer
•
Updated
Aug 18
•
26.9k
•
305
•
25
lighteval/med_paragraph_simplification
Viewer
•
Updated
Aug 18
•
4.46k
•
93
lighteval/code_generation_lite
Viewer
•
Updated
Aug 15
•
12.8k
•
12.6k
•
1
lighteval/lsat_qa
Viewer
•
Updated
Aug 14
•
459
•
194
•
4
lighteval/wikifact
Viewer
•
Updated
Aug 14
•
58.4k
•
1.67k
•
2
lighteval/bigbench_helm
Viewer
•
Updated
Aug 14
•
22.3k
•
1.59k
lighteval/bold_helm
Viewer
•
Updated
Aug 14
•
4.58k
•
143
lighteval/bbq_helm
Viewer
•
Updated
Aug 14
•
11.9k
•
559
•
4
lighteval/winograd_wsc
Viewer
•
Updated
Aug 13
•
558
•
44
Previous
1
2
3
...
7
Next