| Tasks | Version | Filter | n-shot | Metric | Value | Stderr | ||
|---|---|---|---|---|---|---|---|---|
| arc_challenge | 1 | none | 0 | acc | ↑ | 0.2176 | ± | 0.0121 |
| none | 0 | acc_norm | ↑ | 0.2628 | ± | 0.0129 | ||
| arc_easy | 1 | none | 0 | acc | ↑ | 0.2584 | ± | 0.0090 |
| none | 0 | acc_norm | ↑ | 0.2567 | ± | 0.0090 | ||
| boolq | 2 | none | 0 | acc | ↑ | 0.4171 | ± | 0.0086 |
| hellaswag | 1 | none | 0 | acc | ↑ | 0.2565 | ± | 0.0044 |
| none | 0 | acc_norm | ↑ | 0.2639 | ± | 0.0044 | ||
| openbookqa | 1 | none | 0 | acc | ↑ | 0.1620 | ± | 0.0165 |
| none | 0 | acc_norm | ↑ | 0.2800 | ± | 0.0201 | ||
| piqa | 1 | none | 0 | acc | ↑ | 0.5419 | ± | 0.0116 |
| none | 0 | acc_norm | ↑ | 0.5234 | ± | 0.0117 | ||
| winogrande | 1 | none | 0 | acc | ↑ | 0.5272 | ± | 0.0140 |
- Downloads last month
- 7
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support