jdopensource
/

JoyAI-LLM-Flash

Text Generation

joyai_llm_flash

Model card Files Files and versions

Add evaluation results for GPQA-Diamond, MMLU-Pro

#6

by SaylorTwift HF Staff - opened 16 days ago

base: refs/heads/main

←

from: refs/pr/6

Discussion Files changed

Invalid content in Eval Result file .eval_results/gpqa_diamond.yaml

Check out the documentation for more information.

Show details

Task ID "diamond" does not match any task in dataset "Idavidrein/gpqa". Available: none

Invalid content in Eval Result file .eval_results/mmlu_pro.yaml

Check out the documentation for more information.

Show details

Task ID "mmlu_pro" does not match any task in dataset "TIGER-Lab/MMLU-Pro". Available: none

Files changed (2) hide show

.eval_results/gpqa_diamond.yaml +9 -0
.eval_results/mmlu_pro.yaml +9 -0

.eval_results/gpqa_diamond.yaml ADDED Viewed

	@@ -0,0 +1,9 @@

+- dataset:
+    id: Idavidrein/gpqa
+    task_id: diamond
+  value: 74.43
+  date: '2026-02-16'
+  source:
+    url: https://huggingface.co/jdopensource/JoyAI-LLM-Flash
+    name: Model Card
+    user: SaylorTwift

.eval_results/mmlu_pro.yaml ADDED Viewed

	@@ -0,0 +1,9 @@

+- dataset:
+    id: TIGER-Lab/MMLU-Pro
+    task_id: mmlu_pro
+  value: 81.02
+  date: '2026-02-16'
+  source:
+    url: https://huggingface.co/jdopensource/JoyAI-LLM-Flash
+    name: Model Card
+    user: SaylorTwift