Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
evalstate
/
hf-papers
like
1
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
hf-papers
/
scripts
108 kB
1 contributor
History:
3 commits
evalstate
HF Staff
Routing challenges: explicit following-feed negative test; use user feed for authenticated 50-activity task
1391d0b
verified
6 days ago
README.md
3.32 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
eval_hf_hub_prompt_ab.py
10.7 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
eval_tool_description_ab.py
29.8 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
hf_hub_community_challenges.txt
2.05 kB
Challenges: mark following-feed as explicit negative test; switch main 50-activity prompt to user feed
6 days ago
hf_hub_community_coverage_prompts.json
5 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
hf_hub_prompt_variants.json
663 Bytes
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
publish_space.sh
1.47 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
run_all_evals.sh
1.32 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
run_hf_hub_prompt_variant.py
1.99 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
run_tool_routing_batch.py
6.3 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
score_hf_hub_community_challenges.py
17.8 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
score_hf_hub_community_coverage.py
12 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
score_tool_routing_confusion.py
12.9 kB
sync: promote hf_hub_community prompt v3 + add prompt/coverage harness
7 days ago
tool_routing_challenges.txt
2.78 kB
Routing challenges: explicit following-feed negative test; use user feed for authenticated 50-activity task
6 days ago