TurtleBench TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Paper • 2410.05262 • Published Oct 7, 2024 • 11 Duguce/TurtleBench1.5k Viewer • Updated Oct 30, 2024 • 3.06k • 19 • 6
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Paper • 2410.05262 • Published Oct 7, 2024 • 11
TurtleBench TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Paper • 2410.05262 • Published Oct 7, 2024 • 11 Duguce/TurtleBench1.5k Viewer • Updated Oct 30, 2024 • 3.06k • 19 • 6
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Paper • 2410.05262 • Published Oct 7, 2024 • 11