Running 65 UncheatableEval ๐ Compare and analyze AI model compression performance across different sizes and metrics