Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -88,7 +88,6 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a
 ![Humanity's Last Exam](HLE.png)
-![Combined Benchmark](combined_benchmark.png)
 | Benchmark | Alpie-Core (32B-4bit) | DeepSeek-V2 (236B) | Qwen2.5 72B | Llama 3.1 405B | Llama 3.1 70B | Gemma-3 27B-PT | Mistral-Small-24B-Base-2501 |
 |-----------|----------------------|-------------------|-------------|---------------|---------------|----------------|----------------------------|
@@ -99,6 +98,8 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a
 | MBPP (pass@1) | **75.20%** | 65.0% | 72.6% | 68.4% | - | 65.6% | 69.64% |
 | HumanEval (pass@1) | **57.23%** | 43.3% | 53.0% | 54.9% | - | 48.8% | = |
 ### SWE-Bench Verified Performance
 ![SWE-Bench Performance](swe.png)

 ![Humanity's Last Exam](HLE.png)
 | Benchmark | Alpie-Core (32B-4bit) | DeepSeek-V2 (236B) | Qwen2.5 72B | Llama 3.1 405B | Llama 3.1 70B | Gemma-3 27B-PT | Mistral-Small-24B-Base-2501 |
 |-----------|----------------------|-------------------|-------------|---------------|---------------|----------------|----------------------------|
 | MBPP (pass@1) | **75.20%** | 65.0% | 72.6% | 68.4% | - | 65.6% | 69.64% |
 | HumanEval (pass@1) | **57.23%** | 43.3% | 53.0% | 54.9% | - | 48.8% | = |
+![Combined Benchmark](combined_benchmark.png)
 ### SWE-Bench Verified Performance
 ![SWE-Bench Performance](swe.png)