deepanshupillm committed on
Commit b795416 · verified · 1 Parent(s): 01a69a6

Update README.md

Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -88,7 +88,6 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a
 
  ![Humanity's Last Exam](HLE.png)
 
- ![Combined Benchmark](combined_benchmark.png)
 
  | Benchmark | Alpie-Core (32B-4bit) | DeepSeek-V2 (236B) | Qwen2.5 72B | Llama 3.1 405B | Llama 3.1 70B | Gemma-3 27B-PT | Mistral-Small-24B-Base-2501 |
  |-----------|----------------------|-------------------|-------------|---------------|---------------|----------------|----------------------------|
@@ -99,6 +98,8 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a
  | MBPP (pass@1) | **75.20%** | 65.0% | 72.6% | 68.4% | - | 65.6% | 69.64% |
  | HumanEval (pass@1) | **57.23%** | 43.3% | 53.0% | 54.9% | - | 48.8% | = |
 
+ ![Combined Benchmark](combined_benchmark.png)
+
  ### SWE-Bench Verified Performance
 
  ![SWE-Bench Performance](swe.png)