deepanshupillm commited on
Commit
937a539
·
verified ·
1 Parent(s): 4952319

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -92,7 +92,7 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a
92
 
93
  ![BBH Benchmark](BBH.png)
94
 
95
-
96
 
97
  | Benchmark | Alpie-Core (32B-4bit) | DeepSeek-V2 (236B) | Qwen2.5 72B | Llama 3.1 405B | Llama 3.1 70B | Gemma-3 27B-PT | Mistral-Small-24B-Base-2501 |
98
  |-----------|----------------------|-------------------|-------------|---------------|---------------|----------------|----------------------------|
@@ -103,7 +103,7 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a
103
  | MBPP (pass@1) | **75.20%** | 65.0% | 72.6% | 68.4% | - | 65.6% | 69.64% |
104
  | HumanEval (pass@1) | **57.23%** | 43.3% | 53.0% | 54.9% | - | 48.8% | = |
105
 
106
- ![Combined Benchmark](combined_benchmark.png)
107
 
108
  ### SWE-Bench Verified Performance
109
 
 
92
 
93
  ![BBH Benchmark](BBH.png)
94
 
95
+ ![Combined Benchmark](combined_benchmark.png)
96
 
97
  | Benchmark | Alpie-Core (32B-4bit) | DeepSeek-V2 (236B) | Qwen2.5 72B | Llama 3.1 405B | Llama 3.1 70B | Gemma-3 27B-PT | Mistral-Small-24B-Base-2501 |
98
  |-----------|----------------------|-------------------|-------------|---------------|---------------|----------------|----------------------------|
 
103
  | MBPP (pass@1) | **75.20%** | 65.0% | 72.6% | 68.4% | - | 65.6% | 69.64% |
104
  | HumanEval (pass@1) | **57.23%** | 43.3% | 53.0% | 54.9% | - | 48.8% | = |
105
 
106
+
107
 
108
  ### SWE-Bench Verified Performance
109