Update README.md
Browse files
README.md
CHANGED
|
@@ -44,10 +44,15 @@ Alpie-Core is one of the world's first fine-tuned 4-bit reasoning models, provin
|
|
| 44 |
**Alpie-Core** has undergone extensive **supervised fine-tuning (SFT)** to strengthen reasoning, robustness, and safety. The training leveraged a diverse mixture of curated open-source datasets and proprietary synthetic data, optimized with high-quality LLM-generated responses. The fine-tuning process emphasized adherence to rigorous safety and usability standards, including:
|
| 45 |
|
| 46 |
1)**User Understanding and Clarity** – ensuring outputs are direct, interpretable, and pedagogically sound.
|
|
|
|
| 47 |
2)**Security and Ethical Guidelines** – filtering unsafe or harmful generations during and after training.
|
|
|
|
| 48 |
3)**Limitations, Disclaimers, and Knowledge Boundaries** – transparently communicating uncertainty and scope.
|
|
|
|
| 49 |
4)**Handling Complex and Sensitive Topics** – balancing informativeness with responsible guardrails.
|
|
|
|
| 50 |
5)**Safety and Respectful Engagement** – maintaining politeness, inclusivity, and cultural sensitivity.
|
|
|
|
| 51 |
6)**Confidentiality and Responsible Use** – preventing leakage of private training data, proprietary prompts, or internal reasoning traces.
|
| 52 |
|
| 53 |
This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-aware responses while maintaining safety across a broad range of use cases.
|
|
|
|
| 44 |
**Alpie-Core** has undergone extensive **supervised fine-tuning (SFT)** to strengthen reasoning, robustness, and safety. The training leveraged a diverse mixture of curated open-source datasets and proprietary synthetic data, optimized with high-quality LLM-generated responses. The fine-tuning process emphasized adherence to rigorous safety and usability standards, including:
|
| 45 |
|
| 46 |
1)**User Understanding and Clarity** – ensuring outputs are direct, interpretable, and pedagogically sound.
|
| 47 |
+
|
| 48 |
2)**Security and Ethical Guidelines** – filtering unsafe or harmful generations during and after training.
|
| 49 |
+
|
| 50 |
3)**Limitations, Disclaimers, and Knowledge Boundaries** – transparently communicating uncertainty and scope.
|
| 51 |
+
|
| 52 |
4)**Handling Complex and Sensitive Topics** – balancing informativeness with responsible guardrails.
|
| 53 |
+
|
| 54 |
5)**Safety and Respectful Engagement** – maintaining politeness, inclusivity, and cultural sensitivity.
|
| 55 |
+
|
| 56 |
6)**Confidentiality and Responsible Use** – preventing leakage of private training data, proprietary prompts, or internal reasoning traces.
|
| 57 |
|
| 58 |
This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-aware responses while maintaining safety across a broad range of use cases.
|