yaniseuranova
/

setfit-rag-hybrid-search-query-router

@@ -9,11 +9,17 @@ base_model: sentence-transformers/all-mpnet-base-v2
 metrics:
 - accuracy
 widget:
-- text: Quels sont les enjeux éthiques des algorithmes de décision automatisés?
-- text: Who is the founder of Tesla Motors?
-- text: How do I create a new email account on Gmail?
-- text: How can we use artificial intelligence to improve mental health diagnosis?
-- text: What is the definition of a database management system?
 pipeline_tag: text-classification
 inference: true
 model-index:
@@ -48,7 +54,7 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Sentence Transformer body:** [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
 - **Maximum Sequence Length:** 384 tokens
-- **Number of Classes:** 4 classes
 <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
 <!-- - **Language:** Unknown -->
 <!-- - **License:** Unknown -->
@@ -60,12 +66,10 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
-| Label         | Examples                                                                                                                                                                                                                            |
-|:--------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| very_semantic | <ul><li>'Quels sont les principes fondamentaux du corps humain?'</li><li>"Comment améliorer l'efficacité énergétique dans les bâtiments?"</li><li>'Combien de calories dans une pomme?'</li></ul>                                   |
-| very_lexical  | <ul><li>"Quelle est la capitale de l'Italie?"</li><li>"Qui est l'auteur de '1984'?"</li><li>'What is the current unemployment rate in France?'</li></ul>                                                                            |
-| semantic      | <ul><li>"Quels sont les avantages de l'apprentissage machine dans le secteur de la santé?"</li><li>'Comment puis-je optimiser les performances de mon site web?'</li><li>'What are the main challenges in cybersecurity?'</li></ul> |
-| lexical       | <ul><li>'Quel est le numéro de téléphone du service client ou du customer support?'</li><li>'Comment fonctionne la blockchain?'</li><li>'How can I reset my user password?'</li></ul>                                               |
 ## Evaluation
@@ -92,7 +96,7 @@ from setfit import SetFitModel
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("yaniseuranova/setfit-paraphrase-mpnet-base-v2-sst2")
 # Run inference
-preds = model("Who is the founder of Tesla Motors?")
 ```
 <!--
@@ -122,16 +126,14 @@ preds = model("Who is the founder of Tesla Motors?")
 ## Training Details
 ### Training Set Metrics
-| Training set | Min | Median | Max |
-|:-------------|:----|:-------|:----|
-| Word count   | 4   | 8.7667 | 15  |
-| Label         | Training Sample Count |
-|:--------------|:----------------------|
-| very_semantic | 39                    |
-| semantic      | 30                    |
-| lexical       | 26                    |
-| very_lexical  | 25                    |
 ### Training Hyperparameters
 - batch_size: (16, 16)
@@ -151,66 +153,18 @@ preds = model("Who is the founder of Tesla Motors?")
 - load_best_model_at_end: True
 ### Training Results
-| Epoch   | Step     | Training Loss | Validation Loss |
-|:-------:|:--------:|:-------------:|:---------------:|
-| 0.0015  | 1        | 0.3698        | -               |
-| 0.0749  | 50       | 0.2642        | -               |
-| 0.1497  | 100      | 0.2307        | -               |
-| 0.2246  | 150      | 0.1452        | -               |
-| 0.2994  | 200      | 0.0772        | -               |
-| 0.3743  | 250      | 0.0149        | -               |
-| 0.4491  | 300      | 0.0036        | -               |
-| 0.5240  | 350      | 0.0009        | -               |
-| 0.5988  | 400      | 0.0009        | -               |
-| 0.6737  | 450      | 0.0008        | -               |
-| 0.7485  | 500      | 0.0006        | -               |
-| 0.8234  | 550      | 0.0003        | -               |
-| 0.8982  | 600      | 0.0003        | -               |
-| 0.9731  | 650      | 0.0003        | -               |
-| 1.0     | 668      | -             | 0.0001          |
-| 1.0479  | 700      | 0.0002        | -               |
-| 1.1228  | 750      | 0.0002        | -               |
-| 1.1976  | 800      | 0.0002        | -               |
-| 1.2725  | 850      | 0.0003        | -               |
-| 1.3473  | 900      | 0.0003        | -               |
-| 1.4222  | 950      | 0.0001        | -               |
-| 1.4970  | 1000     | 0.0002        | -               |
-| 1.5719  | 1050     | 0.0002        | -               |
-| 1.6467  | 1100     | 0.0003        | -               |
-| 1.7216  | 1150     | 0.0001        | -               |
-| 1.7964  | 1200     | 0.0001        | -               |
-| 1.8713  | 1250     | 0.0002        | -               |
-| 1.9461  | 1300     | 0.0001        | -               |
-| 2.0     | 1336     | -             | 0.0001          |
-| 2.0210  | 1350     | 0.0001        | -               |
-| 2.0958  | 1400     | 0.0001        | -               |
-| 2.1707  | 1450     | 0.0002        | -               |
-| 2.2455  | 1500     | 0.0002        | -               |
-| 2.3204  | 1550     | 0.0001        | -               |
-| 2.3952  | 1600     | 0.0001        | -               |
-| 2.4701  | 1650     | 0.0002        | -               |
-| 2.5449  | 1700     | 0.0001        | -               |
-| 2.6198  | 1750     | 0.0001        | -               |
-| 2.6946  | 1800     | 0.0001        | -               |
-| 2.7695  | 1850     | 0.0001        | -               |
-| 2.8443  | 1900     | 0.0001        | -               |
-| 2.9192  | 1950     | 0.0001        | -               |
-| 2.9940  | 2000     | 0.0001        | -               |
-| 3.0     | 2004     | -             | 0.0             |
-| 3.0689  | 2050     | 0.0001        | -               |
-| 3.1437  | 2100     | 0.0001        | -               |
-| 3.2186  | 2150     | 0.0001        | -               |
-| 3.2934  | 2200     | 0.0001        | -               |
-| 3.3683  | 2250     | 0.0001        | -               |
-| 3.4431  | 2300     | 0.0001        | -               |
-| 3.5180  | 2350     | 0.0001        | -               |
-| 3.5928  | 2400     | 0.0001        | -               |
-| 3.6677  | 2450     | 0.0001        | -               |
-| 3.7425  | 2500     | 0.0001        | -               |
-| 3.8174  | 2550     | 0.0001        | -               |
-| 3.8922  | 2600     | 0.0001        | -               |
-| 3.9671  | 2650     | 0.0001        | -               |
-| **4.0** | **2672** | **-**         | **0.0**         |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions

 metrics:
 - accuracy
 widget:
+- text: What is the primary difference between homomorphic encryption and multi-party
+    computation in the context of secure multi-party computation protocols?
+- text: How do organizations balance the need for innovation with the potential risks
+    and unintended consequences of emerging technologies?
+- text: How doCompaniesbalanceIndividualCreativitywithTeamCollaboration to driveInnovationinthe
+    WORKPlace?
+- text: How do companies balance the need for innovation with the risk of disrupting
+    their existing business models?
+- text: What is the primary application of Natural Language Processing (NLP) in Google's
+    BERT language model, and how does it utilize masked language modeling to improve
+    contextual understanding?
 pipeline_tag: text-classification
 inference: true
 model-index:
 - **Sentence Transformer body:** [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
 - **Maximum Sequence Length:** 384 tokens
+- **Number of Classes:** 2 classes
 <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
 <!-- - **Language:** Unknown -->
 <!-- - **License:** Unknown -->
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
+| Label    | Examples                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
+|:---------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| semantic | <ul><li>'How do artificial intelligence systems navigate the trade-off between simplicity and accuracy when modeling complex real-world phenomena?'</li><li>'How do complex systems, consisting of many interconnected components, give rise to emergent properties that cannot be predicted from the characteristics of their individual parts?'</li><li>'How do complex systems, such as those found in nature and human societies, exhibit emergent properties that arise from the interactions of individual components?'</li></ul> |
+| lexical  | <ul><li>'What is the primary difference between a generative adversarial network (GAN) and a variational autoencoder (VAE) in deep learning?'</li><li>'What is the primary difference between a Decision Tree and a Random Forest in Machine Learning, and how do they alleviate overfitting?'</li><li>'What is the primary difference between a Bayesian neural network and a traditional feedforward neural network in the context of machine learning?'</li></ul>                                                                    |
 ## Evaluation
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("yaniseuranova/setfit-paraphrase-mpnet-base-v2-sst2")
 # Run inference
+preds = model("How doCompaniesbalanceIndividualCreativitywithTeamCollaboration to driveInnovationinthe WORKPlace?")
 ```
 <!--
 ## Training Details
 ### Training Set Metrics
+| Training set | Min | Median  | Max |
+|:-------------|:----|:--------|:----|
+| Word count   | 5   | 18.8511 | 32  |
+| Label    | Training Sample Count |
+|:---------|:----------------------|
+| lexical  | 23                    |
+| semantic | 24                    |
 ### Training Hyperparameters
 - batch_size: (16, 16)
 - load_best_model_at_end: True
 ### Training Results
+| Epoch   | Step    | Training Loss | Validation Loss |
+|:-------:|:-------:|:-------------:|:---------------:|
+| 0.0139  | 1       | 0.2662        | -               |
+| 0.6944  | 50      | 0.0007        | -               |
+| 1.0     | 72      | -             | 0.0003          |
+| 1.3889  | 100     | 0.0004        | -               |
+| 2.0     | 144     | -             | 0.0001          |
+| 2.0833  | 150     | 0.0003        | -               |
+| 2.7778  | 200     | 0.0002        | -               |
+| 3.0     | 216     | -             | 0.0001          |
+| 3.4722  | 250     | 0.0002        | -               |
+| **4.0** | **288** | **-**         | **0.0001**      |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "checkpoints/step_2672",
   "architectures": [
     "MPNetModel"
   ],

 {
+  "_name_or_path": "checkpoints/step_288",
   "architectures": [
     "MPNetModel"
   ],

config_setfit.json CHANGED Viewed

@@ -1,9 +1,7 @@
 {
   "normalize_embeddings": false,
   "labels": [
-    "very_semantic",
-    "semantic",
     "lexical",
-    "very_lexical"
   ]
 }

 {
   "normalize_embeddings": false,
   "labels": [
     "lexical",
+    "semantic"
   ]
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e383b67c5d576d05d2fcff25a7c9988abf5d98876540eebb026b38637f51f3bc
 size 437967672

 version https://git-lfs.github.com/spec/v1
+oid sha256:8dedbddc75ebb08be5ba7197043ab354aa2000a2466f382af22d7a93a7995589
 size 437967672

model_head.pkl CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d3224952c00d1183b6f7ededc25e464c290820fedd782232512f62f766ebb24b
-size 25655

 version https://git-lfs.github.com/spec/v1
+oid sha256:e9620192f6f9c9c643e965c1aa1dec6d39196685b3d03c44e788dd075ab17785
+size 7039