luxdelux7 commited on
Commit
46f7c19
·
verified ·
1 Parent(s): aece731

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -120
README.md CHANGED
@@ -24,126 +24,6 @@ Made for the **Forbidden Vision** ComfyUI custom nodes
24
  </a>
25
  </div>
26
 
27
- ---
28
-
29
- ## 🎯 Why These Models Exist
30
-
31
- Standard face detection models are optimized for narrow use cases and struggle in generative AI workflows. These models address four specific failure modes:
32
-
33
- | **Problem** | **Why It Matters** |
34
- |-------------|-------------------|
35
- | 🎨 **Domain-locked** | Existing models excel at *either* anime *or* realistic — never both |
36
- | 🖼️ **Distribution mismatch** | Models trained on clean photography break on AI-generated imagery |
37
- | 👁️‍🗨️ **Detail blindness** | Most models miss stylized features like anime eyebrows, realistic eyelashes, etc. |
38
- | 🎲 **Generation artifacts** | Standard datasets don't include diffusion model quirks and failure modes |
39
-
40
- **These models solve all 4.**
41
-
42
- <div align="center">
43
- <img src="./images/masks.webp" alt="Mask Example" style="border-radius: 6px; box-shadow: 0 0 12px rgba(0,0,0,0.1);">
44
- <p><em>The segmentation model predicts face masks including stylistic features like eyebrows and eyelashes.</em></p>
45
- </div>
46
-
47
- ---
48
-
49
- ## 📊 Training Foundation
50
-
51
- ### The Dataset Difference
52
-
53
- Built from **14,000+ manually annotated images** spanning the full range of domains encountered in real generative AI workflows:
54
-
55
- <table>
56
- <tr>
57
- <td width="50%">
58
-
59
- **🎨 Multi-Domain Coverage**
60
- - SDXL, SD1.5, Pony, Illustrious outputs
61
- - Curated Danbooru (anime styles)
62
- - Real photography
63
- - Unfiltered image distributions across all content ratings
64
-
65
- </td>
66
- <td width="50%">
67
-
68
- **💎 Edge Case Priority**
69
- - ✓ Extreme angles & occlusions
70
- - ✓ Failed/broken generations
71
- - ✓ Low-quality artifacts
72
- - ✓ Unusual expressions & poses
73
- - ✓ Edge cases other models ignore
74
-
75
- </td>
76
- </tr>
77
- </table>
78
-
79
- ### What This Means For You
80
- ```
81
- Traditional models: Trained on clean celebrity faces
82
-
83
- Fail on real workflows
84
-
85
- These models: Trained on what you actually generate
86
-
87
- Work when you need them
88
- ```
89
-
90
- **One model family. Every domain. Zero compromises.**
91
-
92
- ---
93
-
94
- ## Model Details
95
-
96
- ### Face Detection (YOLOv11-Small)
97
-
98
- **Purpose:** Primary face detection with high recall across mixed domains
99
-
100
- **Training Approach:**
101
- - Iterative hard-mining pipeline: after each training run, the model was evaluated on a new mixed dataset; failures were collected, corrected, and folded back into training until acceptable performance was reached
102
- - Trained at 640px resolution — inference should use the same resolution
103
-
104
- **Why YOLOv11-Small instead of nano?**
105
- More reliable detection across mixed realistic/anime domains with an acceptable speed tradeoff.
106
-
107
- ---
108
-
109
- ### Segmentation (EfficientNetV2-S)
110
-
111
- **Purpose:** Precise face mask generation
112
-
113
- **Training Approach:**
114
- - Initial dataset prepared using the Forbidden Vision YOLO model at 512px resolution
115
- - Multi-phase iterative hard-mining:
116
- 1. Train on initial 700 samples
117
- 2. Evaluate on held-out images to surface failure cases
118
- 3. Correct failed masks and expand the dataset
119
- 4. Retrain on expanded dataset
120
- 5. Repeat until failure rate approaches zero
121
- - Final dataset: 4,000+ images
122
-
123
- **Features:**
124
- - Captures stylized facial features often missed by standard models: protruding anime eyebrows, realistic eyelashes extending beyond the face boundary, etc.
125
- - Treats accessories like glasses as part of the face region, even when they extend outside the face shape
126
- - Robust across anime, realistic, and 3D rendering styles — including content ratings that cause other models to fail
127
-
128
- ---
129
-
130
- ## Usage
131
-
132
- These models are automatically downloaded and used by the **Fixer** node in ComfyUI Forbidden Vision. No manual setup required.
133
-
134
- ---
135
-
136
- ## Intended Use
137
-
138
- These models are designed for use in generative AI post-processing pipelines — specifically face detection and masking within ComfyUI workflows. They are not intended for surveillance, biometric identification, or any application involving real individuals without consent.
139
-
140
- ---
141
-
142
- ## License
143
-
144
- Apache 2.0
145
-
146
- ---
147
 
148
  ## Contact
149
 
 
24
  </a>
25
  </div>
26
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
 
28
  ## Contact
29