Improve model card: add `image-to-image` pipeline tag, `transformers` library, and paper link (#2)
Browse files- Improve model card: add `image-to-image` pipeline tag, `transformers` library, and paper link (23fc97c54ff6f231eaaa6ec2f3844f38ca8d0108)
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md
CHANGED
|
@@ -1,15 +1,19 @@
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
license: mit
|
|
|
|
| 3 |
tags:
|
| 4 |
- image-editing
|
| 5 |
- HiDream.ai
|
| 6 |
-
|
| 7 |
-
- en
|
| 8 |
-
pipeline_tag: any-to-any
|
| 9 |
-
base_model:
|
| 10 |
-
- FoundationVision/Infinity
|
| 11 |
---
|
| 12 |
-
|
|
|
|
|
|
|
|
|
|
| 13 |
|
| 14 |

|
| 15 |
|
|
@@ -20,17 +24,17 @@ Try our online demos: [π€VAREdit-8B-1024](https://huggingface.co/spaces/HiDrea
|
|
| 20 |
|
| 21 |
## π Key Features
|
| 22 |
|
| 23 |
-
-
|
| 24 |
-
-
|
| 25 |
-
-
|
| 26 |

|
| 27 |
|
| 28 |
## π Model Variants
|
| 29 |
|
| 30 |
-
| Model Variant
|
| 31 |
-
|
| 32 |
-
| VAREdit-8B-512
|
| 33 |
-
| VAREdit-8B-1024
|
| 34 |
|
| 35 |
## π Quick Start
|
| 36 |
|
|
@@ -43,18 +47,18 @@ Before starting, ensure you have:
|
|
| 43 |
|
| 44 |
### Installation
|
| 45 |
|
| 46 |
-
1.
|
| 47 |
```bash
|
| 48 |
git clone https://github.com/HiDream-ai/VAREdit.git
|
| 49 |
cd VAREdit
|
| 50 |
```
|
| 51 |
|
| 52 |
-
2.
|
| 53 |
```bash
|
| 54 |
pip install -r requirements.txt
|
| 55 |
```
|
| 56 |
|
| 57 |
-
3.
|
| 58 |
|
| 59 |
Download the VAREdit model checkpoints:
|
| 60 |
```bash
|
|
@@ -91,7 +95,7 @@ edited_image = generate_image(
|
|
| 91 |
### Model Sampling Parameters
|
| 92 |
|
| 93 |
| Parameter | Description | Default |
|
| 94 |
-
|
| 95 |
| `cfg` | Classifier-free guidance scale | 3.0 |
|
| 96 |
| `tau` | Temperature for sampling | 0.1 |
|
| 97 |
| `seed` | Random seed for reproducibility | -1 (random) |
|
|
|
|
| 1 |
---
|
| 2 |
+
base_model:
|
| 3 |
+
- FoundationVision/Infinity
|
| 4 |
+
language:
|
| 5 |
+
- en
|
| 6 |
license: mit
|
| 7 |
+
pipeline_tag: image-to-image
|
| 8 |
tags:
|
| 9 |
- image-editing
|
| 10 |
- HiDream.ai
|
| 11 |
+
library_name: transformers
|
|
|
|
|
|
|
|
|
|
|
|
|
| 12 |
---
|
| 13 |
+
|
| 14 |
+
# VAREdit: Visual Autoregressive Modeling for Instruction-Guided Image Editing
|
| 15 |
+
|
| 16 |
+
[π Paper](https://huggingface.co/papers/2508.15772)
|
| 17 |
|
| 18 |

|
| 19 |
|
|
|
|
| 24 |
|
| 25 |
## π Key Features
|
| 26 |
|
| 27 |
+
- **Strong Instruction Follow**: Follows instructions more accurately due to the autoregressive nature of the model.
|
| 28 |
+
- **Efficient Inference**: Optimized for fast generation with less than 1 seconds for 8B model.
|
| 29 |
+
- **Flexible Resolution**: Supports 512Γ512 and 1024Γ1024 image resolutions
|
| 30 |

|
| 31 |
|
| 32 |
## π Model Variants
|
| 33 |
|
| 34 |
+
| Model Variant | Resolutions | HuggingFace Model | Time (H800) | VRAM (GB) |
|
| 35 |
+
|:--------------|:------------|:---------------------------------------------------------------------------------|:----------|:----------|
|
| 36 |
+
| VAREdit-8B-512 | 512Γ512 | [VAREdit-8B-512](https://huggingface.co/HiDream-ai/VAREdit) | ~0.7s | 50.41 |
|
| 37 |
+
| VAREdit-8B-1024 | 1024Γ1024 | [VAREdit-8B-1024](https://huggingface.co/HiDream-ai/VAREdit) | ~1.99s | 50.41 |
|
| 38 |
|
| 39 |
## π Quick Start
|
| 40 |
|
|
|
|
| 47 |
|
| 48 |
### Installation
|
| 49 |
|
| 50 |
+
1. **Clone the repository**
|
| 51 |
```bash
|
| 52 |
git clone https://github.com/HiDream-ai/VAREdit.git
|
| 53 |
cd VAREdit
|
| 54 |
```
|
| 55 |
|
| 56 |
+
2. **Install dependencies**
|
| 57 |
```bash
|
| 58 |
pip install -r requirements.txt
|
| 59 |
```
|
| 60 |
|
| 61 |
+
3. **Download model checkpoints**
|
| 62 |
|
| 63 |
Download the VAREdit model checkpoints:
|
| 64 |
```bash
|
|
|
|
| 95 |
### Model Sampling Parameters
|
| 96 |
|
| 97 |
| Parameter | Description | Default |
|
| 98 |
+
|:----------|:------------|:--------|
|
| 99 |
| `cfg` | Classifier-free guidance scale | 3.0 |
|
| 100 |
| `tau` | Temperature for sampling | 0.1 |
|
| 101 |
| `seed` | Random seed for reproducibility | -1 (random) |
|