Update README.md
Browse files
README.md
CHANGED
|
@@ -23,15 +23,16 @@ Welcome to the official repository for the Z-Image(造相)project!
|
|
| 23 |
|
| 24 |
## 🎨 Z-Image
|
| 25 |
|
| 26 |
-
**Z-Image** is the foundation model behind Z-Image-Turbo, designed for high-quality image generation with strong controllability, broad stylistic coverage, and support for downstream development.
|
|
|
|
| 27 |
|
| 28 |
### 🌟 Key Features
|
| 29 |
|
| 30 |
-
#### 🎨
|
| 31 |
Z-Image supports a wide range of aesthetics and artistic styles, including realistic photography, anime, illustration, digital art, and stylized visuals.
|
| 32 |
It is suitable for creative scenarios that require rich stylistic expression rather than a single preferred aesthetic.
|
| 33 |
|
| 34 |
-
####
|
| 35 |
Z-Image emphasizes diversity across multiple generative dimensions:
|
| 36 |
- Variations in facial identity, body pose, composition, and layout across different seeds
|
| 37 |
- Distinct appearances for individuals in multi-person scenes
|
|
@@ -51,11 +52,11 @@ Z-Image responds strongly to negative prompts, enabling reliable suppression of
|
|
| 51 |
|
| 52 |
| Aspect | Z-Image | Z-Image-Turbo |
|
| 53 |
|------|------|------|
|
| 54 |
-
| CFG
|
| 55 |
-
|
|
| 56 |
-
|
|
| 57 |
-
| Negative
|
| 58 |
-
|
|
| 59 |
|
| 60 |
## 🚀 Quick Start
|
| 61 |
|
|
@@ -78,7 +79,7 @@ HF_XET_HIGH_PERFORMANCE=1 hf download Tongyi-MAI/Z-Image
|
|
| 78 |
- **Guidance scale:** 3.0 – 5.0
|
| 79 |
- **Inference steps:** 28 – 50
|
| 80 |
- **Negative prompts:** Strongly recommended for better control
|
| 81 |
-
- **CFG normalization:** `False` for general stylism, `True` for
|
| 82 |
|
| 83 |
### Usage Example
|
| 84 |
|
|
@@ -95,7 +96,7 @@ pipe = ZImagePipeline.from_pretrained(
|
|
| 95 |
pipe.to("cuda")
|
| 96 |
|
| 97 |
# Generate image
|
| 98 |
-
prompt = "两名年轻亚裔女性紧密站在一起,背景为朴素的灰色纹理墙面,可能是室内地毯地面。左侧女性留着长卷发,身穿藏青色毛衣,左袖有奶油色褶皱装饰,内搭白色立领衬衫,下身白色裤子;佩戴小巧金色耳钉,双臂交叉于背后。右侧女性留直肩长发,身穿奶油色卫衣,胸前印有“
|
| 99 |
negative_prompt = "" # Optional, but would be powerful when you want to remove some unwanted content
|
| 100 |
|
| 101 |
image = pipe(
|
|
@@ -103,9 +104,9 @@ image = pipe(
|
|
| 103 |
negative_prompt=negative_prompt,
|
| 104 |
height=1280,
|
| 105 |
width=720,
|
| 106 |
-
cfg_normalization=
|
| 107 |
num_inference_steps=50,
|
| 108 |
-
guidance_scale=4
|
| 109 |
generator=torch.Generator("cuda").manual_seed(42),
|
| 110 |
).images[0]
|
| 111 |
|
|
|
|
| 23 |
|
| 24 |
## 🎨 Z-Image
|
| 25 |
|
| 26 |
+
**Z-Image** is the foundation model behind Z-Image-Turbo, designed for high-quality image generation with strong controllability, broad stylistic coverage, and support for downstream development.
|
| 27 |
+
It serves as the primary community model in the ⚡️- Image family, while Z-Image-Turbo focuses on high-speed inference.
|
| 28 |
|
| 29 |
### 🌟 Key Features
|
| 30 |
|
| 31 |
+
#### 🎨 Aesthetics
|
| 32 |
Z-Image supports a wide range of aesthetics and artistic styles, including realistic photography, anime, illustration, digital art, and stylized visuals.
|
| 33 |
It is suitable for creative scenarios that require rich stylistic expression rather than a single preferred aesthetic.
|
| 34 |
|
| 35 |
+
#### 🌈 Diversity
|
| 36 |
Z-Image emphasizes diversity across multiple generative dimensions:
|
| 37 |
- Variations in facial identity, body pose, composition, and layout across different seeds
|
| 38 |
- Distinct appearances for individuals in multi-person scenes
|
|
|
|
| 52 |
|
| 53 |
| Aspect | Z-Image | Z-Image-Turbo |
|
| 54 |
|------|------|------|
|
| 55 |
+
| CFG | ✅ | ❌ |
|
| 56 |
+
| Steps | 50 | 8 |
|
| 57 |
+
| Fintunablity | ✅ | ❌ |
|
| 58 |
+
| Negative Prompting | ✅ | ❌ |
|
| 59 |
+
| Diversity | High | Low |
|
| 60 |
|
| 61 |
## 🚀 Quick Start
|
| 62 |
|
|
|
|
| 79 |
- **Guidance scale:** 3.0 – 5.0
|
| 80 |
- **Inference steps:** 28 – 50
|
| 81 |
- **Negative prompts:** Strongly recommended for better control
|
| 82 |
+
- **CFG normalization:** `False` for general stylism, `True` for realism
|
| 83 |
|
| 84 |
### Usage Example
|
| 85 |
|
|
|
|
| 96 |
pipe.to("cuda")
|
| 97 |
|
| 98 |
# Generate image
|
| 99 |
+
prompt = "两名年轻亚裔女性紧密站在一起,背景为朴素的灰色纹理墙面,可能是室内地毯地面。左侧女性留着长卷发,身穿藏青色毛衣,左袖有奶油色褶皱装饰,内搭白色立领衬衫,下身白色裤子;佩戴小巧金色耳钉,双臂交叉于背后。右侧女性留直肩长发,身穿奶油色卫衣,胸前印有“Tun the tables”字样,下方为“New ideas”,搭配白色裤子;佩戴银色小环耳环,双臂交叉于胸前。两人均面带微笑直视镜头。照片,自然光照明,柔和阴影,以藏青、奶油白为主的中性色调,休闲时尚摄影,中等景深,面部和上半身对焦清晰,姿态放松,表情友好,室内环境,地毯地面,纯色背景。"
|
| 100 |
negative_prompt = "" # Optional, but would be powerful when you want to remove some unwanted content
|
| 101 |
|
| 102 |
image = pipe(
|
|
|
|
| 104 |
negative_prompt=negative_prompt,
|
| 105 |
height=1280,
|
| 106 |
width=720,
|
| 107 |
+
cfg_normalization=False,
|
| 108 |
num_inference_steps=50,
|
| 109 |
+
guidance_scale=4,
|
| 110 |
generator=torch.Generator("cuda").manual_seed(42),
|
| 111 |
).images[0]
|
| 112 |
|