QJerry committed
Commit 7637c84 · verified · 1 Parent(s): 7015346

Update README.md

Files changed (1)
  1. README.md +13 -12
README.md CHANGED
@@ -23,15 +23,16 @@ Welcome to the official repository for the Z-Image(造相)project!
  
  ## 🎨 Z-Image
  
- **Z-Image** is the foundation model behind Z-Image-Turbo, designed for high-quality image generation with strong controllability, broad stylistic coverage, and support for downstream development. It serves as the primary community model in the ⚡️- Image family, while Z-Image-Turbo focuses on high-speed inference.
+ **Z-Image** is the foundation model behind Z-Image-Turbo, designed for high-quality image generation with strong controllability, broad stylistic coverage, and support for downstream development.
+ It serves as the primary community model in the ⚡️- Image family, while Z-Image-Turbo focuses on high-speed inference.
  
  ### 🌟 Key Features
  
- #### 🎨 Aesthetic & Artistic Diversity
+ #### 🎨 Aesthetics
  Z-Image supports a wide range of aesthetics and artistic styles, including realistic photography, anime, illustration, digital art, and stylized visuals.
  It is suitable for creative scenarios that require rich stylistic expression rather than a single preferred aesthetic.
  
- #### 🧬 Generative Diversity
+ #### 🌈 Diversity
  Z-Image emphasizes diversity across multiple generative dimensions:
  - Variations in facial identity, body pose, composition, and layout across different seeds
  - Distinct appearances for individuals in multi-person scenes
@@ -51,11 +52,11 @@ Z-Image responds strongly to negative prompts, enabling reliable suppression of
  
  | Aspect | Z-Image | Z-Image-Turbo |
  |------|------|------|
- | CFG support | Yes | No |
- | Fine-tuning | Yes | Limited |
- | Aesthetic diversity | High | Reduced |
- | Negative prompt control | Strong | None |
- | Inference speed | Slower | Faster |
+ | CFG | | |
+ | Steps | 50 | 8 |
+ | Fintunablity | | |
+ | Negative Prompting | | |
+ | Diversity | High | Low |
  
  ## 🚀 Quick Start
  
@@ -78,7 +79,7 @@ HF_XET_HIGH_PERFORMANCE=1 hf download Tongyi-MAI/Z-Image
  - **Guidance scale:** 3.0 – 5.0
  - **Inference steps:** 28 – 50
  - **Negative prompts:** Strongly recommended for better control
- - **CFG normalization:** `False` for general stylism, `True` for more realism
+ - **CFG normalization:** `False` for general stylism, `True` for realism
  
  ### Usage Example
  
@@ -95,7 +96,7 @@ pipe = ZImagePipeline.from_pretrained(
  pipe.to("cuda")
  
  # Generate image
- prompt = "两名年轻亚裔女性紧密站在一起,背景为朴素的灰色纹理墙面,可能是室内地毯地面。左侧女性留着长卷发,身穿藏青色毛衣,左袖有奶油色褶皱装饰,内搭白色立领衬衫,下身白色裤子;佩戴小巧金色耳钉,双臂交叉于背后。右侧女性留直肩长发,身穿奶油色卫衣,胸前印有“Tunthetables”字样,下方为“New ideas”,搭配白色裤子;佩戴银色小环耳环,双臂交叉于胸前。两人均面带微笑直视镜头。照片,自然光照明,柔和阴影,以藏青、奶油白为主的中性色调,休闲时尚摄影,中等景深,面部和上半身对焦清晰,姿态放松,表情友好,室内环境,地毯地面,纯色背景。"
+ prompt = "两名年轻亚裔女性紧密站在一起,背景为朴素的灰色纹理墙面,可能是室内地毯地面。左侧女性留着长卷发,身穿藏青色毛衣,左袖有奶油色褶皱装饰,内搭白色立领衬衫,下身白色裤子;佩戴小巧金色耳钉,双臂交叉于背后。右侧女性留直肩长发,身穿奶油色卫衣,胸前印有“Tun the tables”字样,下方为“New ideas”,搭配白色裤子;佩戴银色小环耳环,双臂交叉于胸前。两人均面带微笑直视镜头。照片,自然光照明,柔和阴影,以藏青、奶油白为主的中性色调,休闲时尚摄影,中等景深,面部和上半身对焦清晰,姿态放松,表情友好,室内环境,地毯地面,纯色背景。"
  negative_prompt = "" # Optional, but would be powerful when you want to remove some unwanted content
  
  image = pipe(
@@ -103,9 +104,9 @@ image = pipe(
      negative_prompt=negative_prompt,
      height=1280,
      width=720,
-     cfg_normalization=True,
+     cfg_normalization=False,
      num_inference_steps=50,
-     guidance_scale=4.0,
+     guidance_scale=4,
      generator=torch.Generator("cuda").manual_seed(42),
  ).images[0]
  
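
The recommended parameters in the README (guidance scale 3.0 – 5.0, 28 – 50 inference steps, a non-empty negative prompt) can be checked before calling the pipeline. The following is a minimal sketch under stated assumptions, not part of this commit or of any library: `ZImageSettings` and `check_settings` are hypothetical names introduced only for illustration, and the ranges are copied from the README's parameter list.

```python
from dataclasses import dataclass
from typing import List

# Ranges copied from the README's recommended-parameter list.
GUIDANCE_RANGE = (3.0, 5.0)
STEPS_RANGE = (28, 50)


@dataclass
class ZImageSettings:
    """Hypothetical bundle of generation settings (not part of any library)."""

    guidance_scale: float = 4.0
    num_inference_steps: int = 50
    cfg_normalization: bool = False  # README: False for general stylism, True for realism
    negative_prompt: str = ""


def check_settings(settings: ZImageSettings) -> List[str]:
    """Return one warning per setting that departs from the README's advice."""
    warnings = []
    low, high = GUIDANCE_RANGE
    if not low <= settings.guidance_scale <= high:
        warnings.append(f"guidance_scale={settings.guidance_scale} is outside {low}-{high}")
    low, high = STEPS_RANGE
    if not low <= settings.num_inference_steps <= high:
        warnings.append(f"num_inference_steps={settings.num_inference_steps} is outside {low}-{high}")
    if not settings.negative_prompt.strip():
        warnings.append("negative_prompt is empty; the README strongly recommends one")
    return warnings


if __name__ == "__main__":
    # Values from the updated usage example: both numbers are in range,
    # but the negative prompt is left empty.
    for message in check_settings(ZImageSettings(guidance_scale=4, num_inference_steps=50)):
        print(message)
```

With the values from the updated usage example, the only warning concerns the empty negative prompt; the numeric settings fall inside the recommended ranges.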