Update README.md
README.md
CHANGED
@@ -42,7 +42,21 @@ license: apache-2.0
# PixArt-900M-DiT v0.1

-This is
+This is an experimental 900M-parameter DiT model, fully fine-tuned on a large-scale, high-quality image dataset aligned with human preferences. It uses the same VAE and text encoder as PixArt-Sigma, so you only need to swap in the transformer.
+
+# How to use it?
+```python
+import torch
+from diffusers import AutoencoderKL, PixArtSigmaPipeline, Transformer2DModel
+
+weight_dtype = torch.bfloat16
+device = torch.device("cuda")
+MODEL_ID = "PixArt-alpha/pixart_sigma_sdxlvae_T5_diffusers"
+
+# Load the 900M transformer and swap it into the PixArt-Sigma pipeline.
+transformer = Transformer2DModel.from_pretrained("Owen777/pixart-900m", torch_dtype=weight_dtype)
+pipe = PixArtSigmaPipeline.from_pretrained(MODEL_ID, transformer=transformer, torch_dtype=weight_dtype)
+pipe.vae = AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix", torch_dtype=weight_dtype)
+pipe.to(device)  # moves the transformer, text encoder, and VAE to the GPU
+
+# The pipeline returns an output object; `.images` is a list of PIL images.
+image = pipe("a cute cat with pure white background.", guidance_scale=3.5, num_inference_steps=30, height=1024, width=1024).images[0]
+image.save("result.jpeg")
+```

# Credits