Owen777 commited on
Commit
0392506
·
verified ·
1 Parent(s): 9b37ce0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -1
README.md CHANGED
@@ -42,7 +42,21 @@ license: apache-2.0
42
 
43
  # PixArt-900M-DiT v0.1
44
 
45
- This is a 900M DiT model with Full finetuning on Million-Level High-quality, human-preferred images.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
46
 
47
 
48
  # Credits
 
42
 
43
  # PixArt-900M-DiT v0.1
44
 
45
+ This is an experimental 900M DiT model that has been fully fine-tuned on a large-scale, high-quality dataset of images aligned with human preferences. It uses the same VAE and text encoder as PixArt-Sigma, so you only need to replace the transformer.
46
+
47
+ # How to use it?
48
+ ```python
49
+ weight_dtype = torch.bfloat16
50
+ device = torch.device("cuda")
51
+ MODEL_ID = "PixArt-alpha/pixart_sigma_sdxlvae_T5_diffusers"
52
+ transformer = Transformer2DModel.from_pretrained("Owen777/pixart-900m", torch_dtype=weight_dtype)
53
+ pipe = PixArtSigmaPipeline.from_pretrained(MODEL_ID, transformer=transformer, torch_dtype=weight_dtype)
54
+ pipe.vae = AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix", torch_dtype=weight_dtype).cuda()
55
+ pipe.to(device)
56
+ image = pipe("a cute cat with pure white background.",guidance_scale=3.5,num_inference_steps=30,height=1024,width=102)[0]
57
+ image.save("result.jpeg")
58
+
59
+ ```
60
 
61
 
62
  # Credits