/work4ai/PIXART-α - Scrapbox Reader

generated at 2/17/2025, 5:36:22 PM
PIXART-α

https://github.com/PixArt-alpha/PixArt-alphaPixArt-alpha/PixArt-alpha
Demohttps://huggingface.co/spaces/PixArt-alpha/PixArt-alpha
https://pixart-alpha.github.io/Project
https://arxiv.org/abs/2310.00426PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
https://huggingface.co/PixArt-alpha/PixArt-XL-2-1024-MSPixArt-alpha/PixArt-XL-2-1024-MS
Diffusion Transformer
Transformerベースの画像生成モデル
データセット
LLaVAを使ってキャプションをつける
LAION、Segment Anything、Internal
>Internalは美的に見える画像のデータセット
学習コストとCO2排出量は正の相関にある
PIXART-αの学習コストはGigaGANの11.1%、RAPHAELと比べると0.85%で済む
ほへー

スクラッチで作る場合2500万枚必要
1024×1024でA100 15000GPUhours
Stable Diffusion 2を0から作るコストは16万ドルから3万ドルまで下がった？


#画像生成モデル