/ | image | video |
text | text2image | text2video |
image | image2image | image2video |
video | / | video2video |
/ | audio | music | speak |
text | text2audio | text2music | text2speech |
/ | text |
text | text2text |
image | captioning |
/ | model | pose | motion |
text | text2model | text2pose | text2motion |
image | image2model |
any2any | text | image | video | audio | music | speak | model | pose | motion | neuroimaging |
text | text2text | text2image | text2video | text2audio | text2music | text2speech(TTS) | text2model | text2pose | text2motion | |
image | image2text | image2image | image2video | image2audio | image2music | image2speak | image2model | image2pose | ←次元が違う | |
video | video2text | 切り出すだけ? | video2video | video2audio | video2Music | video2model | 次元が違う→ | video2motion | ||
audio | audio2text | |||||||||
music | music2text | music2motion | ||||||||
speak | speech2text | speech2speech | ||||||||
model | ||||||||||
pose | ||||||||||
motion | ||||||||||
neuroimaging | neuroimaging2image |