/work4ai/StyleTTS 2 - Scrapbox Reader

generated at 2/17/2025, 5:33:07 PM
StyleTTS 2
https://styletts2.github.io/Project
https://arxiv.org/abs/2306.07691StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
https://github.com/yl4579/StyleTTS2yl4579/StyleTTS2
Demo https://huggingface.co/spaces/styletts2/styletts2
>style diffusionとlarge speech language models (SLMs)による敵対的学習を活用し、人間レベルのTTS合成を実現するtext-to-speech(TTS)モデルであるStyleTTS 2を紹介する
>large speech language modelsは、敵対的学習において識別器として用いられ、音声言語モデルの知識を音声生成タスクに転移することで、音声の自然さを向上させる。

StyleTTS