EzAudio: Open Source Hyperrealistic Text-to-Audio Model

19 September 2024
ezaudio text-to-audio model generation ai

EzAudio: Open Source Hyperrealistic Text-to-Audio Model

EzAudio, a new transformer-based text-to-audio (T2A) diffusion model developed by researchers from Tencent AI Lab and Johns Hopkins University. EzAudio addresses key challenges in T2A generation, including generation quality, computational…