Voice Engine: OpenAI’s Voice Synthesis Model

1 April 2024

OpenAI has unveiled Voice Engine, a model capable of voice cloning from a 15-second audio recording. Among the users of the model, the company mentions podcasters, announcers, audiobook authors, advertisers,…

AudioPaLM: Google’s Multimodal Model for Voice Translation

29 June 2023

Google has introduced AudioPaLM, a large language model for speech processing and generation that combines two Google language models, PaLM-2 and AudioLM, into a multimodal architecture. The model can recognize…