Mini-Omni: Open-Source Model for Real-Time Speech Interaction
2 September 2024
Most academic language models still rely on external text-to-speech (TTS) systems to produce audio, which adds undesirable latency to spoken responses. To address this, the Mini-Omni model introduces an audio-based, end-to-end conversational capability that…