Open source TTS models Kokoro, Orpheus, and Piper are tested on symbols, abbreviations, and prosody with CER and MOS results.
Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.
Kokoro 82M is an 82-million-parameter text-to-speech model that beats many TTS APIs while running locally on CPUs, including ...
A two-person startup by the name of Nari Labs has introduced Dia, a 1.6 billion parameter text-to-speech (TTS) model designed to produce naturalistic dialogue directly from text prompts — and one of ...
The enterprise voice AI market is in the middle of a land grab. ElevenLabs and IBM announced a collaboration just this week to bring premium voice capabilities into IBM's watsonx Orchestrate platform.
OpenAI Gives Its Agents a Voice – Now a ‘Medieval Knight’ Can Read Your Work Emails: The text-to-speech and speech-to-text tools are all based on GPT-4o. OpenAI hinted it may ...
The AI company ElevenLabs has launched a new text-to-speech model called Turbo 2.5. It introduces support for three new languages: Vietnamese, Hungarian, and Norwegian. The API is available too. The ...
Microsoft announced MAI-Transcribe-1, a new speech-to-text model, and made its in-house MAI-Voice-1 and MAI-Image-2 models ...
Discover how voice AI is transforming customer interactions in the banking, financial services, and insurance (BFSI) sector. Learn the latest trends and best practices to ...
Overview: Multilingual text-to-speech tools now deliver natural voices with emotion and a realistic human-like tone. Platforms ...