Voice Models - Search News

4don MSN

Microsoft takes on AI rivals with three new foundational models

MAI released models that can transcribe voice into text as well as generate audio and images after the group's formation six ...

3d

Microsoft released 3 new AI models, ramping up competition with its close partner, OpenAI

Microsoft AI has made its in-house models for transcription, speech recognition, and image generation available on Foundry.

11don MSN

Cohere launches an open-source voice model specifically for transcription

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self ...

6don MSN

Speechify’s Windows app uses local models for transcription and dictation

Speechify just launched a native Windows app that employs locally stored models to enable dictation and transcription across ...

4d

Microsoft launches new high-speed voice and image models

Microsoft says that MAI-Image-2 is at least twice as fast as its previous-generation image generator. The second new model ...

CNET on MSN

Microsoft's New AI Models Go Beyond Just Text

Microsoft's New AI Models Go Beyond Just Text ...

The Financial Express

‘Voice cloning opens up a wide range of use cases’ Q&A with Gnani AI cofounder Ganesh Gopalan

Voice AI startup raises $10 million to expand globally, invest in R&D and build advanced multilingual AI models under India’s ...

17d

Scale AI launches Voice Showdown, the first real-world benchmark for voice AI — and the results are humbling for some top models

The results, drawn from thousands of spontaneous voice conversations across more than 60 languages, reveal capability gaps ...

4d

Microsoft shivs OpenAI with three new AI models for speech and images

OpenAI just happens to offer its own speech recognition, speech generation, and text-to-image models. Microsoft's models are available through Foundry (formerly Azure AI Studio), a platform to develop ...

Qwen 3.5 Omni: Alibaba’s AI Model Can Now Hear, Watch, and Clone Your Voice

Alibaba’s Qwen 3.5 Omni brings true real-time omnimodal AI to the frontier race: voice cloning, 10-hour audio, real-time ...

Analytics Insight

Mistral’s Open-Source Voice Model Sparks New AI Assistant Rivalry

Mistral AI has made a move that has surprised the AI world. The French startup has released a new open-source voice model.

11d

Mistral releases an open-weights ‘speaking’ AI model with Voxtral TTS

The Paris-based Mistral AI SAS today announced the release of Voxtral TTS, its first text-to-speech artificial intelligence ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results