Microsoft Releases New AI Models That Can Generate Images, Audio and Transcribe Text

Apr 3, 2026 - 14:00
Microsoft Releases New AI Models That Can Generate Images, Audio and Transcribe Text
Microsoft released three specialised artificial intelligence (AI) models on Thursday, focusing on image generation, voice generation, and speech-to-text transcription. The Redmond-based tech giant claims that these models outperform specialised models from rival companies, such as Google, OpenAI, and others. The models, MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2, are also said to focus on fast generation and competitive pricing.