💨 Abstract

Microsoft AI released three new models: MAI-Transcribe-1 for speech-to-text in 25 languages, MAI-Voice-1 for audio generation, and MAI-Image-2 for video generation. These models are part of Microsoft's push to develop its own AI stack and compete with rivals like Google and OpenAI. The models are available on Microsoft Foundry and MAI Playground, with pricing lower than competitors.

Courtesy: Rebecca Szkutak