Microsoft Releases New AI Models for Speech, Audio and Video Generation

April 2, 2026

Microsoft has introduced three new multimodal AI models—MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2—marking an expansion of its in-house AI development as it builds out its own model stack alongside its partnership with OpenAI.

The models, announced Thursday by Microsoft AI, span speech-to-text, voice generation, and visual content creation. MAI-Transcribe-1 supports transcription across 25 languages and, according to the company, operates 2.5 times faster than its Azure Fast alternative. MAI-Voice-1 can generate up to 60 seconds of audio in one second and includes tools for creating custom voices. MAI-Image-2, a video-generating model, had previously launched on March 19 through the company’s MAI Playground and is now being rolled out more broadly.

All three models are being made available through Microsoft Foundry, while the transcription and voice models are also accessible in MAI Playground, a testing environment for large language models.

The release comes from Microsoft’s MAI Superintelligence team, led by Microsoft AI CEO Mustafa Suleyman and established in November 2025. The move reflects Microsoft’s effort to deepen its capabilities across multiple AI modalities while continuing to compete with other major AI developers.

“At Microsoft AI, we’re building Humanist AI. We have a distinct view when creating our AI models — putting humans at the center, optimizing for how people actually communicate, training for practical use,” Suleyman wrote in a blog post. “You’ll see more models from us soon in Foundry and directly in Microsoft products and experiences.”

Microsoft is positioning the models in part on cost, with pricing set below some competing offerings. MAI-Transcribe-1 starts at $0.36 per hour, MAI-Voice-1 at $22 per 1 million characters, and MAI-Image-2 at $5 per 1 million input tokens and $33 per 1 million output tokens.

Despite the internal push, the company said it remains committed to its long-standing relationship with OpenAI, which has included more than $13 billion in investment.

This analysis is based on reporting from TechCrunch.

Image courtesy of Unsplash.

This article was generated with AI assistance and reviewed for accuracy and quality.

Last updated: April 2, 2026

Report Error

About this article: This article was generated with AI assistance and reviewed by our editorial team to ensure it follows our editorial standards for accuracy and independence. We maintain strict fact-checking protocols and cite all sources.

Word count: 329Reading time: 0 minutes

Explore More AI Resources

Continue with high-value guides related to this topic.

Compare AI Models

See ChatGPT, Claude, and Gemini side-by-side in one place.

Best AI Newsletters

Find top AI newsletters and subscribe to ChatAI Daily.

AI FAQ

Quick answers about ChatAI, AI tools, and multi-model chat.

AI Tools

Use free AI tools for summarization, translation, and more.

📧 Stay Updated

Get the latest AI news delivered to your inbox every morning.

Continue Reading

Tesla Brings Robotaxi Service to Miami With Florida Expansion

Tesla has expanded its robotaxi service to Miami, marking the company’s first autonomous ride-hailing launch in Florida and its fourth metropolitan market overall. The company announced the rollout...

July 3, 2026•5 min read

Alibaba Reportedly Cuts Off Anthropic’s Claude Code After Backdoor Claims

Alibaba plans to prohibit employees from using Anthropic’s Claude Code in its workplace environments starting July 10, according to a person familiar with the matter cited by Reuters. Alibaba has not...

July 3, 2026•5 min read

Meta Launches Pocket, a New AI App for Creating Interactive Games With Prompts

Meta has quietly introduced a new AI-powered app called Pocket, a social platform that lets users create, share, and explore interactive AI experiences known as “gizmos.” The app has appeared in...

July 2, 2026•5 min read

Explore All Articles

Microsoft Releases New AI Models for Speech, Audio and Video Generation

Explore More AI Resources

Compare AI Models

Best AI Newsletters

AI FAQ

AI Tools

AI Tools for this Article

Settings

📧 Stay Updated

Related Articles

Tesla Brings Robotaxi Service to Miami With Florida Expansion

Alibaba Reportedly Cuts Off Anthropic’s Claude Code After Backdoor Claims

Meta Launches Pocket, a New AI App for Creating Interactive Games With Prompts

Continue Reading

Tesla Brings Robotaxi Service to Miami With Florida Expansion

Alibaba Reportedly Cuts Off Anthropic’s Claude Code After Backdoor Claims

Meta Launches Pocket, a New AI App for Creating Interactive Games With Prompts

Stay Ahead of AI

Go Premium

Follow Our Community

ChatAI

Go Premium

ChatAI

Follow Our Community