3 New world class MAI models, available in Foundry (microsoft.ai) AI
Microsoft announced three new MAI models—MAI-Transcribe-1 for speech-to-text, MAI-Voice-1 for voice generation (including custom voice creation), and MAI-Image-2 for image generation—now available in Microsoft Foundry and MAI Playground. The company says MAI-Transcribe-1 targets fast, accurate transcription for the most-used languages, MAI-Voice-1 can generate 60 seconds of audio per second of compute and preserve speaker identity, and MAI-Image-2 delivers faster image generation with similar quality. Microsoft also lists starting prices for each model and notes enterprise controls and red-teaming for safer deployment.
April 05, 2026 21:50
Source: Hacker News