Tuesday, August 12, 2025

“Azure AI Speech Capable of Voice Cloning with Just Seconds of Audio” • The Register

Microsoft has updated Azure AI Speech so that voice replication is faster and sounds more lifelike. The personal voice feature, now generally available, uses the upgraded “DragonV2.1Neural” model.

Key Upgrades Include:

  • Zero-shot Text-to-Speech: Replicates a voice from just a few seconds of sample audio, with no per-speaker training.
  • Naturalness and Expressiveness: Produces more realistic speech with improved prosody.
  • Language Variety: Supports audio generation in over 100 languages.
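
As a rough illustration of how a personal voice might be requested, the sketch below builds an SSML payload in Python. It rests on several assumptions not stated in this summary: that the voice is selected with the model name quoted above, that the speaker is identified through an `mstts:ttsembedding` element with a `speakerProfileId` attribute, and that the profile ID shown is a placeholder. The helper name `build_personal_voice_ssml` is hypothetical. Check the current Azure documentation before relying on any of it.

```python
from xml.sax.saxutils import escape, quoteattr


def build_personal_voice_ssml(text, speaker_profile_id,
                              voice_name="DragonV2.1Neural", lang="en-US"):
    """Return an SSML string asking for `text` in a cloned voice.

    Assumptions (not confirmed by the article): the element and
    attribute names, and that `voice_name` matches the model name
    the article quotes.
    """
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="http://www.w3.org/2001/mstts" '
        f'xml:lang={quoteattr(lang)}>'
        f'<voice name={quoteattr(voice_name)}>'
        # The speaker profile ID is a placeholder for an ID created
        # from a consented voice sample.
        f'<mstts:ttsembedding speakerProfileId={quoteattr(speaker_profile_id)}>'
        f'{escape(text)}'  # escape &, <, > so the payload stays valid XML
        '</mstts:ttsembedding></voice></speak>'
    )


ssml = build_personal_voice_ssml("Tom & Jerry say hello.", "abc-123")
print(ssml)
```

The resulting string would then be sent to the speech service by whatever client the application already uses; that step is left out here because it depends on credentials and SDK version.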

Potential Applications:

  • Customizing chatbot voices.
  • Dubbing videos in an original actor’s voice.
  • Personalizing immersive audio experiences.

While these advancements open up new opportunities, they also raise concerns about misuse, such as audio deepfakes. Microsoft says it addresses this with safeguards, including requirements for explicit consent and disclosure of synthetic content.

