Trending on Hugging Face: The 40-Second Open-Source Speech Model

NineNineSix has launched Kani TTS 2, an advanced open-source text-to-speech (TTS) model that enhances audio generation length and stability, focusing on high-quality speech AI for underrepresented languages. This version generates up to 40 seconds of continuous speech, more than doubling the previous limit, and is trending on Hugging Face as a top TTS model.

Kani TTS 2 maintains its lightweight architecture while supporting zero-shot voice cloning, allowing developers to replicate speakers’ tones from brief audio samples. The full pretraining code is available, enabling diverse organizations to train TTS systems for various languages, especially low-resource ones.

With 400 million parameters trained on 10,000 hours of speech data, the model is efficient, needing about 3 GB of GPU memory for deployment. This architectural efficiency positions NineNineSix as a key player in democratizing speech AI, addressing the critical issue of language inclusion in AI technologies.

Source link

News

Company:

Join our community of SUBSCRIBERS and be part of the conversation.

AI Revolutionizes Cybersecurity Access: Empowering Defenders with Advanced Tools

Adobe Unveils Firefly AI Assistant, Featuring Enhanced Generative AI and Creative Tools – Moneycontrol.com

IDC MarketScape: Vendor Assessment of Global AI-Driven Enterprise Asset Management Solutions for Asset-Intensive Industries (2025-2026)

Cathay FHC Integrates OpenAI into Group Operations – Embracing Data Science Innovation

SoftBank Issues New Bonds to Refinance Debt and Support OpenAI – Finimize

Cirrus CI is Closing: Transition to a Scalable, AI-Driven Solution

Sal Khan’s Vision: Rethinking the Impact of AI on Education

Harnessing AI in Intelligent Organizations: Exploring Jevons Paradox and Its Impact on the Workforce

Exploiting MCP Servers in AI Systems: The Risk of Tool Modifications Post-Approval

The AI Quandary: Navigating Challenges and Controversies

Trending on Hugging Face: The 40-Second Open-Source Speech Model

Introducing My Browser Idle Game: Train Your Own AI Models!

Emergent Unveils AI Assistant ‘Wingman’ to Transform Coding Startups

Disney and Universal Join Forces to Sue AI Photo Generator Midjourney for Copyright Infringement

Join the Community: sudomake-friends by audiodude on GitHub

Microsoft Develops New Agent Inspired by OpenClaw

Local News

AI Revolutionizes Cybersecurity Access: Empowering Defenders with Advanced Tools

Cirrus CI is Closing: Transition to a Scalable, AI-Driven Solution

Adobe Unveils Firefly AI Assistant, Featuring Enhanced Generative AI and Creative Tools – Moneycontrol.com

Sal Khan’s Vision: Rethinking the Impact of AI on Education

AI Revolutionizes Cybersecurity Access: Empowering Defenders with Advanced Tools

Cirrus CI is Closing: Transition to a Scalable, AI-Driven Solution

Adobe Unveils Firefly AI Assistant, Featuring Enhanced Generative AI and Creative Tools – Moneycontrol.com