AI Hacker News

Introducing Koshei AI: A Voice-Driven Language University from A1 to D2 Levels

March 6, 2026

Transforming Language Learning with AI 🌟

I’m excited to share my journey in creating a voice-first AI language teacher right in your browser, leveraging cutting-edge technologies like:

Next.js
Gemini Audio API for Speech-to-Text (STT)
Gemini Text-to-Speech (TTS)
Supabase for backend support

💡 Vision: A dynamic AI “language university” rather than just another chatbot. My goal is to empower language learners with personalized, structured instruction.

🔍 Current Challenges:

Implementing lightweight browser-native lip sync for static avatar images during TTS audio playback.
Ensuring MediaRecorder reliability on mobile Safari.
Enhancing voice UX for guided teaching.

Demo & Repository:
Check out the demo here and my code on GitHub.

I’m eager for your technical suggestions and architecture feedback—let’s collaborate to shape the future of language learning!

👉 Share your thoughts and let’s innovate together!

Source link

{{post_title}}

Introducing Koshei AI: A Voice-Driven Language University from A1 to D2 Levels

NO COMMENTS

LEAVE A REPLY Cancel reply

Loading…

Here are the results for the search: "{{td_search_query}}"

No results!

{{post_title}}

RELATED ARTICLES

Cirrus CI is Closing: Transition to a Scalable, AI-Driven Solution

Sal Khan’s Vision: Rethinking the Impact of AI on Education

Harnessing AI in Intelligent Organizations: Exploring Jevons Paradox and Its Impact...

NO COMMENTS

LEAVE A REPLY Cancel reply