Transforming Language Learning with AI š
Iām excited to share my journey in creating a voice-first AI language teacher right in your browser, leveraging cutting-edge technologies like:
- Next.js
- Gemini Audio API for Speech-to-Text (STT)
- Gemini Text-to-Speech (TTS)
- Supabase for backend support
š” Vision: A dynamic AI ālanguage universityā rather than just another chatbot. My goal is to empower language learners with personalized, structured instruction.
š Current Challenges:
- Implementing lightweight browser-native lip sync for static avatar images during TTS audio playback.
- Ensuring MediaRecorder reliability on mobile Safari.
- Enhancing voice UX for guided teaching.
Demo & Repository:
Check out the demo here and my code on GitHub.
Iām eager for your technical suggestions and architecture feedbackāletās collaborate to shape the future of language learning!
š Share your thoughts and letās innovate together!
