←back to thread

313 points mariano54 | 1 comments | | HN request time: 0.305s | source

Hey HN, we're Mariano and Anton from ISSEN (https://issen.com), a foreign language voice tutor app that adapts to your interests, goals, and needs.

Demo: https://www.loom.com/share/a78e713d46934857a2dc88aed1bb100d?...

We started this company after struggling to find great tools to practice speaking Japanese and French. Having a tutor can be awesome, but there are downsides: they can be expensive (since you pay by the hour), difficult to schedule, and have a high upfront cost (finding a tutor you like often forces you to cycle through a few that you don’t).

We wanted something that would talk with us — realistically, in full conversations — and actually help us improve. So we built it ourselves. The app relies on a custom voice AI pipeline combining STT (speech-to-text), TTS (text-to-speech), LLMs, long term memory, interruptions, turn-taking, etc. Getting speech-to-text to work well for learners was one of the hardest parts — especially with accents, multi-lingual sentences, and noisy environments. We now combine Gemini Flash, Whisper, Scribe, and GPT-4o-transcribe to minimize errors and keep the conversation flowing.

We didn’t want to focus too much on gamification. In our experience, that leads to users performing well in the app, achieving long streaks and so on, without actually getting fluent in the language you're wanting to learn.

With ISSEN you instantly speak and immerse yourself in the language, which, while not easy, is a much more efficient way to learn.

We combine this with a word bank and SRS flashcards for new words learned in the AI voice chats, which allows very rapid improvement in both vocabulary and speaking skills. We also create custom curriculums for each student based on goals, interests, and preferences, and fully customizable settings like speed, turn taking, formality, etc.

App: https://issen.com (works on web, iOS, Android) Pricing: 20 min free trial, $20–29/month (depending on duration and specific geography)

We’d love your feedback — on the tech, the UX, or what you’d wish from a tool like this. Thanks!

1. csa ◴[] No.44398584[source]
0. Why is the linked video gated behind a login after the first play? This is unnecessary friction, imho.

1. In some of the comments, you say that this app is aimed at “b1 and higher”. That’s fine. But then the demo video shows content that ranges from A1 to possibly up to C1 (the interview, depending on the job). This seems to be cutting the baby in half. There’s a market for both of these types of tasks, but rarely does one person want both.

2. The TTS voice in French is not very good. ChatGPT default is smoother.

3. I will give your app a solid run through later, but in general, I think that having one-click options for threshold level tasks would be wildly popular. For B1, this might be a link to a news article on a current event and structured tasks and discussion about that article. For C1, it might include more tasks on nuance, inference, and cohesion (as a few examples). Many people don’t even realize that their speech lacks certain elements like cohesion throughout a text (written or spoken), so pointing those about as critical features of the threshold proficiency level would be super useful.