Voice: hybrid streaming TTS with trackbar player #89
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Context
v0.7.1-dev has TTS working (Kokoro local + Groq fallback), pause/play/stop controls, and progress tracking infrastructure in voice.rs. Currently TTS generates audio AFTER the full AI response completes — user waits for entire response before hearing anything.
Goal
Hybrid streaming TTS: hear audio sentence-by-sentence while AI is still responding, then full trackbar with seek/replay after response completes.
Implementation
Phase 1: Sentence-level SSE audio streaming
event: audio(multiple events per response)Phase 2: Full trackbar after response
[⏸] [━━━━●━━━━] 0:12/0:35Phase 3: Polish
Technical notes
.!?\nwith min 20 charsRelated