You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If calling real time API the STT and TTS phases can be shortcut, because the model itself is capable. The question in that case is how to obtain the user speech's text format for vector indexing
The newest Gemini API audio output sounds like can produce Journey like speech output