Application/Real-time API Does Not Wait for Speaker to Finish Speaking #52

madhubandru · 2024-11-12T21:30:40Z

The application currently identifies short pauses (around 1-2 seconds) during user speech as the end of the speaker's input. As a result, the application prematurely responds based on incomplete sentences or partial questions, leading to incomplete responses or unnecessary follow-up questions.

To improve user experience, we need to extend the wait duration to allow users to complete their thoughts before the application processes the input. This should account for natural pauses in speech to ensure the application only responds once the speaker has truly finished.

Request:

Implement or adjust a configurable delay/wait period after detecting speech pauses.
Ensure that brief pauses do not trigger the end of input, and responses only initiate when a user has likely finished speaking.

Expected Outcome:
The application should respond only after confirming that the user has completed their input, accommodating natural pauses without interruptions.

Any suggestions or discussions on this topic that could provide solutions or enhancements would be greatly appreciated. Thank you in advance!

madhubandru · 2024-11-12T21:31:34Z

Hi, @pamelafox @pablocastro can you please provide some direction on this issue? Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Application/Real-time API Does Not Wait for Speaker to Finish Speaking #52

Application/Real-time API Does Not Wait for Speaker to Finish Speaking #52

madhubandru commented Nov 12, 2024

madhubandru commented Nov 12, 2024 •

edited

Loading

Application/Real-time API Does Not Wait for Speaker to Finish Speaking #52

Application/Real-time API Does Not Wait for Speaker to Finish Speaking #52

Comments

madhubandru commented Nov 12, 2024

madhubandru commented Nov 12, 2024 • edited Loading

madhubandru commented Nov 12, 2024 •

edited

Loading