Draw words in sync with audio playback #114

johnwdubois · 2019-01-07T16:59:34Z

Background

Seeing the words of a conversation drawn to the screen at the same time as you are hearing the audio can be useful for visualizing talk. This is key for doing experiments on splitting intonation units in a language you don't know.
The intended effect is as if Rezonator "hears" the words as they are spoken, updating each token automatically to show in black on the main screen.

What to do

Synchronize the drawing of words to the screen with the simultaneous playback of audio. (Call this Sync-Play.)
To visualize Sync-Play, when the user is playing audio for a given unit, change the text color from grey to black for each token as it is played:

All tokens in units with a UnitEnd time earlier than the current playback time are shown in black
All tokens in units with a UnitStart value later than the current playback time are shown in grey
Only the currently playing Unit (or 2 or more overlapping Units; see below) has a mix of black text and grey text, updating dynamically
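
The three rules above amount to a per-unit classification against the current playback time. A minimal sketch, assuming times are in seconds; the names `Unit`, `unit_state`, and `playback_time` are hypothetical and not taken from the Rezonator code:

```python
from dataclasses import dataclass


@dataclass
class Unit:
    unit_start: float  # UnitStart, assumed to be in seconds
    unit_end: float    # UnitEnd, assumed to be in seconds


def unit_state(unit: Unit, playback_time: float) -> str:
    """Classify a unit relative to the current playback position."""
    if unit.unit_end < playback_time:
        return "black"  # unit has finished playing: all tokens black
    if unit.unit_start > playback_time:
        return "grey"   # unit has not started yet: all tokens grey
    return "mixed"      # unit is playing now: tokens switch one by one
```

Several units can be "mixed" at once when their time ranges overlap.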

To get the timestamps needed to sync the drawing of a word with the audio currently being heard, use one of two methods:

Estimate when the word is spoken based on the UnitStart time, UnitEnd time, and number of words in the current unit.
If available, use word-level timestamps provided in the original imported file.
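
One possible way to combine the two methods is sketched below. It assumes that, when no word-level timestamps exist, each word takes an equal share of the unit's duration; the issue does not specify the estimation formula, so the even split is an assumption, as are the names `token_onsets` and `word_times`.

```python
from typing import List, Optional, Sequence


def token_onsets(unit_start: float, unit_end: float, n_tokens: int,
                 word_times: Optional[Sequence[float]] = None) -> List[float]:
    """Return the time at which each token in a unit should turn black."""
    # Prefer word-level timestamps from the imported file when they are
    # present and cover every token.
    if word_times is not None and len(word_times) == n_tokens:
        return list(word_times)
    if n_tokens <= 0:
        return []
    # Assumption: spread the tokens evenly between UnitStart and UnitEnd.
    step = (unit_end - unit_start) / n_tokens
    return [unit_start + i * step for i in range(n_tokens)]
```

For a unit running from 10.0 to 12.0 seconds with four words, this yields onsets at 10.0, 10.5, 11.0, and 11.5 seconds.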

Overlapping speech by two different speakers represents a special challenge, which must be addressed as follows:

Because overlapping words in 2 different units may occur at the same time, both should be updated (switched from grey to black) at the same time.
Each unit should be updated according to its own timeline; so 2 (or more) timelines must be managed at the same time.
(For audio playback, avoid playing the same sound twice)
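
A rough sketch of managing several timelines at once: a single playback clock (so the audio is played only once) is applied to each unit independently, and each unit reports how many of its tokens should currently be black. The even split of a unit's duration across its tokens is an assumption, and `black_token_count` and `sync_state` are hypothetical names.

```python
from typing import Dict, Tuple


def black_token_count(unit_start: float, unit_end: float,
                      n_tokens: int, playback_time: float) -> int:
    """Number of tokens in one unit that should be black at playback_time."""
    if n_tokens <= 0 or playback_time < unit_start:
        return 0
    if playback_time >= unit_end:
        return n_tokens
    # Here unit_start <= playback_time < unit_end, so the duration is positive.
    step = (unit_end - unit_start) / n_tokens  # assumed equal time per token
    return min(n_tokens, int((playback_time - unit_start) / step) + 1)


def sync_state(units: Dict[str, Tuple[float, float, int]],
               playback_time: float) -> Dict[str, int]:
    """Apply one shared playback time to every unit's own timeline."""
    return {name: black_token_count(start, end, n, playback_time)
            for name, (start, end, n) in units.items()}
```

For example, with unit A spanning 10.0 to 12.0 seconds (4 tokens) and unit B spanning 11.0 to 13.0 seconds (2 tokens), a playback time of 11.2 seconds gives 3 black tokens in A and 1 in B, so both units advance during the overlap.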

For Units: instead of using UnitSeq, use UnitStart and UnitEnd.
For tokens: instead of using DocSeq, use UnitStart and UnitEnd, plus the Order value for the token within its Unit.
For tokens: only if no UnitStart and UnitEnd values are available, use UnitSeq, plus the Order value for the token within its Unit.
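
Reading the three rules above as a choice of ordering key for drawing tokens is an interpretation; under that reading, a minimal sketch might look like the following. Tokens are represented as plain dictionaries keyed by the field names used in this issue, and `draw_order` is a hypothetical helper.

```python
from typing import Any, Dict, List


def draw_order(tokens: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Order tokens for drawing, preferring unit times over sequence numbers."""
    timed = all(t.get("UnitStart") is not None and t.get("UnitEnd") is not None
                for t in tokens)
    if timed:
        # Timestamps available: order by unit times, then position in the unit.
        key = lambda t: (t["UnitStart"], t["UnitEnd"], t["Order"])
    else:
        # Fallback: no usable timestamps, so order by UnitSeq and Order.
        key = lambda t: (t["UnitSeq"], t["Order"])
    return sorted(tokens, key=key)
```

Note that with the time-based key, a unit that starts earlier is drawn first even if its UnitSeq is higher.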

If timestamps are available at the word level, consider using those for drawing words.

Resources

johnwdubois · 2022-02-03T20:11:10Z

johnwdubois added the enhancement New feature or request label Jan 7, 2019

johnwdubois assigned terrydubois Jan 7, 2019

johnwdubois mentioned this issue Jan 7, 2019

Audio 1: PlayAudio #112

Closed

johnwdubois changed the title ~~SyncPlay~~ Audio 3: SyncPlay Jan 7, 2019

This was referenced Feb 21, 2019

Audio 2: AutoPlay #116

Open

Rez-play: Dynamic playback of Rez-links #167

Open

johnwdubois mentioned this issue Apr 12, 2019

Rez-Play 2: sync with audio playback #204

Open

johnwdubois mentioned this issue Sep 27, 2019

Audio 1.5: Play from here #374

Open

johnwdubois mentioned this issue Oct 12, 2019

Read mode scrolling option #390

Open

johnwdubois unassigned terrydubois May 8, 2020

johnwdubois mentioned this issue Sep 26, 2020

Spacebar vs. ENTER for audio playback #623

Open

johnwdubois assigned terrydubois Jan 28, 2022

johnwdubois added this to the 1.2 release milestone Jan 28, 2022

johnwdubois changed the title ~~Audio 3: SyncPlay~~ Draw words in sync with audio playback Jan 28, 2022

johnwdubois modified the milestones: 1.2 release, 1.1 release Jan 31, 2022

johnwdubois unassigned terrydubois Jan 31, 2022

johnwdubois added this to Experiment Aug 22, 2023

johnwdubois moved this to To Do in Experiment Aug 22, 2023

johnwdubois added this to Core Aug 22, 2023

johnwdubois moved this to To do in Core Aug 22, 2023
