Manual Align

A browser-based tool for manually aligning text to audio, adjusting existing alignments, and editing transcripts while you build a speech dataset.

Install

Download the code and open manual-align.html in any modern browser.

Usage

Loading files

To begin, load an audio file and a TXT or JSON file.

If you use a TXT file, put each line of the transcript on its own line.
A JSON file should have the following format:

[{"Start": start time of this line in milliseconds,
"Stop": end time of this line in milliseconds,
"Text": text of this line},
...]
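
If you already have alignments from another tool and want to adjust them here, they need to be converted to this format first. The following is a minimal Python sketch, assuming your existing alignments are (start, end, text) tuples with times in seconds. The function name segments_to_records and the sample data are hypothetical, and rounding to whole milliseconds is an assumption, not something the tool is known to require.

```python
import json


def segments_to_records(segments):
    """Convert (start_seconds, end_seconds, text) tuples into
    Start/Stop/Text records with times in milliseconds."""
    records = []
    for start_s, end_s, text in segments:
        if end_s < start_s:
            raise ValueError(f"end before start for line: {text!r}")
        records.append({
            "Start": round(start_s * 1000),  # assumption: whole milliseconds
            "Stop": round(end_s * 1000),
            "Text": text,
        })
    return records


# Hypothetical sample data, times in seconds.
segments = [(0.0, 1.25, "Hello there."), (1.4, 3.0, "How are you?")]
records = segments_to_records(segments)

# ensure_ascii=False keeps non-ASCII transcript text readable in the file.
print(json.dumps(records, ensure_ascii=False, indent=2))
```

Writing the printed output to a .json file should give you something that can be loaded alongside the matching audio file.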

After loading, play the audio for a few seconds so that the regions on the waveform are placed at the correct positions.
After editing, click Save to save your edits to a JSON file in the format above.
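
Before using a saved file in a dataset, it may be worth running a quick sanity check on it. The Python sketch below assumes only the Start/Stop/Text format described above. The function find_problems and the sample data are hypothetical, and whether overlapping lines count as a problem depends on your dataset, so treat that check as optional.

```python
import json


def find_problems(records):
    """Return a list of human-readable issues found in Start/Stop/Text records."""
    problems = []
    prev_stop = None
    for i, rec in enumerate(records):
        missing = [k for k in ("Start", "Stop", "Text") if k not in rec]
        if missing:
            problems.append(f"line {i}: missing {', '.join(missing)}")
            continue
        if rec["Stop"] < rec["Start"]:
            problems.append(f"line {i}: Stop before Start")
        # Assumption: lines are stored in time order and should not overlap.
        if prev_stop is not None and rec["Start"] < prev_stop:
            problems.append(f"line {i}: overlaps previous line")
        prev_stop = rec["Stop"]
    return problems


# Hypothetical saved content; in practice, read it from the saved file.
saved = json.loads(
    '[{"Start": 0, "Stop": 1250, "Text": "Hello there."},'
    ' {"Start": 1200, "Stop": 3000, "Text": "How are you?"}]'
)
problems = find_problems(saved)
print(problems)
```

An empty list means none of these checks fired. It does not mean the alignment is accurate, which still has to be judged by ear.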

Keyboard shortcuts

The following keyboard shortcuts can help you work more easily.
ENTER: Play/Pause the audio.
SPACE: Play audio of the current line.
UP: Go to previous line.
DOWN: Go to next line.
LEFT / RIGHT: Move the player cursor backward/forward.
CTRL + LEFT / RIGHT: Skip to the previous/next region boundary.
F: Set Start. D: Shift Start 10ms later. S: Shift Start 10ms earlier.
J: Set End. K: Shift End 10ms earlier. L: Shift End 10ms later.

If you need a reminder of the keyboard shortcuts while editing, click the blue icon next to the title.
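
The 10 ms nudges on D, S, K, and L can be pictured as small edits to the current line's record. The Python sketch below only illustrates that idea and is not the tool's actual implementation. The names NUDGES and apply_nudge are hypothetical, it assumes "End" in the shortcut list corresponds to the "Stop" field in the JSON format, and clamping Start at 0 is an added assumption.

```python
# Key -> (field, shift in milliseconds), following the shortcut list above.
NUDGES = {
    "D": ("Start", +10),  # Shift Start 10 ms later
    "S": ("Start", -10),  # Shift Start 10 ms earlier
    "K": ("Stop", -10),   # Shift End 10 ms earlier
    "L": ("Stop", +10),   # Shift End 10 ms later
}


def apply_nudge(record, key):
    """Return a copy of the record with the nudge for `key` applied."""
    field, delta_ms = NUDGES[key]
    updated = dict(record)
    # Assumption: times never go below 0 ms.
    updated[field] = max(0, record[field] + delta_ms)
    return updated


line = {"Start": 100, "Stop": 500, "Text": "Hello there."}
line = apply_nudge(line, "D")  # Start moves 10 ms later
line = apply_nudge(line, "K")  # End moves 10 ms earlier
print(line)
```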
