Skip to content

google-deepmind/librispeech-long

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

LibriSpeech-Long

This is a benchmark dataset for evaluating long-form variants of speech processing tasks such as speech continuation, speech recognition, and text-to-speech synthesis. It is derived from the LibriSpeech dev and test sets, whose utterances are reprocessed into contiguous examples of up to 4 minutes in length (in the manner of LibriLight's cut_by_vad.py script).

Audio, ground-truth transcripts, and duration information for all splits can be downloaded here (3GB).

This is part of a preprint that is work-in-progress; dataset may be subject to change.

Citation

@article{park2024long,
  author       = {Se Jin Park and
                  Julian Salazar and
                  Aren Jansen and
                  Keisuke Kinoshita and
                  Yong Man Ro and
                  R. J. Skerry{-}Ryan},
  title        = {Long-Form Speech Generation with Spoken Language Models},
  journal      = {CoRR},
  year         = {2024}
}

License and disclaimer

Copyright 2024 DeepMind Technologies Limited

The software and materials, except for the underlying LibriSpeech data, are licensed under the Creative Commons Attribution 4.0 International License (CC-BY). You may obtain a copy of the CC-BY license at: https://creativecommons.org/licenses/by/4.0/legalcode, or in the LICENSE file.

The materials contain adapted material from the LibriSpeech dataset. LibriSpeech is also licensed under the Creative Commons Attribution 4.0 International License (CC-BY). You may obtain a copy of the CC-BY license at: https://creativecommons.org/licenses/by/4.0/legalcode, or in the LICENSE file. LibriSpeech is available at https://www.openslr.org/12 and created by Vassil Panayotov, Guoguo Chen, Daniel Povey and Sanjeev Khudanpur, pursuant to the paper “LibriSpeech: an ASR corpus based on public domain audio books", ICASSP 2015 (https://ieeexplore.ieee.org/document/7178964).

Unless required by applicable law or agreed to in writing, all software and materials distributed here under the CC-BY licenses are distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the licenses for the specific language governing permissions and limitations under those licenses.

This is not an official Google product.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published