streaming-vocos

Streaming Vocos is a wrapper of Vocos. It supports streaming reconstruction of audio from mel-spectrogram or EnCodec tokens.

Usage

From mel-spectrogram

from streaming_vocos import StreamingVocos

audios = []
vocos = StreamingVocos()
features = vocos.feature_extractor(audio)

for feature in torch.unbind(features, dim=2):
    audios += vocos.streaming_decode(feature[:, :, None])
audios.append(vocos.decode_caches())
audios = torch.cat(audios, dim=1)

From EnCodec tokens

from streaming_vocos import StreamingVocos

audios = []
vocos = StreamingVocos()
codes = vocos.get_encodec_codes(audio)

for code in torch.unbind(codes, dim=2):
    audios += vocos.streaming_decode_codes(code[:, :, None])
audios.append(vocos.decode_caches())
audios = torch.cat(audios, dim=1)

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github/workflows		.github/workflows
streaming_vocos		streaming_vocos
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
VERSION		VERSION
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

streaming-vocos

Usage

About

Releases

Packages

Languages

License

dsh54054/streaming-vocos

Folders and files

Latest commit

History

Repository files navigation

streaming-vocos

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages